International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 4 | Views: 267 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Informative Article | Computer Science and Information Technology | India | Volume 10 Issue 9, September 2021 | Popularity: 5.5 / 10


     

A Survey of Text Clustering Techniques: Algorithms, Applications, and Challenges

Akshata Upadhye


Abstract: Text clustering is one of the fundamental tasks in natural language processing which involves grouping similar documents together based on their content thus enabling efficient organization and analysis of textual data. In this paper we provide a comprehensive survey of text clustering techniques, its applications, challenges, and future directions. We begin by discussing the fundamentals of text clustering, including key concepts such as similarity measures, text feature representations, and clustering algorithms. We also explore popular text clustering algorithms such as K-means, hierarchical clustering, density- based clustering, spectral clustering, affinity propagation, and Latent Dirichlet Allocation (LDA) popularly used for topic modelling. For every algorithm we discuss its methodology, strengths, limitations, and parameter tuning considerations. We also dive deep into the real-world applications of text clustering across diverse domains, including document organization, information retrieval, text summarization, sentiment analysis, and recommendation systems and highlight their effectiveness with case studies and examples. We also identify several challenges and open research questions in text clustering, such as scalability, handling high-dimensional data, incorporating domain specific knowledge in clustering, evaluation metrics, and integration with other NLP tasks such NER, classification, etc. Finally, we propose potential future directions for research to address these challenges in order to advance the field of text clustering. In conclusion, text clustering continues to be an interesting area of research with immense potential for applications in various domains which helps drive innovation in natural language processing.


Keywords: Text clustering, natural language processing, clustering algorithms, document organization, sentiment analysis, scalability


Edition: Volume 10 Issue 9, September 2021


Pages: 1749 - 1752


DOI: https://www.doi.org/10.21275/SR24304163737



Make Sure to Disable the Pop-Up Blocker of Web Browser




Text copied to Clipboard!
Akshata Upadhye, "A Survey of Text Clustering Techniques: Algorithms, Applications, and Challenges", International Journal of Science and Research (IJSR), Volume 10 Issue 9, September 2021, pp. 1749-1752, https://www.ijsr.net/getabstract.php?paperid=SR24304163737, DOI: https://www.doi.org/10.21275/SR24304163737



Similar Articles

Downloads: 0

Research Paper, Computer Science and Information Technology, India, Volume 12 Issue 3, March 2023

Pages: 900 - 903

Web Mining

Sunkara Nagasivaanjaneya Reddy, R. Nagarjuna Yadav, Alka Choksi

Share this Article

Downloads: 0

Research Paper, Computer Science and Information Technology, Saudi Arabia, Volume 12 Issue 6, June 2023

Pages: 2755 - 2761

Graph-based Model for Keyphrases Extraction from Arabic Text (GMKE)

Amirah Al Shammari, Abdullah Al Ghamdi

Share this Article

Downloads: 0

Research Paper, Computer Science and Information Technology, India, Volume 13 Issue 4, April 2024

Pages: 1295 - 1298

Real-Time Data Processing and Analysis in MIS: Challenges and Solutions

Dhruv K Singhal

Share this Article

Downloads: 0

Informative Article, Computer Science and Information Technology, India, Volume 12 Issue 12, December 2023

Pages: 2132 - 2134

A Guide to DynamoDB Global Tables for Effective Multi-Region Replication

Krishna Mohan Pitchikala

Share this Article

Downloads: 0

Research Paper, Computer Science and Information Technology, United States of America, Volume 11 Issue 8, August 2022

Pages: 1539 - 1542

Streamlining Infrastructure as Code in Azure DevOps: Automation Strategies for Scalability

Satheesh Reddy Gopireddy

Share this Article
Top