International Journal of Science and Research (IJSR)

Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064



M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 5 Issue 6, June 2016 | Popularity: 7.1 / 10


     

Document Clustering using Improved K-means Algorithm

Anjali Vashist, Rajender Nath


Abstract: Clustering is an efficient technique that organizes a large quantity of unordered text documents into a small number of meaningful and coherent clusters, thereby providing a basis for intuitive and informative navigation and browsing mechanisms. It has been studied extensively because of its wide application in areas such as web mining, search engines, and information extraction. Documents are clustered based on various similarity measures. The existing K-means document clustering algorithm relies on random center generation, so the clusters it generates differ from run to run. This paper presents an Improved Document Clustering algorithm that generates a number of clusters for any set of text documents based on fixed center generation, collects only the exclusive words from the different documents in the dataset, and uses the cosine similarity measure to place similar documents in the proper clusters. Experimental results show that the proposed algorithm outperforms the existing algorithm in terms of F-measure, recall, precision, and time complexity.


Keywords: Document Clustering, Cosine Similarity, Term Finder, Tf-Idf, Threshold
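
The general idea described in the abstract (TF-IDF weighting, cosine similarity, and deterministic rather than random initial centers) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not say how the fixed centers are chosen, so the sketch assumes the seeds are the documents at evenly spaced positions in the input. The exclusive-word collection and threshold steps named in the abstract and keywords are not reproduced, and all function names (tfidf_vectors, cosine, mean_vector, cluster_fixed_centers) are hypothetical.

```python
import math
import re
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document."""
    tokenized = [re.findall(r"[a-z]+", d.lower()) for d in docs]
    n = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(t for toks in tokenized for t in set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        total = max(len(toks), 1)  # guard against empty documents
        # Smoothed IDF so terms present in every document keep a nonzero weight.
        vectors.append({
            t: (c / total) * (math.log((1 + n) / (1 + df[t])) + 1.0)
            for t, c in tf.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of sparse vectors."""
    total = Counter()
    for v in vectors:
        for t, w in v.items():
            total[t] += w
    return {t: w / len(vectors) for t, w in total.items()}


def cluster_fixed_centers(docs, k, max_iter=20):
    """K-means-style clustering with deterministic seeds (requires 1 <= k <= len(docs))."""
    vectors = tfidf_vectors(docs)
    n = len(vectors)
    # ASSUMPTION (not from the paper): seed centers are the documents at
    # evenly spaced positions, so repeated runs start from the same centers.
    centers = [dict(vectors[i * n // k]) for i in range(k)]
    labels = None
    for _ in range(max_iter):
        # Assign each document to the most similar center (ties go to the lowest index).
        new_labels = [max(range(k), key=lambda c: cosine(v, centers[c]))
                      for v in vectors]
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # keep the previous center if a cluster is empty
                centers[c] = mean_vector(members)
    return labels


docs = [
    "cats and dogs are pets",
    "dogs and cats play together",
    "stocks and bonds are investments",
    "investors trade stocks and bonds",
]
labels = cluster_fixed_centers(docs, k=2)
```

Because the seeds depend only on the order of the input documents, calling cluster_fixed_centers twice on the same list returns the same labels, which is the property the abstract contrasts with random center generation.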


Edition: Volume 5 Issue 6, June 2016


Pages: 2206 - 2210







Anjali Vashist, Rajender Nath, "Document Clustering using Improved K-means Algorithm", International Journal of Science and Research (IJSR), Volume 5 Issue 6, June 2016, pp. 2206-2210, https://www.ijsr.net/getabstract.php?paperid=NOV164735, DOI: https://www.doi.org/10.21275/NOV164735


