Downloads: 126 | Views: 243 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1
M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 3 Issue 6, June 2014 | Rating: 6.3 / 10
Efficient Text Clustering for Distributed Network
Chithra Purushothaman | Lakshmi S [35]
Abstract: Text clustering is an important technique for improving the quality of information retrieval in both centralized and distributed environment. Most of the existing text clustering algorithms are designed for central execution; which are not work well on highly distributed environment. In this paper; an algorithm called probabilistic text clustering for distributed network such as peer to peer network is proposed. This algorithm achieves high scalability for assigning documents to clusters. It enables a peer to compare each of its documents only with very few selected clusters; maintain cluster quality.
Keywords: text clustering, k- means, p2p network, DHT, centroid
Edition: Volume 3 Issue 6, June 2014,
Pages: 362 - 365