Downloads: 124 | Views: 290

Research Paper | Computer Science & Engineering | India | Volume 3 Issue 1, January 2014 | Popularity: 6.8 / 10

Evaluation of Similarities Measure in Document Clustering

Hemalatha Immandhi

Abstract: Clustering is a technique of collecting data into subsets in such a manner that identical instances are collected together, at the same time as different instances belong to different groups. The occurrences are thereby organized into an efficient depiction that characterizes the populace being sectioned. Clustering of entities is as earliest as the human need for describing the salient characteristics of mean and objects and identifying them with a style. Consequently, it squeezes a choice of scientific regulations from mathematics and statistics to biology and genetics, the entire of which uses different terms to describe the topologies formed using this analysis. As of biological taxonomies to medical syndromes and genetic genotypes to manufacturing group technology-the problem is same forming groups i. e. cluster text documents that have sparse and high dimensional data objects. Subsequently we originate new clustering criterion functions and corresponding clustering algorithms respectively. Divisive algorithms initiated with just only one cluster that contains all sample data. After that, the single cluster splits into two or more clusters that have higher dissimilarity between them until the number of clusters becomes number of samples or as specified by the user. The most important work is to build up a novel hierarchical algorithm for document clustering which provides maximum efficiency and performance. It is mainly spotlighted in studying and making use of cluster overlapping phenomenon to design cluster merging criteria. Recommending a new method to compute the overlap rate in order to improve time efficiency and the veracity is mainly concentrated. Multi-view learning algorithms characteristically assume a complete bipartite mapping between the different views in order to exchange information during the learning process. The remaining of this paper is ordered.

Keywords: Technology, clustering, Algorithm, data, analysis

Edition: Volume 3 Issue 1, January 2014

Pages: 39 - 41

Make Sure to Disable the Pop-Up Blocker of Web Browser

Text copied to Clipboard!

Hemalatha Immandhi, "Evaluation of Similarities Measure in Document Clustering", International Journal of Science and Research (IJSR), Volume 3 Issue 1, January 2014, pp. 39-41, URL: https://www.ijsr.net/getabstract.php?paperid=02013726, DOI: https://www.doi.org/10.21275/02013726

Downloads: 656 | Views: 2003

Computer Science & Engineering, India, Volume 9 Issue 7, July 2020

Pages: 1454 - 1458

Heart Disease Prediction with Machine Learning Approaches

Megha Kamboj

Downloads: 401 | Views: 720

Computer Science & Engineering, India, Volume 7 Issue 11, November 2018

Pages: 1951 - 1955

Hadoop Performance Improvement using Metadata and Securing with Oauth Token

Swapnali A. Salunkhe, Amol B. Rajmane

Downloads: 386 | Views: 701

Computer Science & Engineering, India, Volume 9 Issue 12, December 2020

Pages: 1 - 3

Comparative Study of Conventional Desktop Computer and Compute Stick

Aadarsh Sooraj, Sooraj G.

Downloads: 354 | Views: 698

Computer Science & Engineering, India, Volume 3 Issue 6, June 2014

Pages: 629 - 632

Review Paper on Secure Hashing Algorithm and Its Variants

Priyanka Vadhera, Bhumika Lall

Downloads: 336 | Views: 688

Computer Science & Engineering, India, Volume 3 Issue 6, June 2014

Pages: 2148 - 2152

The Impact and Application of 3D Printing Technology

Thabiso Peter Mpofu, Cephas Mawere, Macdonald Mukosera