International Journal of Science and Research (IJSR)

Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Review Papers | Computer Science & Engineering | India | Volume 3 Issue 10, October 2014 | Rating: 6.3 / 10

Text Clustering and Classification on the Use of Side Information

Shilpa S. Raut, Prof. V. B. Maral

Abstract: In many text mining applications, side information accompanies the text documents. Examples include user-access behavior recorded in web logs, links within a document, document provenance information, and other non-textual attributes embedded in the document. Such attributes can carry a large amount of information that is useful for clustering. However, their relative importance is difficult to estimate, particularly when some of the side information is noisy. Incorporating side information into the mining process is therefore risky: it may improve the quality of the representation, or it may add noise to the process. A principled way of carrying out the mining process is needed, one that maximizes the advantages gained from using side information. In this paper, an algorithm is designed to provide effective clustering in this setting. The algorithm combines classical partitioning algorithms with probabilistic models, and the paper then shows how to extend the approach to the classification problem.

Keywords: clustering, classifiers information, text mining, text collection, clustering methods

Edition: Volume 3 Issue 10, October 2014

Pages: 2135 - 2136

