Downloads: 113 | Views: 250
Research Paper | Computer Science & Engineering | India | Volume 4 Issue 12, December 2015 | Popularity: 6.4 / 10
Text Clustering With Using Side Information
Shubhangi V. Airekar, Dhanshree S. Kulkurni
Abstract: Side information is present along with many text mining application. This side information may be provenance information, any links in the document, web logs which contain user access behavior, the links for any document or any other non textual attributes which are embedded into the text document. All these attributes may contain a large amount of information for clustering purposes. But it is difficult to calculate the concerned importance of this side information especially when some of the data is noisy. In that situation, it is risky to merge side information into the mining process because it can enhance the quality of the representation for the mining process or can add noise in this system. Thus, there should be a proper way to do this mining process so that it will make use of side information to maximize their advantages. Therefore, it is recommended to design an efficient algorithm which makes combination of classical portioning algorithm with probabilistic models in order to create an effective clustering approach.
Keywords: Data Mining, clustering, text mining, classifier information, text collection
Edition: Volume 4 Issue 12, December 2015
Pages: 1420 - 1423
DOI: https://www.doi.org/10.21275/NOV152287
Make Sure to Disable the Pop-Up Blocker of Web Browser
Similar Articles
Downloads: 0
Student Project, Computer Science & Engineering, India, Volume 11 Issue 6, June 2022
Pages: 1875 - 1880Microclustering with Outlier Detection for DADC
Aswathy Priya M.
Downloads: 0
Research Paper, Computer Science & Engineering, India, Volume 12 Issue 2, February 2023
Pages: 916 - 919Sentiment Analysis: A Case Study for Apparel Brands - FABINDIA v/s BIBA
Syed Aqsa Ahmed
Downloads: 1
Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014
Pages: 2205 - 2207A Survey of Generating Multi-Document Summarizations
Patil Ajita S., P. M. Mane
Downloads: 1 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1
Analysis Study Research Paper, Computer Science & Engineering, India, Volume 12 Issue 11, November 2023
Pages: 1840 - 1846Analysis of Placement for Electronics and Communication Engineering Students using Multiple Clustering
Dr. Dola Sanjay S
Downloads: 1 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1
Analysis Study Research Paper, Computer Science & Engineering, India, Volume 13 Issue 1, January 2024
Pages: 805 - 811Predicting the Energy Efficiency in Wireless Sensor Networks using LSTM and Random Forest Method
Aruna Reddy H., Shivamurthy G., Rajanna M.