Downloads: 113
Research Paper | Computer Science & Engineering | India | Volume 5 Issue 5, May 2016
Text Categorization using Jaccard Coefficient for Text Messages
Ankita Jadhao | Dr. A. J. Agrawal [3]
Abstract: There is wide growth in web application and electronic documents in day to day which needs automatic text classification of documents. Proper Classification methods provide the good results of the experiment and gives proper direction to the further processing of the text. The text is e-documents, news report, blogs, messages, comments on social media, e-books, web content etc which required text mining to extract meaningful knowledge from it. Some natural language techniques and machine learning algorithm are good to get the meaning of that e-document and classify them. There are lots of techniques are there for classification of the text documents, this paper is to understand different techniques and highlight the important methodology among them and helpful to selecting the classification technique which is appropriate to the text-classification process. And detail implementation of one of this method to classify the text message in two categories according the terms found in it. The coming text message is suspicious or not. In this case the Jaccard coefficient method gives the best result to classify message according to the words found in it. Text classification processes include several steps such as feature selection, vector representation and learning algorithm.
Keywords: Document Classification, Natural Language processing, Information retrieval, Text mining
Edition: Volume 5 Issue 5, May 2016,
Pages: 2046 - 2050
Similar Articles with Keyword 'Document Classification'
Downloads: 105
Research Paper, Computer Science & Engineering, India, Volume 4 Issue 12, December 2015
Pages: 2004 - 2008Document Classification Using Part of Speech in Text Mining
Sonam Tripathi | Tripti Sharma [2]
Downloads: 123
M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 5, May 2016
Pages: 751 - 754Performance Evaluation of Cluster Based Algorithm used for Text Document Classification
Rohit S. Patil | Manish Bhardwaj