Document Classification Using Part of Speech in Text Mining

Sonam Tripathi, Tripti Sharma; Sonam Tripathi, Tripti Sharma

doi:10.21275/NOV152438

Downloads: 105 | Views: 421

Research Paper | Computer Science & Engineering | India | Volume 4 Issue 12, December 2015 | Popularity: 6.8 / 10

Document Classification Using Part of Speech in Text Mining

Sonam Tripathi, Tripti Sharma

Abstract: Text mining is a practice that is used to find beneficial in arrangement from the large amount of data sets. Data mining has guidelines called as frequent pattern and association rule that is important for finding frequent patterns. Text Mining is the detection by computer of new, previously unidentified in arrangement by automatically mining in arrangement from dissimilar written resources. Text mining methods are the fundamental and permitting tools for efficient organization, triangulation, retrieval and summarization of large document quantity. The problem is more often than not decomposed into two sub problems. The first is to find those kind of item sets whose occurrence goes beyond a predefined threshold set in the database, those item sets are describe frequent or large item sets. The second problem is to produce involvement rules from those huge item sets with the restriction of minimal self-confidence. In this work, the text mining is done by dividing the given set of paragraphs into tokens and classifying them accordingly. The techniques are purely composed of sequential pattern mining, closed pattern mining & frequent pattern mining. Hence, the discovered patterns in the field of text mining cannot be used further or again. All frequently used short patterns are not useful here. In this work, an effective pattern taxonomy model & part of speech have been proposed to overcome and solve the problem of low frequency & misinterpretation.

Keywords: Text mining, Association rule, Sequential pattern mining, Closed pattern mining, Frequent pattern mining

Edition: Volume 4 Issue 12, December 2015

Pages: 2004 - 2008

DOI: https://www.doi.org/10.21275/NOV152438

Please Disable the Pop-Up Blocker of Web Browser

Verification Code will appear in 2 Seconds ... Wait

Text copied to Clipboard!

Sonam Tripathi, Tripti Sharma, "Document Classification Using Part of Speech in Text Mining", International Journal of Science and Research (IJSR), Volume 4 Issue 12, December 2015, pp. 2004-2008, https://www.ijsr.net/getabstract.php?paperid=NOV152438, DOI: https://www.doi.org/10.21275/NOV152438