International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 116 | Views: 283 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Research Paper | Computer Science & Engineering | India | Volume 3 Issue 6, June 2014 | Popularity: 7 / 10


     

Text Studies Classification of Database of Genotypes and Phenotypes using K-Nearest Neighbor Algorithm

Kolekar Suresh S, Kumbhar Satish S


Abstract: The database of genotypes and phenotypes (dbGaP) is the new database to store and distribute data from studies of genome wide association. dbGaP launch by National Library of Medicine (NLM) which is part of National Institutes of Health (NIH). Searching relevant studies of particular interest accurately and completely is challenging task due to keyword based search method of dbGaP Entrez system. For given queries; the dbGaP retrieval system returns several studies that are unrelated; and it is very difficult to find how particular studies are retrieved and why they come out in a particular sequence. Thus; users have to evaluate every study description carefully to find relevant studies; which is time consuming task. Text mining is emerging research field which enable users to extract useful information from text documents and deals with retrieval; classification; clustering and machine learning techniques to classify different text document. In this research; an empirical approach is proposed and implemented with K-nearest neighbor (KNN) machine learning algorithms to classify dbGaP study text in heart; lung and blood studies. It is evident from results that this text based classification outperforms conventional keyword based search of document retrieval system provided by dbGaP.


Keywords: Bioinformatics, Data Mining, Text Mining, database of Genotypes and Phenotypes


Edition: Volume 3 Issue 6, June 2014


Pages: 1146 - 1149



Make Sure to Disable the Pop-Up Blocker of Web Browser




Text copied to Clipboard!
Kolekar Suresh S, Kumbhar Satish S, "Text Studies Classification of Database of Genotypes and Phenotypes using K-Nearest Neighbor Algorithm", International Journal of Science and Research (IJSR), Volume 3 Issue 6, June 2014, pp. 1146-1149, https://www.ijsr.net/getabstract.php?paperid=2014436, DOI: https://www.doi.org/10.21275/2014436



Similar Articles

Downloads: 0

New Innovation and Idea, Computer Science & Engineering, India, Volume 11 Issue 10, October 2022

Pages: 1009 - 1012

Twin Pairing Algorithm for Longest Common Subsequence

Sathya Narayanan P S

Share this Article

Downloads: 0

Research Paper, Computer Science & Engineering, India, Volume 12 Issue 2, February 2023

Pages: 916 - 919

Sentiment Analysis: A Case Study for Apparel Brands - FABINDIA v/s BIBA

Syed Aqsa Ahmed

Share this Article

Downloads: 1

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 6, June 2014

Pages: 1884 - 1886

Performance Analysis of Clustal W Algorithm on Linux Cluster

Swati Jasrotia, Salam Din

Share this Article

Downloads: 1

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 2205 - 2207

A Survey of Generating Multi-Document Summarizations

Patil Ajita S., P. M. Mane

Share this Article

Downloads: 2 | Weekly Hits: ⮙2 | Monthly Hits: ⮙2

Research Paper, Computer Science & Engineering, India, Volume 12 Issue 6, June 2023

Pages: 2584 - 2586

Innovative Data Mining Techniques for Healthcare and Social Sciences

Ankita Moreshwar Itankar, Vijaya Kamble

Share this Article
Top