International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064

ser



Downloads: 106 | Views: 175

M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 5 Issue 5, May 2016 | Rating: 6.8 / 10


Text Document Annotation and Retrieval Based on Content of the Document and Query Workload

Arunima P V | Ravinarayana B [2]


Abstract: Performing search and retrieval in large collection of textual data is a complex task. For effective searching of text documents, annotations are used. Annotations are the tags, s, attribute-value pairs, comments or summary that are attached to a document or a part of the document. Annotations can be considered as a structured representation of unstructured data. Since manually annotating each document in a large collection is not feasible, automatic annotation techniques are used. In this work, an automatic annotation generation technique based on the content of the document and query workload is introduced. Annotations are generated in the form of attribute-value pairs as they are more expressive than simple annotations. The system generates both attributes and values for a document by analyzing the content of the document, annotations of the existing documents and query workload. These annotations are later used during the search and retrieval process for matching with the queries given by the user. As an enhancement to the system, a new ranking method for ranking the retrieved documents is also introduced.


Keywords: Attribute-Value pair Annotation, Annotation Generation, Result Ranking, Query Workload


Edition: Volume 5 Issue 5, May 2016,


Pages: 2398 - 2403


How to Download this Article?

Type Your Valid Email Address below to Receive the Article PDF Link


Verification Code will appear in 2 Seconds ... Wait

Top