Effective and Efficient XML Duplicate Detection Using Levenshtein Distance Algorithm
International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 103 | Views: 367

Research Paper | Computer Science & Engineering | India | Volume 4 Issue 6, June 2015 | Popularity: 7 / 10


     

Effective and Efficient XML Duplicate Detection Using Levenshtein Distance Algorithm

Shital Gaikwad, Nagaraju Bogiri


Abstract: There is big amount of work on discovering duplicates in relational data, merely elite findings concentrate on duplication in additional multifaceted hierarchical structures. Electronic information is one of the key factors in several business operations, applications, and determinations, at the same time as an outcome, guarantee its superiority is necessary. Duplicates are several delegacy of the identical real world thing which is dissimilar from each other. Duplicate finding a little assignment because of the actuality that duplicates are not accurately equivalent, frequently because of the errors in the information. Accordingly, many data processing techniques never apply widespread assessment algorithms which identify precise duplicates. As an alternative, evaluate all objective representations, by means of a probably compound identical approach, to identifying that the object is real world or not. Duplicate detection is applicable in data clean-up and data incorporation applications and which considered comprehensively for relational data or XML document. This paper it is suggested to use Levenshtein distance algorithm which is best and efficient than the previous Normalized Edit Distance (NED) algorithm. This paper will provide the person who reads with the groundwork for research in Duplicate Detection in XML data or Hierarchical Data.


Keywords: duplicate detection, , electronic data, hierarchical data, XML data, XML document


Edition: Volume 4 Issue 6, June 2015


Pages: 2676 - 2680



Make Sure to Disable the Pop-Up Blocker of Web Browser


Text copied to Clipboard!
Shital Gaikwad, Nagaraju Bogiri, "Effective and Efficient XML Duplicate Detection Using Levenshtein Distance Algorithm", International Journal of Science and Research (IJSR), Volume 4 Issue 6, June 2015, pp. 2676-2680, https://www.ijsr.net/getabstract.php?paperid=SUB156040, DOI: https://www.doi.org/10.21275/SUB156040

Similar Articles

Downloads: 95 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Informative Article, Computer Science & Engineering, India, Volume 9 Issue 12, December 2020

Pages: 85 - 88

CBCD Methods in Video Copy Detection

Jan Mary Thomas

Share this Article

Downloads: 102

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 5, May 2015

Pages: 3021 - 3028

Smart Type-Ahead Search in XML

Supriya. N. Chaudhari, Vaishali M. Deshmukh

Share this Article

Downloads: 107 | Weekly Hits: ⮙2 | Monthly Hits: ⮙2

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 2681 - 2688

A Proposed Framework Using Neural Network in Web Mining for Improving the Performance of E-Learning System

Dar Masroof Amin, Atul Garg

Share this Article

Downloads: 110

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 6, June 2014

Pages: 2751 - 2756

User-Friendly Keyword Based Search on XML Data

Jeetendra G.Kapase, Sharmila M. Shinde

Share this Article

Downloads: 111

Research Paper, Computer Science & Engineering, India, Volume 2 Issue 11, November 2013

Pages: 200 - 203

Integration of a City GIS Data with Google Map API and Google Earth API for a Web Based 3D Geospatial Application

Akanbi A. K, Agunbiade O. Y

Share this Article
Top