Downloads: 123 | Views: 302
M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 3 Issue 3, March 2014 | Rating: 6.2 / 10
Fast and Accurate Incremental Entity Relationships
Rajeshkumar S, Geofrin Shirly S
Abstract: Entity resolution (ER) is the problem of identifying which records in a database refer to the same entity. This project investigates how we can maximize the progress of ER with a limited amount of work using hints, which give information on records that are likely to refer to the same real-world entity. This project introduces a family of techniques for constructing hints efficiently and techniques for using the hints to maximize the number of matching records identified using a limited amount of work. Using real data sets, this project illustrates the potential gains of our pay-as-you-go approach compared to running ER without using hints.
Keywords: Entity resolution, data cleaning
Edition: Volume 3 Issue 3, March 2014,
Pages: 286 - 291