Comparative Studies | Computer Science and Information Technology | India | Volume 12 Issue 8, August 2023
Enhancing Data Accuracy and Efficiency: An Overview of Fuzzy Matching Techniques
Jahnavi Kalluru
Abstract: Fuzzy matching, also known as approximate string matching, is a technique for improving data accuracy and efficiency by identifying and linking strings that are only partially similar. Unlike traditional exact matching, which requires character-by-character agreement, fuzzy matching tolerates typographical errors, misspellings, and spelling variations, allowing a more flexible comparison. This paper presents an overview of fuzzy matching techniques and their applications across diverse domains. We examine the core concepts of several algorithms, including Levenshtein distance, Jaccard similarity, Soundex, and Metaphone, and explore how each method quantifies the similarity between strings. The paper highlights their strengths and use cases in data cleaning, deduplication, information retrieval, natural language processing, record linkage, and named entity recognition.
Keywords: Fuzzy matching, approximate string matching, data accuracy, efficiency, partial similarity
Edition: Volume 12 Issue 8, August 2023
Pages: 685 - 690
DOI: https://www.doi.org/10.21275/SR23805184140
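As an illustration of two of the measures named in the abstract, the following is a minimal Python sketch of Levenshtein distance and Jaccard similarity. It is not taken from the paper: the function names `levenshtein` and `jaccard` are hypothetical, and computing Jaccard similarity over character bigrams (rather than word tokens) is an assumption made here for the example. Soundex and Metaphone are phonetic encodings and are not shown.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn string a into string b."""
    # Keep the shorter string as b so each row of the table stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the first i-1 chars of a
    # and the first j chars of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def jaccard(a: str, b: str, n: int = 2) -> float:
    """Jaccard similarity |A ∩ B| / |A ∪ B| over character n-gram sets.

    Assumption for this sketch: a string shorter than n is treated
    as a single gram consisting of the whole string.
    """
    def grams(s: str) -> set:
        return {s[i:i + n] for i in range(max(len(s) - n + 1, 1))}

    grams_a, grams_b = grams(a), grams(b)
    return len(grams_a & grams_b) / len(grams_a | grams_b)


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
    print(jaccard("night", "nacht"))
```

Levenshtein distance counts edits, so lower values mean closer strings, while Jaccard similarity ranges from 0 (no shared n-grams) to 1 (identical n-gram sets). In practice a threshold on either score decides whether two records are treated as a match.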