International Journal of Science and Research (IJSR)

Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Informative Article | Information Technology | India | Volume 9 Issue 5, May 2020

Ensuring Data Integrity in Big Data Ingestion: Techniques and Best Practices for Data Quality Assurance

Sree Sandhya Kona


Abstract: In the era of big data, the quality of data ingested into analytical systems profoundly affects the accuracy of insights and the efficacy of decision-making processes. Ensuring high-quality data during the ingestion phase is crucial, yet it presents significant challenges, including the handling of inaccurate, inconsistent, and incomplete information. This article examines the fundamental techniques and best practices for data quality assurance in big data ingestion. It explores essential strategies across three main areas: data validation, data cleansing, and data enrichment. The data validation techniques discussed include both pre-ingestion and post-ingestion checks, such as schema validation and anomaly detection. For data cleansing, we address methods for identifying and correcting errors, including data imputation and systematic error correction. The article also highlights data enrichment strategies that enhance the utility and context of the ingested data, such as data merging and augmentation. We further examine the role of automated tools in integrating these practices into data pipelines and the importance of continuous monitoring and feedback mechanisms in sustaining data integrity. Through a combination of theoretical frameworks and real-world case studies, this article aims to provide a comprehensive guide to improving data quality in big data projects, thus supporting more reliable and insightful business analytics.
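
As a rough illustration of two of the techniques named in the abstract, the sketch below pairs a pre-ingestion schema check with median imputation of missing values. It is not taken from the article: the `SCHEMA` mapping, the field names `id` and `amount`, and the helper names `validate` and `impute_median` are all hypothetical, and a production pipeline would more likely rely on a dedicated validation library.

```python
from statistics import median

# Hypothetical schema (an assumption for illustration): field name -> expected type.
SCHEMA = {"id": int, "amount": float}


def validate(record, schema=SCHEMA):
    """Return a list of schema violations for one record (empty if valid)."""
    errors = []
    for field, expected in schema.items():
        value = record.get(field)
        if value is None:
            errors.append(f"{field}: missing")
        elif not isinstance(value, expected):
            errors.append(f"{field}: expected {expected.__name__}")
    return errors


def impute_median(records, field):
    """Return copies of the records with missing `field` values set to the observed median."""
    observed = [r[field] for r in records if r.get(field) is not None]
    fill = median(observed)
    return [
        {**r, field: r[field] if r.get(field) is not None else fill}
        for r in records
    ]


rows = [
    {"id": 1, "amount": 10.0},
    {"id": 2, "amount": None},
    {"id": 3, "amount": 30.0},
]
cleaned = impute_median(rows, "amount")
```

Here `validate(rows[1])` reports the missing `amount` before ingestion, while `impute_median` fills that gap with the median of the observed values. Median imputation is only one possible choice; whether to impute, reject, or quarantine such records depends on the downstream use.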


Keywords: Data Quality Assurance, Data Validation, Data Cleansing, Data Enrichment, Schema Validation, Anomaly Detection, Data Imputation, Data Merging, Data Augmentation, Automated Data Tools, Business Intelligence


Edition: Volume 9 Issue 5, May 2020


Pages: 1866 - 1869



Sree Sandhya Kona, "Ensuring Data Integrity in Big Data Ingestion: Techniques and Best Practices for Data Quality Assurance", International Journal of Science and Research (IJSR), Volume 9 Issue 5, May 2020, pp. 1866-1869, https://www.ijsr.net/getabstract.php?paperid=SR24522140238, DOI: https://www.doi.org/10.21275/SR24522140238



Similar Articles

Review Papers, Information Technology, United States of America, Volume 13 Issue 6, June 2024

Pages: 1741 - 1747

Risk Management and Compliance with Business Intelligence in Banking

Pranay Mungara


Research Paper, Information Technology, India, Volume 13 Issue 3, March 2024

Pages: 1943 - 1946

Leveraging Machine Learning for Personalization and Security in Content Management Systems

Venkata Sai Swaroop Reddy Nallapa Reddy


Research Paper, Information Technology, United States of America, Volume 13 Issue 10, October 2024

Pages: 663 - 665

Healthcare Data Warehouses Empowered ML to Detect Anomalies

Arun Kumar Ramachandran Sumangala Devi


Review Papers, Information Technology, United States of America, Volume 13 Issue 11, November 2024

Pages: 1553 - 1558

Advancing Healthcare Data Exchange: The Role of AI and Cloud Analytics in Data Products

Venkateswara Siva Kishore Kancharla


Research Paper, Information Technology, India, Volume 12 Issue 3, March 2023

Pages: 1855 - 1863

Data Integration: AI-Driven Approaches to Streamline Data Integration from Various Sources

Muneer Ahmed Salamkar
