International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 1 | Views: 221 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Informative Article | Science and Technology | India | Volume 8 Issue 5, May 2019 | Popularity: 4.7 / 10


     

Efficient File-Based Data Ingestion for Cloud Analytics: A Framework for Extracting and Converting Non-Traditional Data Sources

Prakash Somasundaram


Abstract: In the rapidly evolving landscape of cloud computing and big data analytics, efficiently processing and analyzing diverse data formats is crucial for business decision-making. This paper introduces a comprehensive framework designed for the efficient ingestion of non-traditional data sources, specifically XML, PDF, and JSON files, into cloud analytics platforms. By converting these varied formats into structured CSV data, the framework significantly simplifies data analysis tasks, enhancing the utility of valuable customer data. Key features include a multi-layered architecture with specialized processing for each data type, a caching system for improved efficiency, and robust concurrency control for maintaining data integrity in multi-user environments. While highly effective in handling diverse data formats, the framework encounters challenges with complex nested structures and dependency on third-party libraries. Future enhancements focus on refining processing algorithms, reducing dependencies, and expanding capabilities for real-time processing and integration with big data platforms. This innovative approach to data ingestion addresses the pressing need for scalability and adaptability in cloud analytics, aligning with the ongoing digital transformation and the increasing reliance on comprehensive data analytics in various industries.


Keywords: Cloud Analytics, Data Ingestion, Data Transformation, Non-Traditional Data Sources, PDF Processing, Scalability, Unstructured Data


Edition: Volume 8 Issue 5, May 2019


Pages: 2223 - 2227



Make Sure to Disable the Pop-Up Blocker of Web Browser




Text copied to Clipboard!
Prakash Somasundaram, "Efficient File-Based Data Ingestion for Cloud Analytics: A Framework for Extracting and Converting Non-Traditional Data Sources", International Journal of Science and Research (IJSR), Volume 8 Issue 5, May 2019, pp. 2223-2227, https://www.ijsr.net/getabstract.php?paperid=SR24213022529



Similar Articles

Downloads: 4 | Weekly Hits: ⮙2 | Monthly Hits: ⮙2

Informative Article, Science and Technology, India, Volume 10 Issue 7, July 2021

Pages: 1523 - 1528

Data Preprocessing in Healthcare: A Vital Step towards Informed Decision-Making

Wasim Fathima Shah

Share this Article

Downloads: 4 | Weekly Hits: ⮙4 | Monthly Hits: ⮙4

Informative Article, Science and Technology, India, Volume 8 Issue 6, June 2019

Pages: 2418 - 2421

Semantic Harmonizer: Matching Algorithm to Validate Data Transformations & Migrations

Mahidhar Mullapudi, Aditya Mamidi, Abhishek Shende

Share this Article

Downloads: 4 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Informative Article, Science and Technology, India, Volume 9 Issue 7, July 2020

Pages: 2004 - 2009

Cloud Storage Strategies for High - Performance Analytics: An In - Depth Look at Databases, Data Warehouses, and Object Storage Solutions

Prakash Somasundaram

Share this Article

Downloads: 4 | Weekly Hits: ⮙2 | Monthly Hits: ⮙2

Informative Article, Science and Technology, India, Volume 8 Issue 10, October 2019

Pages: 1870 - 1871

Unlocking Data Potential: The GCS XML CSV Transformer for Enhanced Accessibility in Google Cloud

Preyaa Atri

Share this Article



Top