Downloads: 1 | Views: 221 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1
Informative Article | Science and Technology | India | Volume 8 Issue 5, May 2019 | Popularity: 4.7 / 10
Efficient File-Based Data Ingestion for Cloud Analytics: A Framework for Extracting and Converting Non-Traditional Data Sources
Prakash Somasundaram
Abstract: In the rapidly evolving landscape of cloud computing and big data analytics, efficiently processing and analyzing diverse data formats is crucial for business decision-making. This paper introduces a comprehensive framework designed for the efficient ingestion of non-traditional data sources, specifically XML, PDF, and JSON files, into cloud analytics platforms. By converting these varied formats into structured CSV data, the framework significantly simplifies data analysis tasks, enhancing the utility of valuable customer data. Key features include a multi-layered architecture with specialized processing for each data type, a caching system for improved efficiency, and robust concurrency control for maintaining data integrity in multi-user environments. While highly effective in handling diverse data formats, the framework encounters challenges with complex nested structures and dependency on third-party libraries. Future enhancements focus on refining processing algorithms, reducing dependencies, and expanding capabilities for real-time processing and integration with big data platforms. This innovative approach to data ingestion addresses the pressing need for scalability and adaptability in cloud analytics, aligning with the ongoing digital transformation and the increasing reliance on comprehensive data analytics in various industries.
Keywords: Cloud Analytics, Data Ingestion, Data Transformation, Non-Traditional Data Sources, PDF Processing, Scalability, Unstructured Data
Edition: Volume 8 Issue 5, May 2019
Pages: 2223 - 2227
Make Sure to Disable the Pop-Up Blocker of Web Browser
Similar Articles
Downloads: 4 | Weekly Hits: ⮙2 | Monthly Hits: ⮙2
Informative Article, Science and Technology, India, Volume 10 Issue 7, July 2021
Pages: 1523 - 1528Data Preprocessing in Healthcare: A Vital Step towards Informed Decision-Making
Wasim Fathima Shah
Downloads: 4 | Weekly Hits: ⮙4 | Monthly Hits: ⮙4
Informative Article, Science and Technology, India, Volume 8 Issue 6, June 2019
Pages: 2418 - 2421Semantic Harmonizer: Matching Algorithm to Validate Data Transformations & Migrations
Mahidhar Mullapudi, Aditya Mamidi, Abhishek Shende
Downloads: 4 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1
Informative Article, Science and Technology, India, Volume 9 Issue 7, July 2020
Pages: 2004 - 2009Cloud Storage Strategies for High - Performance Analytics: An In - Depth Look at Databases, Data Warehouses, and Object Storage Solutions
Prakash Somasundaram
Downloads: 4 | Weekly Hits: ⮙2 | Monthly Hits: ⮙2
Informative Article, Science and Technology, India, Volume 8 Issue 10, October 2019
Pages: 1870 - 1871Unlocking Data Potential: The GCS XML CSV Transformer for Enhanced Accessibility in Google Cloud
Preyaa Atri