International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064




Downloads: 4 | Views: 71 | Weekly Hits: ⮙2 | Monthly Hits: ⮙4

Informative Article | Data & Knowledge Engineering | India | Volume 8 Issue 1, January 2019 | Rating: 4.9 / 10


Designing Data Schema and Formats for Efficient Storage and Processing within Hadoop

Fasihuddin Mirza [6]


Abstract: Designing and optimizing data schemas and formats is essential for harnessing the full potential of Hadoop, a powerful platform for storage and processing of large - scale data. However, organizations often face difficulties in determining the most suitable configurations that maximize performance, scalability, and compatibility. This paper addresses the challenges in selecting file formats, designing efficient schemas, optimizing storage techniques, enhancing data processing efficiency, and utilizing the appropriate tools and frameworks. Through comprehensive research and analysis, this study aims to provide guidance to organizations seeking to maximize the efficiency of their Hadoop implementations. By addressing these challenges, organizations can achieve efficient storage, faster processing, improved query performance, and enhanced overall performance within the Hadoop ecosystem.


Keywords: Hadoop, data schema, data formats, optimization, performance, scalability, compatibility, file formats, schema design, storage optimization, data processing efficiency, tools and frameworks


Edition: Volume 8 Issue 1, January 2019,


Pages: 2258 - 2261



How to Download this Article?

Type Your Valid Email Address below to Receive the Article PDF Link


Verification Code will appear in 2 Seconds ... Wait

Top