M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 4 Issue 4, April 2015 | Popularity: 6.3 / 10
Enhancing the Hadoop Performance through Data Placement in Heterogeneous Hadoop Cluster
A Ankita Poovaiah, Gopal B
Abstract: Large volumes of data are generated today, and these records must be retained for future use. Storing and working with data at this scale is difficult. Hadoop is a tool that makes such data easier to store, access, and process. Hadoop is built around the concept of a cluster, in which many small nodes work together. When the nodes have varying configurations (for example, different RAM sizes or processors), they form a heterogeneous cluster. Data placement in a heterogeneous cluster is complicated, but a suitable placement technique helps use resources efficiently and, when combined with the MapReduce programming model, improves performance. Data placement can be done by forming racks. In this work we enhance the performance of Hadoop in a heterogeneous cluster by first creating racks for data placement and then modifying certain parameters of the Hadoop tool. The techniques are implemented and evaluated in Hadoop 1.0.3.
Keywords: Big Data, Hadoop, Heterogeneous Cluster, Data Placement
Edition: Volume 4 Issue 4, April 2015
Pages: 3150 - 3153
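Illustrative note: the abstract says that racks are created for data placement but does not say how nodes are assigned to racks. The sketch below is one possible illustration, not the authors' method. It assumes nodes are grouped into racks by hardware capability and that the mapping is supplied to Hadoop through a rack-awareness topology script (in Hadoop 1.x this is usually configured with the `topology.script.file.name` property). All addresses, rack names, and the `resolve_racks` helper are hypothetical.

```python
#!/usr/bin/env python3
"""Hypothetical rack-mapping script for Hadoop rack awareness (a sketch)."""
import sys

# Assumed example cluster: addresses and rack names are illustrative only.
RACK_BY_NODE = {
    "192.168.1.11": "/rack-high",  # assumed: nodes with more RAM or faster CPUs
    "192.168.1.12": "/rack-high",
    "192.168.1.21": "/rack-low",   # assumed: nodes with less RAM or slower CPUs
    "192.168.1.22": "/rack-low",
}

# Rack reported for any node that is not listed in the mapping above.
DEFAULT_RACK = "/default-rack"


def resolve_racks(nodes):
    """Return one rack path per requested node, in the same order."""
    return [RACK_BY_NODE.get(node, DEFAULT_RACK) for node in nodes]


if __name__ == "__main__":
    # Hadoop invokes the script with one or more node addresses as arguments
    # and reads the corresponding rack paths from standard output.
    print(" ".join(resolve_racks(sys.argv[1:])))
```

Invoked as `python3 topology.py 192.168.1.11 192.168.1.21`, the script would report one rack per address. Which nodes belong in which rack, and which Hadoop parameters to tune afterwards, are the choices the paper itself evaluates.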