International Journal of Science and Research (IJSR)

Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064





Research Paper | Computer Science & Engineering | India | Volume 4 Issue 10, October 2015 | Rating: 6.4 / 10


Performance Analysis of Multi-Node Hadoop Clusters using Amazon EC2 Instances

Ruchi Mittal | Ruhi Bagga


Abstract: Hadoop, an open-source implementation of the MapReduce model, is an effective tool for handling, processing, and analyzing the unstructured data generated by cloud applications. Hadoop assumes that the nodes in a cluster are homogeneous in processing capability. In real-world applications, however, the nodes in a cluster are heterogeneous in processing capability, and in such cases Hadoop does not yield effective performance. In this paper, we evaluate and analyze the performance of the WordCount MapReduce application using Hadoop on Amazon EC2 with different Ubuntu instances. Performance is evaluated on both single-node and multi-node clusters, where the multi-node clusters include both homogeneous and heterogeneous configurations. Performance is measured as the execution time of the application on different file sizes.
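
For readers unfamiliar with the workload, the following is a minimal single-process sketch of the WordCount map and reduce steps, with the execution time measured per run. It is an illustration only, not the authors' Hadoop job: the function names (map_phase, reduce_phase, word_count, timed_word_count) are hypothetical, and it runs in plain Python rather than on a Hadoop cluster or EC2 instance.

```python
import time
from collections import Counter
from itertools import chain


def map_phase(line):
    """Emit a (word, 1) pair for every whitespace-separated token."""
    return [(word.lower(), 1) for word in line.split()]


def reduce_phase(pairs):
    """Sum the counts emitted for each distinct word."""
    totals = Counter()
    for word, count in pairs:
        totals[word] += count
    return dict(totals)


def word_count(lines):
    """Run the map step on each line, then reduce all emitted pairs."""
    return reduce_phase(chain.from_iterable(map_phase(line) for line in lines))


def timed_word_count(lines):
    """Return (counts, elapsed_seconds) for a single run."""
    start = time.perf_counter()
    counts = word_count(lines)
    return counts, time.perf_counter() - start


if __name__ == "__main__":
    # Hypothetical sample input; the paper uses input files of different sizes.
    sample = ["Hadoop runs MapReduce", "hadoop on EC2"]
    counts, elapsed = timed_word_count(sample)
    print(counts, f"{elapsed:.6f} s")
```

Repeating the timed run on inputs of increasing size gives the kind of execution-time-versus-file-size comparison the abstract describes, although a real Hadoop job also incurs scheduling, shuffle, and I/O costs that this sketch does not model.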


Keywords: Cloud Computing, Hadoop, MapReduce, Multi-node cluster


Edition: Volume 4 Issue 10, October 2015


Pages: 1646 - 1650


