A Competitive Clustering on Bigdata

N. Narasimha Swamy, Anita Kumari Singh; N. Narasimha Swamy, Anita Kumari Singh

doi:10.21275/NOV151162

Downloads: 114 | Views: 396

M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 4 Issue 11, November 2015 | Popularity: 6.7 / 10

A Competitive Clustering on Bigdata

N. Narasimha Swamy, Anita Kumari Singh

Abstract: The vast increase in the volume of data has created a need for new applications and algorithms to quickly analyse the large scale of data. Many of the cluster analysis techniques like K-Means are used to compute the data in distributed systems, its accuracy depends on the initial seeding of centroids. The improvisation of K-Means algorithm shows good initial seeding but it suffers with the serial nature i. e. , it takes long time on large data sets. In this paper we propose a new algorithm with Map Reduce implementation to address the above problems. Our algorithm provides parallel processing of data by dividing it into number of subsets using Hadoop with MapReducing methods. Our work provides good initial centers in a less time and also produces fast and accurate cluster analysis on large scale data.

Keywords: K-Means, K-Means++, MapReduce, Mahout, HDFS

Edition: Volume 4 Issue 11, November 2015

Pages: 812 - 814

Please Disable the Pop-Up Blocker of Web Browser

Verification Code will appear in 2 Seconds ... Wait

Text copied to Clipboard!

N. Narasimha Swamy, Anita Kumari Singh, "A Competitive Clustering on Bigdata", International Journal of Science and Research (IJSR), Volume 4 Issue 11, November 2015, pp. 812-814, https://www.ijsr.net/getabstract.php?paperid=NOV151162, DOI: https://www.doi.org/10.21275/NOV151162