International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 117 | Views: 265

Research Paper | Computer Science & Engineering | India | Volume 7 Issue 9, September 2018 | Popularity: 6.3 / 10


     

BigData: A Case Study of Spark Mllib and Hive

Shubhajoy Das


Abstract: The extent to which data is generated has shown a tremendous increase in the past decade because of social networks, sensornetworks, geographicinformationsystems, Financial Institutions, Supply chains. The storage capacity of computers have increased to stay competitive, but a big problem is that the access speeds of the disk has not improved to that extent to be at par with disk space improvement. Big Data comes to the rescue with a framework to analyse massive amounts of data in a distributed environment which is both horizontally and vertically scalable. Data sets with trillions of rows can be analysed very fast to provide valuable insights from data. Cloud service providers such as amazon, Alibaba Cloud have made available robust infrastructure for Big Data. We study Apache Hive, Spark Mllib in profiling a Stack Overflow Dataset and Collaborative Filtering algorithm in Spark Mllib for movie recommendations.


Keywords: BigData, SparkMllib, Collaborative Filtering, Hadoop, Spark, Apache, Hive, Amazon aws, HDFS


Edition: Volume 7 Issue 9, September 2018


Pages: 865 - 868



Make Sure to Disable the Pop-Up Blocker of Web Browser




Text copied to Clipboard!
Shubhajoy Das, "BigData: A Case Study of Spark Mllib and Hive", International Journal of Science and Research (IJSR), Volume 7 Issue 9, September 2018, pp. 865-868, https://www.ijsr.net/getabstract.php?paperid=ART20191358, DOI: https://www.doi.org/10.21275/ART20191358



Similar Articles

Downloads: 2 | Weekly Hits: ⮙1 | Monthly Hits: ⮙2

Analysis Study Research Paper, Computer Science & Engineering, India, Volume 10 Issue 9, September 2021

Pages: 1793 - 1802

Event - Driven Architecture: Building Responsive and Scalable Systems

Venkata Naga Sai Kiran Challa

Share this Article

Downloads: 2 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Survey Paper, Computer Science & Engineering, United States of America, Volume 13 Issue 4, April 2024

Pages: 1730 - 1734

A Comparative Analysis of Popular Distributed Key-Value Stores

Ramprasad Chinthekindi, Shyam Burkule, Ashok Kumar Chintakindi

Share this Article

Downloads: 2 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Review Papers, Computer Science & Engineering, United States of America, Volume 13 Issue 7, July 2024

Pages: 653 - 655

Event Driven Data Architecture: Design and Implementation with Kinesis and Spark Streaming

Arjun Mantri

Share this Article

Downloads: 107

Research Paper, Computer Science & Engineering, India, Volume 8 Issue 9, September 2019

Pages: 937 - 938

Updating XML Files using a Tool based on DOM Parser

Nehal Pandey, Deepak Pase, Priyanka Chaudhari

Share this Article

Downloads: 108

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 4 Issue 8, August 2015

Pages: 1424 - 1430

Hadoop Distributed File System and Map Reduce Processing on Multi-Node Cluster

Dr. G. Venkata Rami Reddy, CH. V. V. N. Srikanth Kumar

Share this Article



Top