Downloads: 115 | Views: 271
Survey Paper | Computer Science & Engineering | India | Volume 5 Issue 7, July 2016 | Popularity: 6.4 / 10
Survey Paper on Data Lake
Surabhi D Hegde, Ravinarayana B
Abstract: One of the key driving forces behind the problem of Big Data is the rapid growth of unstructured data, which constitutes huge percentage of overall data [1]. The Big Data is not only about massive data capture and storage, but intelligently combining the past data that already exists inside an organization with the unstructured data. For an organization to be really successful to meet the latent benefits of Big Data, it needs the perfect technology in place to acquire the data, store it, combine it and enrich huge volumes of unstructured data in raw format. It should also have the ability to perform analytics, real-time, near-real-time analysis, batch processing on these huge volumes of data. To address these businesses needs efficiently, the concept of Data Lake is proposed. It is one of the empowering data capture and processing capability for Big Data analysis. Data Lake makes it possible to store all types of data irrespective of their schema and the formats. Data Lake is a massive, easily accessible, flexible enough and scalable large data repository.
Keywords: Big Data, Big Data analytics, Data Warehouse, Data Lake
Edition: Volume 5 Issue 7, July 2016
Pages: 1718 - 1720
Make Sure to Disable the Pop-Up Blocker of Web Browser