Problems

We will now discuss the first task: building a Data Lake. Chapter 4, Ingestion and Storing, and Chapter 5, Processing and Visualizing, have already introduced the tools that will help us build a Data Lake.

The main purpose of a Data Lake is to store a huge amount of data in one place, so that we can pull data from it as required for different use cases across the organization.

As we build the Data Lake, we need to understand the different types of data that will be dumped into it. All three Vs need to be considered: volume, velocity, and variety. Data, as we know, comes in the form of files, images, and emails. It can also come in the form of server logs from many servers and ...
