Building Hadoop cluster

So, the data is currently stored in a Data Lake across multiple GCP services such as Cloud Storage, Cloud Bigtable, Cloud Bigquery, and so on. We also require a Hadoop cluster to perform some tasks. So, we can use Cloud Dataproc to start our own Hadoop cluster and utilize the Hadoop ecosystem.

Get Cloud Analytics with Google Cloud Platform now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.