Dataproc

Dataproc, in simple terms, is Hadoop on cloud. In Dataproc, understand how to create a cluster, different types of clusters, using pre-emptible workers to save money, and many other features.

And how you utilize it will be a sole responsibility of yours.

For Dataproc, you can have Cloud Dataproc, BigQuery, Cloud Storage, Cloud Bigtable, and Compute Engine as an input. While you can have Cloud Dataproc, BigQuery, Cloud Storage, Cloud BigTable, Compute Engine as output.

Get Cloud Analytics with Google Cloud Platform now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.