Cloud Dataproc

Cloud Dataproc gives us the power of Hadoop and Spark as a managed service. Using Cloud Dataproc, we can create our own Hadoop and Spark clusters, with all the prominent services installed automatically. Cloud Dataproc is also well integrated with other GCP components.

When to use:

Cloud Dataproc is a good fit when you already have an established application running on Hadoop, but your data is growing and your demand for resources varies. If you need features such as autoscaling and elastic capacity, and you want to avoid the overhead of Hadoop administration tasks, you can go with Cloud Dataproc.

Special features:

  • Resizable cluster
  • Automated cluster management
  • Automatic or manual configuration
  • Versioning, which lets you switch between versions of Hadoop and Spark
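
To make the resizing and versioning features above more concrete, here is a minimal sketch in Python that only builds (and prints) the gcloud command lines one might use to create a cluster on a chosen image version and later change its worker count. It does not call GCP. The helper names create_cluster_cmd and resize_cluster_cmd, the cluster name demo-cluster, the region us-central1, and the image version 2.1-debian11 are all illustrative assumptions, not values from this book.

```python
import shlex


def create_cluster_cmd(name, region, num_workers, image_version):
    """Build the gcloud argument list that creates a Dataproc cluster.

    The image version determines which Hadoop and Spark releases are installed.
    """
    return [
        "gcloud", "dataproc", "clusters", "create", name,
        f"--region={region}",
        f"--num-workers={num_workers}",
        f"--image-version={image_version}",
    ]


def resize_cluster_cmd(name, region, num_workers):
    """Build the gcloud argument list that resizes an existing cluster."""
    return [
        "gcloud", "dataproc", "clusters", "update", name,
        f"--region={region}",
        f"--num-workers={num_workers}",
    ]


if __name__ == "__main__":
    # Placeholder values; substitute your own cluster name, region, and version.
    print(shlex.join(create_cluster_cmd("demo-cluster", "us-central1", 2, "2.1-debian11")))
    print(shlex.join(resize_cluster_cmd("demo-cluster", "us-central1", 5)))
```

Keeping the commands as argument lists makes them easy to inspect before running, for example by passing one to subprocess.run once the values have been reviewed.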

Costing:

Please refer ...
