Cloud Dataproc provides us with the power of Hadoop and Spark. Using Cloud Dataproc, we can build our own Hadoop and Spark servers, with all the prominent services auto installed. Cloud Dataproc is well integrated with other GCP components.
When to use:
Cloud Dataproc can be used when you have an established application on Hadoop. But now as your data is growing and demand in resources is varying, you need features such as auto scaling, elastic load balancer, and no overhead of Hadoop Admin tasks—in such scenarios you can go with Cloud Dataproc.
Special features:
- Resizable cluster
- Automated cluster management
- Configurations automatic or manual
- Versioning helps you in changing versions in Hadoop and Spark
Costing:
Please refer ...