Available versions and supported components of Cloud DataProc

The latest version of Cloud Dataproc is 1.2, which was released in July 2017 and is now the default version for new cluster setup. This latest version supports the following components:

  • Apache Spark 2.2.0
  • Apache Hadoop 2.8.2
  • Apache Pig 0.16.0
  • Apache Hive 2.1.1
  • Google Cloud Storage Connector 1.6.3 – Hadoop 2
  • BigQuery connector 0.10.4 – Hadoop 2

These versions keep on getting updated by GCP's Cloud Dataproc team.

Get Cloud Analytics with Google Cloud Platform now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.