Dataflow

As you already know by now, Dataflow is GCP's managed service for running autoscaling data processing pipelines. You can ingest, transform, and load data in a single pipeline. Dataflow pipelines are written with Apache Beam, so prior experience with Beam is a plus. Both Java and Python are well supported. Experiment with aggregation, combine, and group-by operations.
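To get a feel for what group by and combine do before trying them in a real pipeline, here is a minimal sketch in plain Python. It does not use the Beam SDK. The helper names `group_by_key` and `combine_per_key` and the `sales` data are hypothetical, chosen only to mirror the idea behind Beam's `GroupByKey` and `CombinePerKey` transforms on a small in-memory list.

```python
from collections import defaultdict

def group_by_key(pairs):
    """Collect all values that share a key, like a group-by step."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)

def combine_per_key(pairs, combine_fn):
    """Group values by key, then reduce each group to a single value."""
    return {key: combine_fn(values)
            for key, values in group_by_key(pairs).items()}

# Hypothetical sample data: (category, amount) pairs.
sales = [("books", 12), ("games", 30), ("books", 8), ("games", 5), ("music", 7)]

grouped = group_by_key(sales)          # every amount, listed per category
totals = combine_per_key(sales, sum)   # aggregation: total per category
largest = combine_per_key(sales, max)  # aggregation: largest sale per category

print(grouped)
print(totals)
print(largest)
```

In an actual Dataflow job the same steps would be expressed as Beam transforms over a distributed collection rather than a Python list, and the service would spread the grouping and combining across workers.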
