Google Cloud Dataflow

Google Cloud Dataflow is a fully managed service for creating data pipelines that transform, enrich, and analyze data in batch and streaming modes. Google Cloud Dataflow extracts useful information from data, reducing operating costs without the hassle of implementing, maintaining, or resizing the data infrastructure.

A pipeline is a set of data processing elements connected in series, in which the output of one element is the input of the next. The data pipeline is implemented to increase throughput, which is the number of instructions executed in a given amount of time, parallelizing the processing flows of multiple instructions.

By appropriately defining a process management flow, significant resources can be saved ...

Get Hands-On Machine Learning on Google Cloud Platform now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.