O'Reilly logo

Learning Cascading by Victoria Loewengart, Michael Covert

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Understanding how Cascading controls data flow

Now, we know what the Cascading record looks like. How do we process these records? How do we move and manipulate data? Cascading provides us with the concept of pipes. Pipes control how data is managed during the processing segment.

Using pipes

Pipes are things that do stuff. The Cascading API allows the developer to assemble pipe assemblies that split, merge, group, or join streams. As data moves through pipes, streams may be separated or combined for various purposes:

Using pipes

Figure 2.3 – Pipe definition

Some pipes, such as Merge, GroupBy, and the Join classes, perform single actions on entire Tuple streams. ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required