Generating all possible pairs formed from two datasets

This is a quick recipe that teaches you how to do a Cartesian product between datasets. A Cartesian product is created by taking all rows from one dataset, all rows from another dataset, and generating a new dataset with all the possible combinations of rows.

This particular recipe is, in fact, the implementation of the Community Acronym Generator (CAG) as proposed by Nicholas Goodman (@nagoodman) on Twitter:

@webdetails @pmalves @josvandongen How about CAG? Community Acronym Generator? A project to generate new acronyms for community projects?!

There are already several community projects around Pentaho such as CDF (Community Dashboard Framework), CDE (Community Dashboard Editor), or CDA (Community ...

Get Pentaho Data Integration Cookbook Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.