Transforming the Dataset

There are occasions when your dataset does not have the structure you want or need. The solution does not always lie in changing or adding fields or in filtering rows. Sometimes it requires looking at neighboring rows (those preceding or following the current one) or altering the dataset as a whole. This chapter explains techniques for implementing this behavior and transforming the entire dataset, for example, by aggregating data or normalizing pivoted tables.

The topics covered will be as follows:

  • Sorting data
  • Working on groups of rows
  • Converting rows to columns (denormalizing)
  • Converting columns to rows (normalizing)
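
To make these four operations concrete before seeing them in the tool, the following is a minimal sketch in plain Python. It is not Pentaho Data Integration code and does not reflect how PDI implements any step; the sample rows and the field names (product, quarter, sales) are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from itertools import groupby
from operator import itemgetter

# Hypothetical sample data: one row per (product, quarter) pair.
rows = [
    {"product": "B", "quarter": "Q2", "sales": 7},
    {"product": "A", "quarter": "Q1", "sales": 10},
    {"product": "B", "quarter": "Q1", "sales": 5},
    {"product": "A", "quarter": "Q2", "sales": 12},
]

# Sorting data: order the rows by product, then by quarter.
sorted_rows = sorted(rows, key=itemgetter("product", "quarter"))

# Working on groups of rows: total sales per product.
# itertools.groupby only merges adjacent rows, so it needs sorted input.
totals = {
    product: sum(r["sales"] for r in group)
    for product, group in groupby(sorted_rows, key=itemgetter("product"))
}

# Denormalizing: turn the quarter rows into columns, one row per product.
pivot = defaultdict(dict)
for r in sorted_rows:
    pivot[r["product"]][r["quarter"]] = r["sales"]
denormalized = [{"product": p, **cols} for p, cols in pivot.items()]

# Normalizing: turn the quarter columns back into rows.
normalized = [
    {"product": d["product"], "quarter": q, "sales": s}
    for d in denormalized
    for q, s in d.items()
    if q != "product"
]

print(totals)
print(denormalized)
print(normalized)
```

In this sketch, normalizing the denormalized rows gives back the sorted rows, which shows that the two conversions are inverses of each other for this data.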
