Sorting data

Sorting the dataset is a very useful and common task. Sorting is really easy to do in PDI, and we will demonstrate it with a simple transformation. We will take the files of the surveys that we used in the previous chapter, and we will sort the data by neighborhood and room_type columns, and then by the reviews column in descending order. In order to do this, go through the following steps:

  1. Open any of the transformations created in the last chapter that read files with surveys. Save the transformation with a different name.
  2. Drag a Sort rows step from the Transform folder and create a hop from the Text file input toward this new step.
  3. Double-click the step and configure it as shown in the following screenshot:
Configuring ...

Get Pentaho Data Integration Quick Start Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.