Standardizing information

In the Looking up data section in Chapter 6, Controlling the Flow of Data, some countries appeared to be missing from the countries.xml file. In fact, the countries were there, but under different names. For example, Russia in our file is Russian Federation in the XML file. To improve the solution, we should standardize the information and stick to a single name for each country.
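Outside of PDI, the idea of standardizing names before a lookup can be sketched in a few lines of Python. This is only an illustration of the concept, not the way the Transformation is built: the dictionary, the function name, and the sample rows are hypothetical, and only the Russia/Russian Federation pair comes from the text. It assumes the names used in the XML file are treated as the standard ones.

```python
# Hypothetical mapping from the names in our file to the names in countries.xml.
# Only the Russia entry comes from the text; a real mapping would list every mismatch.
COUNTRY_NAME_MAP = {
    "Russia": "Russian Federation",
}


def standardize_country(name: str) -> str:
    """Return the standard name, or the trimmed input when no mapping exists."""
    cleaned = name.strip()
    return COUNTRY_NAME_MAP.get(cleaned, cleaned)


# Illustrative rows (made-up sample data, not taken from the book's files).
rows = [{"country": "Russia"}, {"country": "Italy"}]

# Replace each country with its standard name before doing the lookup.
standardized = [
    {**row, "country": standardize_country(row["country"])} for row in rows
]
```

Names without an entry in the mapping pass through unchanged, so only the known mismatches need to be listed.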

Modify the Transformation that looks for the language in the following way:

  1. Open the Transformation created in Chapter 6, Controlling the Flow of Data, and save it with a new name.
  2. Delete the hop that links your main stream with the Lookup Stream step.
  3. After reading your data, add a Value Mapper step and use it to get the standard name ...
