Fixing data that doesn't match the rules

At the beginning of the Validating Data section in this chapter, you learned to validate a field and discard the rows with invalid values. Now you will learn how to keep those rows instead. You will fix the issue by replacing the invalid product code with the text <invalid>, and then send the corrected rows back to the main stream:

  1. Open the Transformation you created in the Validating Data section and save it with a different name.
  2. After the Write to log step, add a UDJE (User Defined Java Expression) step. Use the step to replace the invalid product code with the text <invalid> and to add two string fields: a flag that indicates there is an error, and a new field named ERR_DESC with the description of the problem: ...
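
The effect of the UDJE step on a single row can be sketched in plain Java, outside of PDI. This is only an illustration of the logic, not the step's actual configuration: the field names product_code and ERR_FLAG, the flag value "Y", and the description text are assumptions made for the example; only ERR_DESC and the replacement text <invalid> come from the instructions above.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative sketch only: models one row of the error stream as a map
// from field name to value. It is not PDI code.
final class InvalidCodeFix {

    // Assumed field names and values (hypothetical, chosen for this example).
    static final String CODE_FIELD = "product_code";   // assumption
    static final String FLAG_FIELD = "ERR_FLAG";       // assumption
    static final String DESC_FIELD = "ERR_DESC";       // named in the text
    static final String REPLACEMENT = "<invalid>";     // named in the text

    // Returns a copy of the row with the product code replaced and the
    // two extra string fields added; the input row is left untouched.
    static Map<String, String> fixRow(Map<String, String> row) {
        Map<String, String> fixed = new LinkedHashMap<>(row);
        fixed.put(CODE_FIELD, REPLACEMENT);
        fixed.put(FLAG_FIELD, "Y");                       // assumed flag value
        fixed.put(DESC_FIELD, "Invalid product code");    // assumed wording
        return fixed;
    }

    public static void main(String[] args) {
        Map<String, String> row = new LinkedHashMap<>();
        row.put(CODE_FIELD, "??-000");   // made-up invalid code
        System.out.println(fixRow(row));
    }
}
```

In the step itself, each of these three assignments would typically be its own entry with its own Java expression, rather than one method as in this sketch.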
