Designing and developing reusable jobs and transformations

Besides applying the different practices already explained, there is one more valuable factor while you design ETL processes: reusability. This feature not only benefits your current project, but also saves development time if you end up using your work in other projects. There are a couple of ways to design and develop reusable jobs and transformations:

  • Make heavy use of variables and named parameters.
  • If you identify tasks that are going to be used in several situations, create sub-transformations.
  • If you identify patterns to manipulate data, use metadata injection.
  • Avoid overloading your transformations. If a Transformation does much more than a specific task, think of splitting ...

Get Learning Pentaho Data Integration 8 CE - Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.