Defining a folder structure for the project

There is no general rule regarding the folder structure for a PDI project, but in general terms, we suggest a structure that is easy to understand, maintain, and also port when it's time to take the project to a different environment. The following is a suggested list of folders that you should have:

  • Input files
  • Output files
  • Temporary files
  • Log files
  • Database scripts
  • Properties files
  • Scripts (bat / sh files)
  • PDI files (jobs and transformations)

Then, in the kettle.properties file, you could add variables pointing to each of these folders, for example:

INPUT_FOLDER = /home/pentaho/inputOUTPUT_FOLDER = /home/pentaho/outputTEMP_FOLDER=/tmp...PROJECT_PATH=/home/pentaho/my/project

Finally, in your ...

Get Learning Pentaho Data Integration 8 CE - Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.