Loading a fact table by overwriting snapshot data

Now suppose that you want to track individual orders. In the fact table, you have columns where you want to save the date in which the order was in each state. In this case, when you take the picture, you want to not only insert new orders into the fact table, but also update the status in the orders that already were saved in the fact in a previous load.

The way we do this is with an Insert/Update step. As an alternative and for better performance, you can do as follows:

  1. In the fact, define a primary key as the order id.
  2. Update the fact table with a Table output step. The primary key will cause the orders already in the fact table to be rejected by the Table output.
  3. Capture the error in ...

Get Learning Pentaho Data Integration 8 CE - Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.