O'Reilly logo

Pentaho Data Integration Cookbook Second Edition by María Carina Roldán, Adrián Sergio Pulvirenti, Alex Meadows

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Generating sample data for testing purposes

Having sample data to test your transformations is very useful and allows you to move faster through your development and testing process. There are several cases where you will want to generate sample data, for example:

  • To quickly populate datasets with random data
  • Manually generate specific information
  • Generate large volumes of custom data

Take a subset from a large volume of data. In this recipe you will learn how to generate a dataset with 100 random rows in different formats (integer, string, and dates). Then, in the There's more section, you will find alternative solutions for generating data for testing.

How to do it...

Carry out the following steps:

  1. Create a new transformation.
  2. Drop a Generate rows

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required