Extracting data from existing fields

First, we will learn how to extract data from fields that exist in our dataset in order to generate new fields. For the first exercise, we will read a file containing data about the cost of living in Europe. The content of the file looks like this:

Rank City Cost of Living Index Rent Index Cost of Living Plus Rent Index Groceries Index Restaurant Price Index Local Purchasing Power Index1 Zurich, Switzerland 141.25 66.14 105.03 149.86 135.76 142.702 Geneva, Switzerland 134.83 71.70 104.38 138.98 129.74 130.963 Basel, Switzerland 130.68 49.68 91.61 127.54 127.22 139.014 Bern, Switzerland 128.03 43.57 87.30 132.70 119.48 112.715 Lausanne, Switzerland 127.50 52.32 91.24 126.59 132.12 127.956 Reykjavik, Iceland ...

Get Pentaho Data Integration Quick Start Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.