Extracting delimited fields

As we have seen, some systems store data in a denormalized form and, in the previous section, we saw how to normalize it. In essence, we were turning data held in a single column into multiple rows. With some data, however, we may wish to normalize it not into rows, but into individual columns. For example, suppose a system stores its employee data with the following schema:

[employee_id] | [name]

The name field holds both the last name and the first name of the employee, in the following format:

[last_name], [first_name]

An example file is shown as follows:

[Figure: example file]

Note

Note that the schema does not have three fields, but that ...
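To make the idea concrete, the following is a minimal sketch in plain Python of splitting the name field into two columns. It is independent of Talend and is not the book's implementation. The names `Employee` and `extract_name_fields` are hypothetical, the sample values are invented for illustration, and the sketch assumes the comma is the only delimiter inside the name field.

```python
from typing import NamedTuple


class Employee(NamedTuple):
    """Target schema: the single name field becomes two columns."""
    employee_id: str
    last_name: str
    first_name: str


def extract_name_fields(employee_id: str, name: str, delimiter: str = ",") -> Employee:
    """Split a '[last_name], [first_name]' value into separate columns."""
    # Split on the first delimiter only; anything after it, including any
    # further delimiters, is kept in first_name.
    last_name, _, first_name = name.partition(delimiter)
    # Remove the space that follows the comma in the source format.
    return Employee(employee_id, last_name.strip(), first_name.strip())


# Hypothetical sample values, not taken from the book's example file.
row = extract_name_fields("1", "Smith, John")
print(row)
```

If the name field contains no delimiter at all, this sketch places the whole value in `last_name` and leaves `first_name` empty. Whether that is the right behavior depends on the data, so treat it as an assumption to revisit.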
