Chapter 4. Data Preparation – Construct

In this chapter, we will cover:

  • Building transformations with multiple Derive nodes
  • Calculating and comparing conversion rates
  • Grouping categorical values
  • Transforming high skew or kurtosis variables using a multiple Derive node
  • Creating flag variables for aggregation
  • Using Association Rules for interaction detection/feature creation
  • Creating time-aligned cohorts

Introduction

This chapter will focus on the Construct subtask of CRISP-DM's data preparation phase. The CRISP-DM document describes it as follows:

"This task includes constructive data preparation operations such as the production of derived attributes, entire new records, or transformed values for existing attributes."

Of all the subtasks in CRISP-DM, the Construct ...
