Merging a lookup table

Nominal variables with many categories pose several potential problems. First, fields with a large number of categories can significantly increase processing time. Second, such fields often contain categories with very few cases, which can be problematic (for example, those cases may behave like outliers or simply be difficult to interpret). Third, certain models might not use these fields at all (see the following screenshot). Finally, a field with a large number of categories might not capture the underlying characteristic that is really of interest. Many new users of Modeler don't realize that many algorithms automatically transform nominal variables behind the scenes. Within the General Setting in Stream Properties ...
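
As a rough illustration of the idea in this recipe's title, the sketch below merges a small lookup table onto a nominal field so that many detailed categories collapse into a few broader groups. It is plain Python rather than a Modeler stream, and the field names (`occupation`, `occupation_group`), the lookup values, and the helper functions are all hypothetical, chosen only for the example.

```python
from collections import Counter

# Hypothetical records: one nominal field ("occupation") with many categories.
records = [
    {"customer_id": 1, "occupation": "nurse"},
    {"customer_id": 2, "occupation": "nurse"},
    {"customer_id": 3, "occupation": "surgeon"},
    {"customer_id": 4, "occupation": "teacher"},
    {"customer_id": 5, "occupation": "professor"},
    {"customer_id": 6, "occupation": "welder"},
    {"customer_id": 7, "occupation": "astronaut"},
]

# Hypothetical lookup table: detailed category -> broader group.
lookup = {
    "nurse": "healthcare",
    "surgeon": "healthcare",
    "teacher": "education",
    "professor": "education",
    "welder": "trades",
}


def merge_lookup(rows, table, key, new_field, default="other"):
    """Return copies of rows with new_field filled in from table via row[key].

    Categories missing from the lookup table fall back to `default`.
    """
    return [{**row, new_field: table.get(row[key], default)} for row in rows]


def rare_categories(rows, field, min_count):
    """Return the sorted categories of `field` that have fewer than min_count cases."""
    counts = Counter(row[field] for row in rows)
    return sorted(cat for cat, n in counts.items() if n < min_count)


merged = merge_lookup(records, lookup, "occupation", "occupation_group")

# Compare how many categories have fewer than two cases before and after.
rare_before = rare_categories(records, "occupation", 2)
rare_after = rare_categories(merged, "occupation_group", 2)
print(len(rare_before), "sparse categories before,", len(rare_after), "after")
```

In this toy data the merged field has fewer distinct values and fewer sparsely populated ones than the original, which is the kind of reduction the problems listed above call for. How the groups are defined is a modeling decision that the lookup table merely records.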
