Data processing through seven transformations

Amazon ML offers the following seven transformations. Four transformations for text variables are as follows:

  • Lowercase transformation
  • Remove punctuation transformation
  • N-gram transformation
  • Orthogonal sparse bigram (OSB) transformation
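To make the four text transformations above concrete, here is a minimal Python sketch of what each one does to a string. It illustrates the ideas only and is not Amazon ML's implementation: the function names, whitespace tokenization, and exact output format are assumptions made for this example, and the service's own tokenization and edge-case handling may differ.

```python
import string


def lowercase(text):
    # Lowercase transformation: "Amazon ML" -> "amazon ml"
    return text.lower()


def remove_punctuation(text):
    # Strip punctuation from the start and end of each whitespace-separated
    # token; punctuation inside a word (e.g. a hyphen) is kept.
    tokens = (tok.strip(string.punctuation) for tok in text.split())
    return " ".join(tok for tok in tokens if tok)


def ngrams(text, n):
    # Slide windows of 1..n words over the text (assumed behavior for this
    # sketch: all window sizes up to n are emitted).
    tokens = text.split()
    return [
        " ".join(tokens[i:i + size])
        for size in range(1, n + 1)
        for i in range(len(tokens) - size + 1)
    ]


def osb(text, size):
    # Orthogonal sparse bigrams: pair each word with each of the next
    # (size - 1) words; the number of underscores encodes the distance.
    tokens = text.split()
    pairs = []
    for i, first in enumerate(tokens):
        for gap in range(1, size):
            if i + gap >= len(tokens):
                break
            pairs.append(first + "_" * gap + tokens[i + gap])
    return pairs


if __name__ == "__main__":
    print(lowercase("Amazon ML"))
    print(remove_punctuation("Welcome to AML - please fasten your seat-belts!"))
    print(ngrams("the quick brown", 2))
    print(osb("the quick brown fox", 3))
```

In this sketch, OSB differs from the n-gram transformation in that it can pair words that are not adjacent, while still recording how far apart they were.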

Two transformations for numeric variables are as follows:

  • Normalization transformation
  • Quantile binning transformation
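The two numeric transformations above can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than Amazon ML's implementation: normalization is taken here to mean rescaling to zero mean and unit variance (using the population standard deviation), and the way bin boundaries are chosen in `quantile_bin` is a simple rank-based rule picked for this example. Both function names are hypothetical.

```python
import bisect
import statistics


def normalize(values):
    # Rescale to zero mean and unit variance (z-score).
    # Assumption: population standard deviation is used.
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


def quantile_bin(values, n_bins):
    # Choose cut points so that each bin holds roughly the same number of
    # observations, then map each value to the index of its bin.
    ordered = sorted(values)
    cuts = [ordered[(len(ordered) * k) // n_bins] for k in range(1, n_bins)]
    return [bisect.bisect_right(cuts, v) for v in values]


if __name__ == "__main__":
    print(normalize([1, 2, 3]))
    print(quantile_bin([5, 1, 9, 3, 7, 11], 3))
```

Note that quantile binning turns a numeric variable into a categorical one: the output is a bin label, not a rescaled number, which lets a linear model pick up non-linear effects of the original variable.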

One transformation combines two or more text or categorical variables:

  • Cartesian product transformation
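As a rough illustration of the Cartesian product transformation, the following Python sketch combines the tokens of two variables into every possible pair. The function name, the whitespace tokenization, and the underscore separator are assumptions made for this example; they are not taken from Amazon ML's implementation.

```python
from itertools import product


def cartesian(left, right):
    # Split each input into tokens and emit every left/right combination
    # as a single new categorical value.
    return [f"{a}_{b}" for a, b in product(left.split(), right.split())]


if __name__ == "__main__":
    # A text variable crossed with a categorical variable
    print(cartesian("red wine", "France"))
```

The combined feature lets a model learn interactions between the two variables, for example that a word matters only within a particular category.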

These transformations are explained in detail in the Amazon ML documentation (http://docs.aws.amazon.com/machine-learning/latest/dg/data-transformations-reference.html).
