Model evaluation

After estimating our models as described in the preceding section, we need to evaluate them against our client's criteria. Depending on the outcome, we either move on to explaining the results or return to an earlier stage to refine our predictive models.

In this section, we first use confusion matrix numbers to assess the quality of fit of our models, and then expand to other statistics.

As always, we calculate these measures on our test data rather than on the training data.

Confusion matrix

In R, we can produce the model's performance indices with the following code:

model$confusion
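Not every fitted model object exposes a `confusion` component, so it can help to see how the same table is built by hand from test-set predictions. The following is a minimal base R sketch, not the book's own code: the vectors `actual` and `score` are made-up illustrative values standing in for test labels and predicted probabilities, and the cutting point of 0.5 is an assumption.

```r
# Hypothetical held-out test data (illustrative values only):
# true class labels and the model's predicted probabilities.
actual <- c(1, 0, 1, 1, 0, 0, 1, 0, 1, 0)
score  <- c(0.91, 0.20, 0.65, 0.40, 0.55, 0.10, 0.80, 0.30, 0.70, 0.45)

# Assumed cutting point: scores at or above it are classified as 1.
cutoff <- 0.5
predicted <- as.integer(score >= cutoff)

# Fix the factor levels so the table is always 2 x 2,
# even if one class is never predicted.
cm <- table(
  actual    = factor(actual, levels = c(0, 1)),
  predicted = factor(predicted, levels = c(0, 1))
)
print(cm)

# Cell counts, indexed as cm[actual, predicted].
tp <- cm["1", "1"]  # true positives
fn <- cm["1", "0"]  # false negatives
fp <- cm["0", "1"]  # false positives
tn <- cm["0", "0"]  # true negatives

# Summary statistics derived from the confusion matrix.
accuracy  <- (tp + tn) / sum(cm)
precision <- tp / (tp + fp)
recall    <- tp / (tp + fn)
```

Changing `cutoff` and rebuilding `cm` shows how the choice of cutting point trades false positives against false negatives.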

Once a cutting point is determined, the following confusion matrix is produced, which ...
