Best practices for statistics

Statistics are an integral part of any predictive modelling assignment. Statistics are important because they help us gauge the efficiency of a model. Each predictive model generates a set of statistics, which suggests how good the model is and how the model can be fine-tuned to perform better. The following is a summary of the most widely reported statistics and their desired values for the predictive models described in this book:

Algorithms

Statistics/Parameter

The desired value of statistics

Linear regression

R2, p-values, F-statistic, and Adj. R2

High Adj. R2, low F-statistic, and low p-value

Logistic regression

Sensitivity, specificity, Area Under the Curve (AUC), and KS statistic

High AUC (proximity ...

Get Learning Predictive Analytics with Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.