5 Analytics

5.1 Introduction

Data needs to be explored and analysed before decisions can be made. Exploring and summarising the data is sometimes referred to as descriptive analytics, and using the data to anticipate outcomes and support decisions as predictive analytics. This chapter covers comparative tests and cross tabulations, and considers how to detect correlations and patterns in the data that can help in selecting variables for a model.

In data mining, there are special issues when testing features of the data because of the size of the datasets; statistical significance tests have to be interpreted differently. There are also issues when working with subsets sampled from the data, how the subsets are ...
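To illustrate why significance tests need a different reading on large datasets, the following minimal sketch applies a pooled two-proportion z-test to the same small difference in response rates (10.5% versus 10.0%) at two sample sizes. The counts are invented for illustration, and the choice of test and the function name `two_proportion_z_test` are assumptions of this sketch, not taken from the book.

```python
import math


def two_proportion_z_test(successes_a, n_a, successes_b, n_b):
    """Return (z, two-sided p-value) for a pooled two-proportion z-test."""
    p_a = successes_a / n_a
    p_b = successes_b / n_b
    # Pooled proportion under the null hypothesis of equal rates
    pooled = (successes_a + successes_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / std_err
    # Two-sided tail probability of the standard normal distribution
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: the observed rates are identical in both scenarios,
# only the number of cases per group changes.
scenarios = {
    "small": (105, 1_000, 100, 1_000),
    "large": (105_000, 1_000_000, 100_000, 1_000_000),
}

results = {}
for name, counts in scenarios.items():
    z, p_value = two_proportion_z_test(*counts)
    results[name] = (z, p_value)
    print(f"{name}: n per group = {counts[1]}, z = {z:.2f}, p = {p_value:.3g}")
```

With 1,000 cases per group the difference is far from significant at the usual 5% level, while with 1,000,000 cases per group the same half-point difference is overwhelmingly significant. The p-value alone therefore says little about whether a difference matters in practice; on very large datasets the size of the effect needs to be judged alongside it.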
