CHAPTER 4 Statistics and Sampling

STATISTICS INVOLVES THE STUDY of research designs, the collection of the data, describing the data, analyzing the data, and then forming a conclusion. We are interested mainly in the analysis of data that has already been collected for us by various business systems. We hope to be able to arrive at various conclusions after analyzing the data.

Understanding some basic statistics allows you to understand the makeup or distribution of your data files. This is especially useful when your data file is large and contains millions of records.

There are various types of statistical analysis, but the two major categories are descriptive statistics and inferential statistics.

inlinedbox DESCRIPTIVE STATISTICS

Descriptive statistics is where you describe information from the data set. It is used to summarize the data. Where the data have categories, they can be summarized in each group as to frequency or as a percentage that is a relative frequency. With numerical data, we determine the middle of the data or spread of how close or far the numbers are from that middle. We can determine ranges and possibly determine relationships between two variables. Data can also be summarized to ranges.

There are two main types of data: categorical (qualitative data) or numerical (quantitative data). Categorical data in a record describes qualities or characteristics of the ...

Get Fraud and Fraud Detection: A Data Analytics Approach, + Website now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.