Descriptive statistics

In the previous section, we learnt how distributions are formed. In this section, we will learn how to describe them using descriptive statistics. Two properties of a distribution are particularly useful for describing it: its location and its spread.

Measures of location

A measure of location is a single value that describes where the center of the data lies. The three most common measures of location are mean, median, and mode.

Mean

The mean, also known as the average, is by far the most widely used measure of central tendency. For a sample and a population alike, the mean is the sum of all the elements divided by the number of elements.
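As a minimal sketch of this definition, the mean can be computed by hand and checked against Python's standard library. The list `values` is hypothetical data made up for illustration; it does not come from the text.

```python
from statistics import mean

# Hypothetical sample values, chosen only for illustration.
values = [4, 8, 15, 16, 23, 42]

# Mean = sum of all the elements divided by the number of elements.
manual_mean = sum(values) / len(values)

# The standard library's statistics.mean computes the same quantity.
library_mean = mean(values)

print(manual_mean)
print(library_mean)
```

Both approaches give the same value here; the manual version simply makes the "sum divided by count" definition explicit.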

Median

The median is the ...
