Plotting the data as a probability distribution

Plotting our data in a histogram as a probability distribution tells matplotlib to normalize the histogram so that its total area equals 1. Rather than showing how many values fall into each bin, as in the previous recipe, each bar's height is a probability density, and the bar's area (height times bin width) is the probability of finding a value in that bin.
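The normalization can be checked on a small example. The following is a minimal sketch, not the recipe's own code: the sample data, the bin count, and the variable names are assumptions made for illustration, and it uses the `density=True` argument of current matplotlib releases (older releases used `normed=True` for the same purpose).

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs outside a notebook
import matplotlib.pyplot as plt
import numpy as np

# Hypothetical sample data standing in for a single DataFrame column.
rng = np.random.default_rng(0)
values = rng.normal(loc=50, scale=10, size=1000)

fig, ax = plt.subplots()

# density=True rescales the bar heights so the total area of the histogram is 1.
heights, edges, _ = ax.hist(values, bins=20, density=True)
ax.set_xlabel("value")
ax.set_ylabel("probability density")

# The probability of landing in a bin is the bar's area: height times bin width.
bin_probabilities = heights * np.diff(edges)
total_area = bin_probabilities.sum()
```

Here `bin_probabilities` holds the per-bin probabilities and `total_area` should come out as 1 up to floating-point rounding. In a notebook, the `matplotlib.use("Agg")` line can be dropped in favor of the inline magic function.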

How to do it…

  1. To create a probability distribution for a single column in a pandas DataFrame, begin by importing the required libraries. To show the matplotlib plots inline in IPython Notebook, we will use an IPython magic function, which starts with %:
    %matplotlib inline
    import pandas as pd
    import numpy as np
    from pymongo import MongoClient
    import matplotlib as mpl
    import matplotlib.pyplot as plt
  2. Next, connect ...
