Creating a data quality report

Data quality is a fundamental issue for business intelligence. The reliability of your analysis and, by extension, the decisions you make based on that analysis, depend on the quality of data you use.

A data quality report provides objective measures of the quality of your data making it a critical first step of the business intelligence process.

Getting ready

For creating our report, we are going to create a number of DataFrames from our dataset, and then merge them together at the end. The parts of our report will include the following:

  1. Available columns
  2. For each column:
    • Data type
    • Count of missing values
    • Count of present values
    • Number of unique values
    • Minimum value
    • Maximum value

How to do it…

  1. To create your data quality report, ...

Get Python Business Intelligence Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.