Data Profiling

Data profiling is used to help ensure data quality throughout an enterprise. By profiling your data, you can measure the volume and types of inconsistencies your data contains. Data profiles can contain a variety of measurements, including counts, distinct values, missing values, and possible relationships with other data points. If you are integrating data from multiple, disparate systems, data profiling will probably be an important part of your process.

Creating a Data Profile

To begin, create a new package named DataProfile.dtsx, and place a Data Profiling Task onto the Control Flow design area. Next, double-click on the task to open the Data Profiling Task Editor. In the Destination drop-down, select “New File connection” ...

Get Foundations of SQL Server 2008 R2 Business Intelligence, Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.