Chapter 6. Data Quality and Data Cleansing

Data quality is one of the common challenges in every database system, especially when data comes from different kinds of sources. Assume that some customer data comes in the form of a SQL Server database table, and the table is filled with data from a website's customer application forms. On the other hand, some data comes in the form of Excel files and there are some data files that come from a DB2 database. The incoming data might contain multiple copies of a single customer's information that differ slightly; for example, Mike might be written as Maike somewhere or a company might be written as MSFT somewhere and Microsoft on another location. This chapter will dig into the concept of data quality ...

Get Microsoft SQL Server 2014 Business Intelligence Development: Beginner’s Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.