THE CAUSES OF POOR-QUALITY DATA

Poor-quality data can arise for a number of reasons, some technical and some human (although even the technical reasons can probably be traced back to some human error):

  • databases having inappropriate schemas;

  • errors being made on data entry;

  • data decaying over time;

  • data being corrupted when moved between systems;

  • lack of understanding of the data when it is used.

As discussed in Chapter 2, databases should be designed so that there is no unnecessary duplication of data. They should also be designed so that they can cope with changes in requirements without major cost implications. Update anomalies, which can lead to data inconsistency, are avoided when there is no unnecessary duplication of data. Data inconsistency ...

Get Principles of Data Management now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.