336 Solving Operational Business Intelligence with InfoSphere Warehouse Advanced Edition
9.1 The value of data and its age
The most valuable data in a data warehouse is data that delivers actionable
insight to the business. In an operational business intelligence (BI) environment,
this is recent or low latency data. The ability to gain actionable insight from low
latency data increases as the volume of low latency data increases. Because the
data sample is greater, more opportunities exist to find value through identifying
patterns, seeing trends, predicting outcomes with greater accuracy, comparing
data sets, and applying data mining techniques.
By effectively managing the data lifecycle, you can help enable greater volumes
of data to be ingested at greater speeds, which delivers an increased return on
investment to your business. The challenge is to maximize the ingest rate. To
meet this challenge, you must have the capability to organize, segregate, and
isolate data that is “cooling off” to help ensure that priority operational queries are
as efficient as possible.
As discussed in Chapter 7, “Understand and address data latency requirements”
on page 267, reducing the time latency between business events occurring and
the ability to take actionable insight based on data related to the event represents
a significant value proposition to your business.
The technical challenge is to store and maintain data to facilitate queries in a way
that maximizes performance and minimizes cost. This can be achieved by
distinguishing data as it ages.
The concept of data temperature and a multi-temperature database, although
unique to each environment, can be generally described by the patterns shown
in Table 9-1.
Table 9-1 Data temperature patterns
Data
temperature
Data temperature
characteristics
Typical data age Data maintenance
Hot Tactical and OLTP-type data;
that is, current data that is
accessed frequently by
queries that must have short
response times, for example,
high volume, small result set
point queries in operational
data stores (ODS).
0 - 3 months and
aggregates or
summaries of this data.
Data is located on the
fastest storage and is
updated frequently.
Frequent table space
backups are taken to aid
fast recovery if needed.