13. Merging Unstructured Databases into the Data Warehouse

The output of unstructured data integration is a database. That database takes one of two forms: it is keyed either on document/word or on one or more identifiers. Figure 13-1 shows the output of unstructured data processing and the resulting databases.

Figure 13-1 The two types of databases that can be created from unstructured and semistructured data

Of course, if the source is unstructured data, the database is likely to be a document/word database, and if the source is semistructured data, the database is likely to be keyed on identifiers (ID 1, ID 2, ID 3, ...).
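
To make the two shapes concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table names (doc_word, id_keyed), the column names, the position column, and the sample rows are all hypothetical and chosen only for illustration; the text does not prescribe a schema.

```python
import sqlite3


def build_example_databases() -> sqlite3.Connection:
    """Create both database shapes in one in-memory SQLite database.

    All names and sample values here are illustrative assumptions.
    """
    conn = sqlite3.connect(":memory:")

    # Shape 1: keyed on document/word. One row per word occurrence.
    # The position column is an assumption added so that repeated
    # words in the same document remain distinct rows.
    conn.execute(
        "CREATE TABLE doc_word ("
        " document TEXT NOT NULL,"
        " word TEXT NOT NULL,"
        " position INTEGER NOT NULL,"
        " PRIMARY KEY (document, word, position))"
    )

    # Shape 2: keyed on one or more identifiers (ID 1, ID 2, ID 3).
    conn.execute(
        "CREATE TABLE id_keyed ("
        " id1 TEXT NOT NULL,"
        " id2 TEXT NOT NULL,"
        " id3 TEXT NOT NULL,"
        " content TEXT,"
        " PRIMARY KEY (id1, id2, id3))"
    )

    # Hypothetical sample rows.
    conn.executemany(
        "INSERT INTO doc_word VALUES (?, ?, ?)",
        [
            ("email_001.txt", "contract", 0),
            ("email_001.txt", "renewal", 1),
            ("email_002.txt", "contract", 0),
        ],
    )
    conn.execute(
        "INSERT INTO id_keyed VALUES (?, ?, ?, ?)",
        ("C-100", "2024-01-15", "P-7", "warranty claim notes"),
    )
    conn.commit()
    return conn


if __name__ == "__main__":
    conn = build_example_databases()
    # Which documents mention a given word?
    rows = conn.execute(
        "SELECT document FROM doc_word WHERE word = ? ORDER BY document",
        ("contract",),
    ).fetchall()
    print(rows)
```

The document/word table answers questions such as "which documents mention this word," while the identifier-keyed table can be joined to other tables that share the same identifiers.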

The question then becomes: ...
