2.5

Contextualizing Repetitive Unstructured Data

Abstract

Repetitive unstructured data consists of many records whose form and structure are highly repetitive. Once the metadata for the repetitive records is found, the context of the data is revealed. And once the context is revealed for one record, it is revealed for all records. For this reason, discovering the context of repetitive unstructured data is considered very easy. Once the context is revealed, the unstructured data can be sent in a contextualized state to a standard DBMS, an index, or Big Data.
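
As a rough illustration of why one piece of metadata is enough for every repetitive record, here is a minimal Python sketch. It assumes the records are comma-delimited lines sharing one layout. The field names (`timestamp`, `meter_id`, `reading_kwh`), the sample data, and the `contextualize` helper are all hypothetical and are not taken from the book.

```python
import csv
import io

# Hypothetical metadata: one layout describing every repetitive record.
# These field names are illustrative assumptions, not from the source text.
FIELD_NAMES = ["timestamp", "meter_id", "reading_kwh"]


def contextualize(raw_text, field_names=FIELD_NAMES):
    """Attach the same field names (the context) to every record."""
    records = []
    for row in csv.reader(io.StringIO(raw_text)):
        if not row:
            continue  # skip blank lines
        if len(row) != len(field_names):
            raise ValueError(f"record does not match the layout: {row!r}")
        records.append(dict(zip(field_names, (value.strip() for value in row))))
    return records


# Hypothetical sample of repetitive records with an identical structure.
RAW = """2024-01-01T00:00,M-100,12.5
2024-01-01T00:15,M-100,11.9
2024-01-01T00:00,M-200,7.3
"""

rows = contextualize(RAW)
for record in rows:
    print(record)
```

Each resulting dictionary pairs a value with its field name, so the records could then be loaded into a DBMS table or an index without further interpretation.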

Keywords

repetitive unstructured data
metadata
index
DBMS
Big Data
context
contextualization
parsing

To be used for analysis, all ...

