Contextualizing Repetitive Unstructured Data
Abstract
Contextualizing repetitive unstructured data. Repetitive unstructured data consists of many records, the form and structure of which is highly repetitive. Once the metadata is found for the repetitive records, the context of the data is revealed. And the context for one record is revealed, it is revealed for all records. Thus, it is said that discovering the context for repetitive unstructured data is very easy to do. Once the context is revealed, the unstructured data can be sent to a standard DBMS, an index or to Big Data in a contextualized state.
Keywords
Get Data Architecture: A Primer for the Data Scientist now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.