6.2

Big Data/Existing Systems Interface

Abstract

Big Data is divided into two kinds of data – repetitive unstructured data and nonrepetitive unstructured data. The nonrepetitive data passes through textual disambiguation where it is contextualized. Once the nonrepetitive data is contextualized, it can pass back into Big Data in a context enriched state or it can pass into a standard data base. The repetitive unstructured data found in Big Data is either distilled or filtered. Once distilled or filtered, the data can pass into a database or back into Big Data in as context enriched state.

Keywords

Big Data
textual disambiguation
context
context enriched state
data base
unstructured repetitive data
unstructured nonrepetitive data
One of ...

Get Data Architecture: A Primer for the Data Scientist now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.