Text can be given a superficial analysis without any transformation, but a deep analysis of the data requires that the text first be disambiguated. Disambiguation has two important effects: the text is restructured into a uniform database format, and the context of the text is identified and attached to the text itself.
- Chapter 4 Data Ponds
- from Data Lake Architecture: Designing the Data Lake and Avoiding the Garbage Dump
- Publisher: Technics Publications
- Released: April 2016
Text data pond
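
The two effects named in the highlight can be illustrated with a minimal sketch. This is not the book's method: the `TAXONOMY` mapping, the `disambiguate` function, and the row layout `(doc_id, offset, word, context)` are all hypothetical names chosen for illustration. The sketch assumes that context can be looked up word by word in a small hand-built taxonomy, which is a strong simplification of real textual disambiguation.

```python
import re

# Hypothetical taxonomy (an assumption for this sketch): maps a raw word
# to the context it belongs to.
TAXONOMY = {
    "honda": "car",
    "toyota": "car",
    "oak": "tree",
    "elm": "tree",
}


def disambiguate(doc_id, text, taxonomy=TAXONOMY):
    """Turn free text into uniform rows: (doc_id, offset, word, context).

    Effect 1: every recognized word becomes a row with the same shape,
    so the result can be loaded into a database table.
    Effect 2: the context found in the taxonomy travels with the word.
    """
    rows = []
    for match in re.finditer(r"[A-Za-z]+", text):
        word = match.group().lower()
        context = taxonomy.get(word)
        if context is not None:  # words without a known context are skipped
            rows.append((doc_id, match.start(), word, context))
    return rows


rows = disambiguate("note-1", "She parked the Honda under an oak.")
for row in rows:
    print(row)
```

Each output row has the same columns regardless of how the original sentence was phrased, and the context column ("car", "tree") is stored next to the word it describes, so later queries do not need to reread the raw text.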