Each of the data ponds (other than the raw data pond) has some common components:
- Pond descriptor. The pond descriptor contains a description of the external contents and manifestation of the pond, and where the data in the pond originated from.
- Pond target. The pond target is a description of the relationship between the business of the corporation and the data inside the pond.
- Pond data. The data in the pond is merely the physical data that resides inside the pond.
- Pond metadata. The metadata describes the physical characteristics of the data contained in the data pond.
- Pond metaprocess. Metaprocess information is information about the transformation / conditioning of the data inside the data pond.
In order to b...
- Chapter 5 Generic Structure of the Data Pond
- from Data Lake Architecture: Designing the Data Lake and Avoiding the Garbage Dump
- Publisher: Technics Publications
- Released: April 2016
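
As a reading aid, the five common components listed in the highlight could be modeled as one record per pond. This is only a minimal sketch, not a structure given in the book: the class names (`PondDescriptor`, `DataPond`), the field types, the `record_step` helper, and the sample values are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class PondDescriptor:
    """Hypothetical shape for the pond descriptor."""
    contents: str  # description of the external contents and manifestation
    origin: str    # where the data in the pond originated


@dataclass
class DataPond:
    """Hypothetical container grouping the five common pond components."""
    descriptor: PondDescriptor
    target: str  # relationship between the business and the pond's data
    data: List[Dict[str, Any]] = field(default_factory=list)  # the physical data
    metadata: Dict[str, str] = field(default_factory=dict)    # physical characteristics
    metaprocess: List[str] = field(default_factory=list)      # transformation history

    def record_step(self, step: str) -> None:
        """Append one transformation / conditioning step to the metaprocess log."""
        self.metaprocess.append(step)


# Illustrative values only; none of them come from the book.
pond = DataPond(
    descriptor=PondDescriptor(contents="example records", origin="raw data pond"),
    target="supports an example business question",
    metadata={"record_id": "integer", "value": "float"},
)
pond.data.append({"record_id": 1, "value": 3.5})
pond.record_step("converted value to float")
```

Keeping the metaprocess as an append-only list is one possible design choice: it preserves the order in which the data was conditioned, which is the kind of information the metaprocess component is described as holding.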