The transformation criteria are a description of the criteria used in the transformation process for the conditioning of data within the data pond. Each of the data ponds has their own unique transformation criteria. The analog data pond may have a statement of the threshold for measurements. There may be a criterion that says: “If the length is greater than 45 cm then capture the record, else do not capture the record.” Or there may be criterion that says: “Catch all measurements of a certain machine for the month of May.” In the application data pond, there might be criteria that looks like: “If gender = 0 then convert gender to female. If gender = 1 then convert gender to male. If gender = x then convert gender to female. If gender...
- Chapter 5 Generic Structure of the Data Pond
- from Data Lake Architecture: Designing the Data Lake and Avoiding the Garbage Dump
- Publisher: Technics Publications
- Released: April 2016
Share this highlighthttp://www.safaribooksonline.com/a/data-lake-architecture/7948805/