O'Reilly logo
  • Bhavisha Patel thinks this is interesting:

Each of the data ponds (other than the raw data pond) has some common components: Pond descriptor. The pond descriptor contains a description of the external contents and manifestation of the pond, and where the data in the pond originated from. Pond target. The pond target is a description of the relationship between the business of the corporation and the data inside the pond. Pond data. The data in the pond is merely the physical data that resides inside the pond. Pond metadata. The metadata describes the physical characteristics of the data contained in the data pond. Pond metaprocess. Metaprocess information is information about the transformation / conditioning of the data inside the data pond. In order to b...