  • Bhavisha Patel thinks this is interesting:

Another Perspective

The three major categories of data found in the data lake, then, are analog data, application data, and textual data. But there is another important way to classify data in the data lake: as repetitive or non-repetitive. In general, analog and application data are repetitive, whereas textual data is non-repetitive. Fig. 3.8 shows data in the data lake divided into repetitive and non-repetitive data.