Data structure analysis

Once data is acquired, we have to understand the structure of the data. Remember that the data we are getting can be in any of the following forms:

  • Text data:
    • Structured data
    • Unstructured data
  • Binary data

This is where we need certain tools to help us understand the structure of the data.

Once we have a thorough understanding of the data we are dealing with, the next task is to understand the bits and pieces we need to extract from this structure. Sometimes, depending on the complexity and size of the data we are dealing with, it might take time for us to really find and extract the information we are looking for.

Once we know what we are looking for and also have a solid understanding of the structure of the ...

Get Modern Big Data Processing with Hadoop now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.