Principles of Data Science by Sinan Ozdemir

Structured versus unstructured data

Whether a dataset is structured or unstructured is usually the first question to ask about it. The answer can mean the difference between needing three days or three weeks to perform a proper analysis.

The basic breakdown is as follows (it restates the definitions of organized and unorganized data from the first chapter):

  • Structured (organized) data: This is data that can be thought of as observations and characteristics. It is usually organized as a table, with observations in rows and characteristics in columns.
  • Unstructured (unorganized) data: This is data that exists in free form and does not follow any standard organizational hierarchy.
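
As a rough illustration of the two definitions above, the following Python sketch holds the same made-up information in both forms. The records, the `column` helper, and the regular expression are all hypothetical and exist only for this example; they are not taken from the book.

```python
import re

# Structured data: each dict is an observation (row) and each key is a
# characteristic (column). These records are invented for illustration.
structured = [
    {"name": "Ada", "age": 36, "city": "London"},
    {"name": "Grace", "age": 45, "city": "New York"},
]

# Unstructured data: the same facts as free text, with no rows or columns.
unstructured = "Ada, who is 36, wrote from London. Grace (45) replied from New York."


def column(rows, key):
    """Return one characteristic (column) across all observations (rows)."""
    return [row[key] for row in rows]


# With structured data, a question like "what is the average age?" is direct.
ages = column(structured, "age")
average_age = sum(ages) / len(ages)

# With free text, the same question needs ad hoc parsing first.
# Naive assumption: every standalone one- to three-digit number is an age.
ages_from_text = [int(n) for n in re.findall(r"\b\d{1,3}\b", unstructured)]

print(average_age)
print(ages_from_text)
```

The structured version answers the question with a simple column lookup, while the unstructured version depends on a fragile guess about what the text contains. That extra parsing and cleaning step is one reason unstructured data tends to take longer to analyze.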

Here are a few examples that could help ...
