Unstructured data is data that isn’t easy to fit into a data model because the content is context-specific or varying. One example of unstructured data is your regular email (figure 1.2). Although email contains structured elements such as the sender, title, and body text, it’s a challenge to find the number of people who have written an email complaint about a specific employee because so many ways exist to refer to a person, for example. The thousands of different languages and dialects out there further complicate this.
- Chapter 1. Data science in a big data world
- from Introducing Data Science: Big data, machine learning, and more, using Python tools
- Publisher: Manning Publications
- Released: May 2016
Share this highlighthttp://www.safaribooksonline.com/a/introducing-data-science/14161836/