O'Reilly logo

Predictive Analytics For Dummies by Tommy Jung, Mohamed Chaouchi, Anasse Bari

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 6

Identifying Similarities in Data

In This Chapter

arrow Clustering data

arrow Identifying hidden groups of similar information in your data

arrow Finding associations among data items

arrow Organizing data with biologically inspired clustering

There is so much data around us that it can feel overwhelming. Large amounts of information are constantly being generated, organized, analyzed, and stored. Data clustering is the process that can help you make sense of this flood of data by discovering hidden groupings of similar data items. Data clustering provides a description of your data that says, in essence, your data contains x number of groups of similar data objects.

Clustering — in the form of grouping similar things — is part of our daily activities. You use clustering any time you group similar items together. For example, when you store groceries in your fridge, you group the vegetables by themselves in the crisper, put frozen foods in their own section (the freezer), so on. When you organize currency in your wallet, you arrange the bills by denomination — larger with larger, smaller with ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required