Chapter 4. Clustering: grouping things together

This chapter covers:

  • Understanding the need and value of clustering

  • Discovering user groups in a typical website and finding groups of similar news stories, blog reports, or documents.

  • Link-based clustering algorithms and the blazing fast k-means

Our ability as humans to accumulate and retain information relies greatly on our ability to structure the abundance of information that we receive through means, such as sensory perception, reason, language, and emotion. The profusion of available information would be overwhelming without some reference structures. Mental constructs that put order to all the data that we receive help us retain the essence of the data and understand the world around us.

Typically, ...

Get Algorithms of the Intelligent Web now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.