There's more...

More about the Bisecting KMeans can be found at:

We use clustering to explore the data and get a feel for how it falls into clusters. Bisecting KMeans is an interesting case because it sits between hierarchical analysis and KMeans clustering.

The best way to conceptualize bisecting KMeans is as a recursive, hierarchical KMeans. Like KMeans, the algorithm divides the data using similarity measurement techniques, but it applies the splits in a hierarchical scheme to increase accuracy. It is particularly prevalent ...
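To make the recursive idea concrete, here is a minimal sketch in plain Python (standard library only). It is not Spark's implementation; the function names `two_means` and `bisecting_kmeans` are hypothetical, and the rule of always splitting the cluster with the largest sum of squared errors (SSE) is an assumption, chosen because it is a common way to pick the next cluster to bisect.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def sse(points):
    """Sum of squared distances from each point to the cluster mean."""
    center = mean(points)
    return sum(sq_dist(p, center) for p in points)


def two_means(points, rng, iters=20):
    """Ordinary KMeans with k=2; returns the two halves as lists."""
    centers = rng.sample(points, 2)  # two random points as initial centers
    for _ in range(iters):
        halves = ([], [])
        for p in points:
            nearest = 0 if sq_dist(p, centers[0]) <= sq_dist(p, centers[1]) else 1
            halves[nearest].append(p)
        if not halves[0] or not halves[1]:
            break  # degenerate split, e.g. duplicate initial centers
        centers = [mean(halves[0]), mean(halves[1])]
    return halves


def bisecting_kmeans(points, k, seed=0):
    """Start with one cluster and bisect until there are k clusters."""
    rng = random.Random(seed)
    clusters = [list(points)]
    while len(clusters) < k:
        splittable = [c for c in clusters if len(c) >= 2]
        if not splittable:
            break
        # Assumption: bisect the cluster with the largest SSE next.
        target = max(splittable, key=sse)
        left, right = two_means(target, rng)
        if not left or not right:
            break  # could not split any further
        clusters.remove(target)
        clusters.extend([left, right])
    return clusters


# Three well-separated groups of one-dimensional points.
data = [(0,), (1,), (2,), (50,), (51,), (52,), (100,), (101,), (102,)]
clusters = bisecting_kmeans(data, k=3)
```

Each pass through the loop runs an ordinary two-way KMeans on a single cluster, so the sequence of splits forms a binary tree over the data. That tree is the hierarchical part of the scheme.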
