Unsupervised Learning Using Apache Spark

In this chapter, we will train and evaluate unsupervised machine learning models applied to a variety of real-world use cases, again using Python, Apache Spark, and its machine learning library, MLlib. Specifically, we will develop and interpret the following types of unsupervised machine learning models and techniques:

  • Hierarchical clustering
  • K-means clustering
  • Principal component analysis

Get Machine Learning with Apache Spark Quick Start Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.