Supervised Learning Using Apache Spark

In this chapter, we will develop, test, and evaluate supervised machine learning models applied to a variety of real-world use cases using Python, Apache Spark, and its machine learning library, MLlib. Specifically, we will train, test, and interpret the following types of supervised machine learning models:

  • Univariate linear regression
  • Multivariate linear regression
  • Logistic regression
  • Classification and regression trees
  • Random forests

Get Machine Learning with Apache Spark Quick Start Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.