Real-Time Machine Learning Using Apache Spark

In this chapter, we will extend our deployment of machine learning models beyond batch processing so that we can learn from data, make predictions, and identify trends in real time! We will develop and deploy a real-time stream processing and machine learning application built from the following high-level components:

  • Apache Kafka producer application
  • Apache Kafka consumer application
  • Apache Spark's Structured Streaming engine
  • Apache Spark's machine learning library, MLlib
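
To give a rough feel for how these four pieces might fit together, here is a minimal PySpark sketch, not the chapter's actual application. It assumes a Kafka producer is already writing plain-text messages to a topic, and that a fitted MLlib pipeline expecting an input column named `text` has been saved to disk. The function names, the `text` column, the topic name, and the model path are all hypothetical and chosen only for illustration.

```python
def kafka_source_options(bootstrap_servers, topic, starting_offsets="latest"):
    """Build the option map for Spark's Kafka source (the consumer side)."""
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": topic,
        "startingOffsets": starting_offsets,
    }


def read_messages(spark, options):
    """Return a streaming DataFrame with a single string column named `text`.

    `spark` is an existing SparkSession. The Kafka source delivers each
    message value as binary, so it is cast to a string here.
    """
    return (
        spark.readStream.format("kafka")
        .options(**options)
        .load()
        .selectExpr("CAST(value AS STRING) AS text")
    )


def score_and_print(messages, model_path):
    """Apply a previously saved MLlib pipeline to the stream and print results.

    Assumption: the pipeline at `model_path` was fitted in batch mode and
    reads its input from a column named `text`.
    """
    # Imported here so the helpers above can be used without PySpark installed.
    from pyspark.ml import PipelineModel

    model = PipelineModel.load(model_path)
    predictions = model.transform(messages)
    # Write each micro-batch of predictions to the console as it arrives.
    return predictions.writeStream.outputMode("append").format("console").start()
```

With a SparkSession named `spark` started with the Kafka connector package (`spark-sql-kafka-0-10`) available, one might call `read_messages(spark, kafka_source_options("localhost:9092", "events"))`, pass the result to `score_and_print` along with the saved model path, and then call `awaitTermination()` on the returned query. The broker address and topic name here are placeholders.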
