Chapter 8. Data Visualization with Spark

In this chapter, you will learn the following recipes:

  • Visualization using Zeppelin
  • Creating scatter plots with Bokeh-Scala
  • Creating a time series MultiPlot with Bokeh-Scala
  • Creating plots with the lightning visualization server
  • Visualizing machine learning models with Databricks notebook

Introduction

Visualizing large datasets is challenging. There are often more data points than available pixels, and manipulating distributed data can take a long time. Along with the increase in volume, new kinds of datasets are becoming more and more mainstream. The need to analyze user comments, sentiments, customer calls, and other unstructured data has resulted in the use of new kinds of visualizations. The use of graph ...

