Running custom Spark Docker locally

As discussed in previous chapters, this custom all-in-one Docker image is built on the community CentOS 7 Docker image and bundles the following applications at the versions listed:

  • Cassandra 3.11.2
  • InfluxDB 1.6.2
  • Grafana 5.2.4
  • Telegraf 1.7.4
  • JmxTrans 270
  • Spark 2.3.1
  • Python 2.7.15
  • R 3.5.1

The image also has the PySpark, SparkR, and Jupyter CLIs installed with the required configuration. You select one by passing PySpark, SparkR, or Jupyter through the SPARK_CLI flag; if the flag is not passed, PySpark is the default. All of the required configuration for the preceding applications is already in place. This makes it very simple to run this complex custom Spark for easier ...
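The SPARK_CLI selection described above, with PySpark as the fallback, can be sketched roughly as follows. This is only an illustration of the defaulting logic, not the image's actual entrypoint: the function name resolve_spark_cli, the CLI_COMMANDS mapping, and the exact launch commands are assumptions introduced here.

```python
import os

# Hypothetical mapping from SPARK_CLI values to launch commands.
# The values PySpark, SparkR, and Jupyter come from the text; the commands
# on the right are assumptions, not the image's real startup script.
CLI_COMMANDS = {
    "pyspark": ["pyspark"],
    "sparkr": ["sparkR"],
    "jupyter": ["jupyter", "notebook"],
}

# The text states that PySpark is used when SPARK_CLI is not passed.
DEFAULT_CLI = "pyspark"


def resolve_spark_cli(environ=None):
    """Return the command for the requested CLI, defaulting to PySpark."""
    environ = os.environ if environ is None else environ
    requested = environ.get("SPARK_CLI", "").strip().lower()
    if not requested:
        requested = DEFAULT_CLI
    if requested not in CLI_COMMANDS:
        raise ValueError("Unsupported SPARK_CLI value: {!r}".format(requested))
    return CLI_COMMANDS[requested]


if __name__ == "__main__":
    # No SPARK_CLI set: falls back to PySpark.
    print(resolve_spark_cli({}))
    # Explicit choice, matched case-insensitively in this sketch.
    print(resolve_spark_cli({"SPARK_CLI": "SparkR"}))
```

Matching the value case-insensitively and rejecting unknown values are design choices of this sketch; the source does not say how the image treats unexpected SPARK_CLI values.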
