Appendix A. Installing Apache Spark

Although we provide a VM image where Spark is already installed, we also wanted to give you step-by-step instructions on how to install Apache Spark as it would be done in the real world. This appendix contains instructions for the following:

  • Installing Java (JDK)
  • Downloading, installing, and configuring Apache Spark

If you aren’t using Ubuntu, we suggest that you install the VirtualBox hardware-virtualization software and create a Ubuntu VM (www.wikihow.com/Install-Ubuntu-on-VirtualBox).

Prerequisites: installing the JDK

Let’s get started. From now on, we’ll assume that you’re logged in to your Ubuntu OS.

If you aren’t sure whether you already have the JDK installed and set up correctly, open your ...

Get Spark in Action now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.