Installation

Generally Spark can be installed from the source or prebuilt binaries. The following examples demonstrate how to use the binaries with prebuilt Hadoop 2.7 installation for any Linux, and for other kinds you can refer to the Spark docs at Spark-Spark Overview: http://spark.apache.org. Make sure your server has access to the internet for downloading the binary and installing it accordingly:

# Assuming on a Centos 7 instance# Java Installation.Refer OpenJDK Docs for further information at OpenJDK: Download and Install, which# was retrieved from http://openjdk.java.net/install/yum install java-1.8.0-openjdk# Spark Installationcurl -# https://archive.apache.org/dist/spark/spark-$SPARK_VERSION/spark-$SPARK_VERSION-bin-hadoop2.7.tgz ...

Get Mastering Apache Cassandra 3.x - Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.