CHAPTER 4

image

Interactive Data Analysis with Spark Shell

One of the reasons for Spark’s hockey-stick growth is its usability. It not only provides a rich expressive API in multiple languages, but it also makes it easy to get started. It comes with a command-line tool called Spark shell, which allows you to interactively write Spark applications in Scala. The Spark shell is similar to the Scala shell, discussed in Chapter 2. In fact, it is based on the Scala shell.

The Spark shell provides a great environment for not only interactively analyzing data, but also learning Spark. You can install it on your local development machine in less than a minute ...

Get Big Data Analytics with Spark: A Practitioner’s Guide to Using Spark for Large-Scale Data Processing, Machine Learning, and Graph Analytics, and High-Velocity Data Stream Processing now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.