4

Learning Spark Programming Basics

Talk is cheap. Show me the code.

Linus Torvalds, Finnish-American creator of Linux

In This Chapter:

Resilient Distributed Datasets (RDDs)

How to load data into Spark RDDs

Transformation and actions on RDDs

How to perform operations on multiple RDDs

Now that we’ve covered Spark’s runtime architecture and how ...

Get Data Analytics with Spark Using Python, First edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.