O'Reilly logo

Sams Teach Yourself Apache Spark™ in 24 Hours by Jeffrey Aven

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Hour 12. Advanced Spark Programming

What You’ll Learn in This Hour:

Image Shared variables in Spark—broadcast variables and accumulators

Image Partitioning and repartitioning of Spark RDDs

Image Processing RDD data with external programs

In this hour, I will cover the additional programming tools at your disposal with the Spark API, including broadcast variables and accumulators as shared variables across different workers. I will also dive deeper into the important ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required