What You’ll Learn in This Hour:
Shared variables in Spark—broadcast variables and accumulators
Partitioning and repartitioning of Spark RDDs
Processing RDD data with external programs
In this hour, I will cover the additional programming tools at your disposal with the Spark API, including broadcast variables and accumulators as shared variables across different workers. I will also dive deeper into the important ...