Summary

In this chapter, we discussed the architecture of Spark and its various components, including core abstractions of the Spark framework such as RDDs. We also covered the packaging structure of Spark and its core APIs, configured our Spark cluster, and executed our first Spark job in Scala and Java.

In the next chapter, we will discuss in detail the various functions and APIs exposed by Spark RDDs.
