Summary

YARN is used as a generic resource manager for distributed applications. YARN allows easy resource scheduling and application execution over a cluster of nodes. YARN is being integrated with different big data technologies such as Apache Storm, Spark, and so on. In this chapter, you learned about the Spark-YARN architecture and how you can submit Spark jobs on YARN. You also learned about the integration of Storm on YARN and how you can manage the storm services through YARN. This chapter also covered a brief overview of the integration of HAMA and Giraph with YARN.

Get Learning YARN now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.