Corona

Corona is another project from Facebook. It is now open source and hosted on GitHub at https://github.com/facebookarchive/hadoop-20/tree/master/src/contrib/corona.

Facebook, with its petabyte-scale data, ran into serious performance problems with the classic MapReduce framework, because a single JobTracker was responsible for thousands of jobs and did most of the work alone.

In order to solve these issues, Facebook created Corona, which separated cluster resource management from job coordination.

In Hadoop Corona, the cluster resources are tracked by a central Cluster Manager. Each job gets its own Corona Job Tracker, which tracks only that job.
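
The split described above can be illustrated with a small toy model. This is only a sketch of the idea, not Corona's actual code or API: the class names `ClusterManager` and `JobTracker`, their methods, and the "slots per node" resource model are all hypothetical and chosen for illustration. The point it tries to show is that the central component knows only about free resources, while each per-job tracker knows only about its own tasks.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ClusterManager:
    """Hypothetical central manager: tracks free slots, knows nothing about tasks."""
    free_slots: Dict[str, int]  # node name -> number of free slots (assumed model)

    def grant(self, wanted: int) -> List[str]:
        """Hand out up to `wanted` slots; return the node chosen for each slot."""
        granted: List[str] = []
        for node in sorted(self.free_slots):
            while self.free_slots[node] > 0 and len(granted) < wanted:
                self.free_slots[node] -= 1
                granted.append(node)
        return granted

    def release(self, node: str) -> None:
        """Return one slot on `node` to the free pool."""
        self.free_slots[node] += 1


@dataclass
class JobTracker:
    """Hypothetical per-job tracker: coordinates the tasks of exactly one job."""
    job_id: str
    pending: List[str]
    running: Dict[str, str] = field(default_factory=dict)  # task -> node
    done: List[str] = field(default_factory=list)

    def schedule(self, cm: ClusterManager) -> None:
        """Ask the manager for slots and start as many pending tasks as granted."""
        for node in cm.grant(len(self.pending)):
            self.running[self.pending.pop(0)] = node

    def finish(self, task: str, cm: ClusterManager) -> None:
        """Mark a running task as done and give its slot back to the manager."""
        cm.release(self.running.pop(task))
        self.done.append(task)


if __name__ == "__main__":
    cm = ClusterManager({"node-a": 2, "node-b": 1})
    job1 = JobTracker("job-1", ["m0", "m1"])
    job2 = JobTracker("job-2", ["m0", "m1"])
    job1.schedule(cm)
    job2.schedule(cm)       # only one slot is left, so one task stays pending
    job1.finish("m0", cm)   # frees a slot on node-a
    job2.schedule(cm)       # the remaining task of job-2 can now start
    print(job1, job2, cm, sep="\n")
```

In this sketch, adding more jobs adds more `JobTracker` objects but leaves the `ClusterManager` doing the same simple bookkeeping, which is the kind of load separation the text attributes to Corona.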

Corona has entirely redesigned MapReduce architecture ...
