Summary

In this chapter, we covered the architecture and implementation of HDFS Federation, which splits the namespace into multiple parts that can be managed separately. We then implemented HDFS high availability (HA) by configuring two namenodes in an active/passive configuration, and jobtracker high availability by configuring two jobtrackers in the same way. Both HDFS HA and jobtracker HA can be configured either for manual failover or for automatic failover using Apache ZooKeeper.

In the next chapter, we will learn the architecture and implementation of Cloudera Manager, Cloudera's Apache Hadoop administration tool. We will cover all of its features and how it can be used to administer a cluster running CDH5. ...
