Summary

In this chapter, we have learned the essentials of HDFS, such as file operations on HDFS and how to configure HDFS. We looked at the namenode and secondary namenode web interfaces and explored a few HDFS commands. We also covered the MapReduce architecture along with a detailed walkthrough of the namenode and jobtracker web interfaces.

In the next chapter, we will dive into Cloudera's Distribution Including Apache Hadoop (CDH).

Get Cloudera Administration Handbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.