8. The Role of the NameNode and How HDFS Works

As you learned in earlier chapters, HDFS is one of the two main architectural pillars of Hadoop 2, the other being YARN. In this chapter, I start off with an explanation of the interaction between the NameNode and the DataNodes and go on to discuss the theory behind how clients read from and write to HDFS. I then discuss HDFS features such as archival storage and HDFS cache management.

This chapter discusses the following topics pertaining to HDFS:

Image The interaction between the NameNode and the DataNodes

HDFS data organization

How clients read and write HDFS data

HDFS recovery processes

Hadoop archival ...

Get Expert Hadoop® Administration now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.