You are previewing Hadoop in Action.

Hadoop in Action

Cover of Hadoop in Action by Chuck Lam Published by Manning Publications

Chapter 8. Managing Hadoop

This chapter covers

  • Configuring for a production system
  • Maintaining the HDFS filesystem
  • Setting up a job scheduler

The installation instructions in chapter 2 produced a running Hadoop cluster fairly quickly. The configuration was relatively simple, but unfortunately it’s not good for a production cluster, which will be under heavy sustained use. There are various configuration parameters that you would want to tune for a production cluster, and section 8.1 will cover those parameters.

In addition, like any system, a Hadoop cluster will change over time and you (or some administrator) will have to know how to maintain it to keep it running in good shape. This is particularly true for the HDFS filesystem. In sections ...

The best content for your career. Discover unlimited learning on demand for around $1/day.