Chapter 8. Basic Performance Tuning

In this chapter, we will cover:

  • Setting up Hadoop to spread disk I/O
  • Using a network topology script to make Hadoop rack-aware
  • Mounting disks with noatime and nodiratime
  • Setting vm.swappiness to 0 to avoid swap
  • Java GC and HBase heap settings
  • Using compression
  • Managing compactions
  • Managing a region split

Introduction

Performance is one of the most interesting characteristics of an HBase cluster's behavior. Tuning it is a challenging task for administrators, because it requires a deep understanding not only of HBase but also of Hadoop, Java Virtual Machine garbage collection (JVM GC), and the important tuning parameters of the operating system.

The structure of a typical HBase cluster is shown in the following ...

