O'Reilly logo

Learning HBase by Shashwat Shriparv

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Optimizing HBase

For some systems, memory-related settings that we discussed for Hadoop are common to HBase. So, let's discuss HBase-specific optimization in brief. We will also look at component-wise HBase optimization.

Let's start with Hadoop.

Hadoop

Add the following parameter to the hdfs-site.xml and hbase-site.xml files:

<property>
  <name>dfs.support.append</name>
  <value>true</value>
</property>

The preceding code will enable sync on HDFS, which is essential for durable HBase data synchronization and durability. After making this change, we need to restart the cluster.

The following code decides the number of open files on DataNode and should be kept high as HBase keeps a lot of files open. This must be kept to 4096 or above, according to the size ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required