Working with HBase

In this recipe, you will learn how to integrate HBase with Hive.

HBase is a distributed database used to store large volumes of data. It is written in Java and runs on top of HDFS, where it provides fast random reads and writes over large datasets with high throughput.

Getting ready

Integrating Hive with HBase has a few prerequisites. This topic uses both Hive and HBase, so you must have HBase installed on your system before going further.

Once HBase is installed, configure it as shown in the following steps:

Add the following properties to the hbase-site.xml file:

<property>
  <name>hbase.cluster.distributed</name>
  <value>true</value>
</property>
<property> ...
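After editing hbase-site.xml, it can be useful to confirm that the file parses and that hbase.cluster.distributed is really set to true. The following is a minimal sketch in Python, not part of the recipe itself. It assumes the file uses the standard Hadoop-style <configuration> root element; the helper names read_hbase_properties and is_distributed are hypothetical, and the sample text contains only the one property shown above.

```python
import xml.etree.ElementTree as ET


def read_hbase_properties(xml_text):
    """Return a {name: value} dict for every <property> in the given XML text."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None:
            # Strip whitespace that hand-edited config files often contain.
            props[name.strip()] = (value or "").strip()
    return props


def is_distributed(props):
    """True if hbase.cluster.distributed is set to true (HBase defaults to false)."""
    return props.get("hbase.cluster.distributed", "false").lower() == "true"


# Sample input: only the property from this recipe, wrapped in the
# <configuration> root element (an assumption about the file layout).
SAMPLE = """<configuration>
  <property>
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
</configuration>"""

if __name__ == "__main__":
    print(is_distributed(read_hbase_properties(SAMPLE)))
```

To check a real installation, read the contents of your own hbase-site.xml (its location depends on how HBase was installed) and pass the text to read_hbase_properties.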
