Working with Accumulo

In this recipe, you will learn how to integrate Hive with Apache Accumulo.

Apache Accumulo is a sparse, distributed, sorted, and multidimensional map of key-value pairs. It is modeled after Google's Bigtable design. It's a key-value store and handles structured, semi-structured, and unstructured data. Also, it is extremely fast in accessing data to and fro tables containing large volumes of data.

Getting ready

In this topic, we will cover the use of Hive and Accumulo. You must have Apache Accumulo installed on your system before going further in the topic.

For Apache integration with Hive, there are two main components as follows:

  • AccumuloStorageHandler: The main job of this class is to map the Hive table to the Accumulo tables. ...

Get Apache Hive Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.