Chapter 2. Data Migration

In this chapter, we will cover:

  • Importing data from MySQL using a single client
  • Importing data from TSV files using the bulk load tool
  • Writing your own MapReduce job to import data
  • Precreating regions before moving data into HBase

Introduction

There are several ways to move data into HBase:

  • Using the HBase Put API
  • Using the HBase bulk load tool
  • Using a customized MapReduce job

The HBase Put API is the most straightforward method. Its usage is not difficult to learn. For most situations however, it is not always the most efficient method. This is especially true when a large amount of data needs to be transferred into HBase within a limited time period. The volume of data to be taken care of is usually huge, and that's probably ...

Get HBase Administration Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.