HBase Design Patterns by Sujee Maniyam, Mark Kerzner

Importing data from HDFS into HBase

Suppose we have a large amount of data in HDFS and want to import it into HBase. We will write a MapReduce program that reads the data from HDFS and inserts it into HBase. This corresponds to the second scenario in the table we just saw.

First, we need to set up the environment for the discussion that follows. We assume that you have already set up HBase, either through the Kiji distribution or using one of the other approaches described in Chapter 1, Starting Out with HBase. You can find the code and the data for this discussion in our GitHub repository at https://github.com/elephantscale/hbase-book.

The dataset we will use is the sensor data. Our (imaginary) sensor data is stored in HDFS as CSV (comma-separated values) ...
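To make the import step more concrete, here is a minimal sketch of the per-record work a mapper in such a job might do: turn one CSV line into a row key plus column values, ready to be written to HBase. The record layout is not shown in this excerpt, so the three-field layout (sensor ID, timestamp in milliseconds, reading), the `SensorCsvRecord` class, the `#` separator in the row key, and the `reading` column name are all assumptions made for illustration. They are not the book's actual schema. The sketch uses plain Java only and leaves out the HBase client calls.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: parses one CSV line of (assumed) sensor data.
// Assumed layout, NOT taken from the book: sensorId,timestampMillis,reading
final class SensorCsvRecord {
    public final String rowKey;
    public final Map<String, String> columns;

    private SensorCsvRecord(String rowKey, Map<String, String> columns) {
        this.rowKey = rowKey;
        this.columns = columns;
    }

    // Converts a CSV line into a row key and a map of column name to value.
    public static SensorCsvRecord parse(String line) {
        // limit -1 keeps trailing empty fields so the field count check is reliable
        String[] fields = line.split(",", -1);
        if (fields.length != 3) {
            throw new IllegalArgumentException(
                "expected 3 fields, got " + fields.length + ": " + line);
        }
        String sensorId = fields[0].trim();
        // Parse the timestamp to reject malformed records early
        long timestamp = Long.parseLong(fields[1].trim());
        String reading = fields[2].trim();

        // Assumed row key design: sensor ID and timestamp joined by '#'
        String rowKey = sensorId + "#" + timestamp;

        Map<String, String> columns = new LinkedHashMap<>();
        columns.put("reading", reading);
        return new SensorCsvRecord(rowKey, columns);
    }

    // HBase stores row keys as bytes, so a mapper would pass these on.
    public byte[] rowKeyBytes() {
        return rowKey.getBytes(StandardCharsets.UTF_8);
    }
}
```

In a real job, the mapper would call something like `SensorCsvRecord.parse(line)` for each input line and then use the row key bytes and column values to build the write to the HBase table. Keeping the parsing separate from the HBase calls makes it easy to unit test without a running cluster.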
