Chapter 14

Integrating Hadoop with Relational Databases Using Sqoop

In This Chapter

Introducing Sqoop

Looking at the nuts and bolts of Sqoop

Importing data with Sqoop

Exporting data with Sqoop

Customizing your Sqoop input and output formats

Looking ahead to Sqoop 2.0

Performing analytics on large, diverse data sets is a natural fit for Apache Hadoop. The whole point of the Hadoop File System (HDFS) is that it excels at providing a massively scalable, diverse data store that, when combined with the many analytic tools available on the Hadoop platform — from Map Reduce to Mahout and others — gives you a lean, mean, analytics machine when you hitch your data store wagon to Apache Hadoop.

This rosy picture presents a slight problem, however: It turns out that most of the world’s structured data is already stored in relational database management systems (RDBMSs), and it’s common practice ...

Get Hadoop For Dummies now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Hadoop For Dummies by

Integrating Hadoop with Relational Databases Using Sqoop

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly