Chapter 6. Data Import/Export Using Sqoop and Flume

This chapter covers the following topics:

  • Importing data from RDBMS to HDFS using Sqoop
  • Exporting data from HDFS to RDBMS
  • Using the query operator in Sqoop import
  • Importing data using Sqoop in compressed format
  • Performing atomic export using Sqoop
  • Importing data into Hive tables using Sqoop
  • Importing data into HDFS from Mainframes
  • Incremental import using Sqoop
  • Creating and executing a Sqoop job
  • Importing data from RDBMS to HBase using Sqoop
  • Importing Twitter data into HDFS using Flume
  • Importing data from Kafka into HDFS using Flume
  • Importing web logs data into HDFS using Flume

Introduction

In the previous chapter, we talked about advanced analytics options using Apache Hive. In this chapter, we are going to talk ...
