Time for action – a more selective import

Let's see how this works by performing an import that is limited by a conditional expression.

  1. Delete any existing employee import directory:
    $ hadoop fs -rmr employees
    

    You will receive the following response:

    Deleted hdfs://head:9000/user/hadoop/employees
    
  2. Import selected columns with a predicate:
    $ sqoop import --connect jdbc:mysql://10.0.0.100/hadooptest \
        --username hadoopuser -P \
        --table employees --columns first_name,salary \
        --where "salary > 45000" \
        --hive-import --hive-table salary
    

    You will receive the following response:

    12/05/23 15:02:03 INFO hive.HiveImport: Hive import complete.
    
  3. Examine the created table:
    $ hive -e "describe salary"
    

    You will receive the following response:

    OK
    first_name  string  
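
To make the effect of the options in step 2 concrete, the following minimal sketch assembles the kind of SELECT statement that --table, --columns, and --where together describe. The build_query function is a hypothetical helper written only for this illustration; it is not part of Sqoop, and the statements Sqoop actually sends to the database also carry per-mapper split conditions, so treat the output as an approximation.

```shell
#!/bin/sh
# Illustration only: build_query is a hypothetical helper, not a Sqoop command.
# It approximates the SELECT implied by --table, --columns, and --where.
build_query() {
    table=$1      # value passed to --table
    columns=$2    # value passed to --columns
    where=$3      # value passed to --where
    printf 'SELECT %s FROM %s WHERE %s\n' "$columns" "$table" "$where"
}

# Same values as the import in step 2.
build_query employees "first_name, salary" "salary > 45000"
```

Running the same statement against the source database (for example, from the mysql client) is one way to predict which rows and columns should appear in the Hive table after the import.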
