Implementing a Map Reduce action job using Oozie

In the previous recipe, we talked about how to use a Sqoop action to import data to HDFS. In this recipe, we are going to take a look at how to execute Map Reduce jobs using Oozie.

Getting ready

To perform this recipe, you should have a running Hadoop cluster as well as the latest version of Oozie installed on it.

How to do it...

Any Oozie job execution consists of two important things, workflow.xml and a properties file. The Workflow.xml file is where we need to specify the flow of execution. The following is an example of workflow.xml, which uses the MR action. Here, we also need to provide the jar file that contains the the map reduce code:

<workflow-app xmlns="uri:oozie:workflow:0.2" name="map-reduce-wf"> ...

Get Hadoop: Data Processing and Modelling now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.