Running our first Oozie job

We will start with a very simple example. In this chapter, our use case is to delete a given folder on HDFS via Oozie. In our case study, we get data daily in one folder in HDFS, but we want to delete the previous day's data. We want to keep just latest version in our system. Let's solve our business problem:

  1. Log in to Hue and go to Workflows | Editor.
  2. In the top row of editor, there are various types of actions. Select the Hadoop Fs action.

    Tip

    Take some time with your mouse over and read the names of various types of actions that Oozie can run.

  3. Drag the Hadoop Fs action to the editor as shown in the next screenshot.
  4. Give a meaningful name to this action, for example, my_delete_folder_action.
  5. Give the path of the folder that ...

Get Apache Oozie Essentials now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.