Jobtracker high availability

In MRv1, if the jobtracker fails, all running jobs and tasks are lost. Also, the jobtracker service along with its jobs need to be manually restarted. To avoid these issues, the jobtracker needs to be configured for high availability. CDH5 comes inbuilt with the jobtracker HA package.

Configuring jobtracker high availability

Use the following steps from user hduser to configure and HA jobtracker for your cluster:

  1. Stop all the tasktrackers by executing the following command on all the nodes that host tasktrackers:
    $ sudo service hadoop-0.20-mapreduce-tasktracker stop
    
  2. Stop the jobtracker by executing the following command on the node that hosts the jobtracker:
    $ sudo service hadoop-0.20-mapreduce-jobtracker stop
    
  3. Remove ...

Get Cloudera Administration Handbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.