O'Reilly logo

Learning Storm by Anand Nalya, Ankit Jain

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Installing Apache Hadoop

Now that we have seen both the storage and processing parts of a Hadoop cluster, let's get started with the installation of Hadoop. We will use Hadoop 2.2.0 in this chapter.

Note

Hadoop 2.2.0 is not compatible with Hadoop 1.X versions.

We will be setting up a cluster on a single node. Before starting, please make sure that you have the following software installed on your system:

  • JDK 1.7: We need JDK to run Hadoop as it is written in Java
  • ssh-keygen: This is used to generate SSH keys that are used to set password-less SSH required for Hadoop

If you don't have ssh-keygen, install it with the following command:

yum install openssh-clients

Next, we will need to set up password-less SSH on this machine as it is required for Hadoop. ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required