Selecting services

Now, we need to select the list of applications/services that we want to install on the three servers we have selected.

At the time of writing, Ambari supports the following services:

Application/Service

Application Description

HDFS

Hadoop Distributed File System

YARN + MapReduce2

Next generation Map Reduce framework

Tez

Hadoop query processing framework built on top of YARN

Hive

Data warehouse system for ad hoc queries

HBase

Non-relational distributed database

Pig

Scripting platform to analyze datasets in HDFS

Sqoop

Tool to transfer data between Hadoop and RDBMS

Oozie

Workflow co-ordination for Hadoop jobs with a web UI

ZooKeeper

Distributed system coordination ...

Get Modern Big Data Processing with Hadoop now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.