Hadoop Administration and Cluster Management

Video Description

Planning, deploying, managing, monitoring and performance-tuning your Hadoop cluster with Apache Hadoop

About This Video

  • Plan, deploy, manage and monitor your Hadoop Cluster with Apache Hadoop
  • Become an expert Hadoop administrator by learning to performance-tune your Hadoop cluster
  • Comprehensive tutorial to help you get a better understanding of troubleshooting, diagnostics and best practices in Hadoop administration

In Detail

Hadoop is one of the most popular Big Data solutions for reliable and scalable distributed computing and storage. Administering your Hadoop cluster well is the key to exploiting its rich features and getting the most out of it. This course focuses on planning, deploying and monitoring your cluster, and on keeping your organization's cluster infrastructure healthy and performing optimally. It will help you understand the basics of Hadoop administration, with comprehensive coverage of various administrative tasks using the popular Apache Hadoop distribution.

This video course starts by installing Apache Hadoop on a cluster and configuring the required services. You will also learn various cluster operations, such as validating the cluster and expanding and shrinking Hadoop services.

You will then move on to gain a better understanding of administrative tasks such as planning your cluster, monitoring, logging, security, troubleshooting and best practices. Techniques to keep your Hadoop clusters highly available and reliable are also covered in this course. By the end of this course, you will have a thorough understanding of the concepts related to Hadoop administration.

Table of Contents

  1. Chapter 1 : Installing Apache Hadoop
    1. The Course Overview 00:03:27
    2. Navigation of GitBash 00:10:35
    3. Navigation of Vagrant 00:09:10
    4. Navigation of VirtualBox 00:10:59
    5. Planning a Single Node Setup 00:14:50
    6. Install Apache Hadoop 00:14:43
  2. Chapter 2 : Apache Hadoop
    1. Apache Hadoop Overview 00:04:31
    2. Hadoop Distributed File System (HDFS) 00:11:06
    3. YARN Overview 00:11:23
    4. MapReduce 00:09:56
  3. Chapter 3 : Hadoop Cluster Installation
    1. Planning Hadoop Services Placement 00:09:36
    2. Planning ZooKeeper Placement 00:11:03
    3. Planning HDFS Service Placement 00:10:48
    4. Planning YARN 00:04:43
    5. Planning Spark Services 00:08:57
  4. Chapter 4 : Validating the Cluster
    1. HDFS Concepts 00:13:37
    2. HDFS Data Movement 00:07:38
    3. HDFS Admin Commands 00:07:42
    4. MapReduce Jobs 00:07:18
    5. Spark Jobs 00:12:12
  5. Chapter 5 : Manage Hadoop Services
    1. Start/Stop Services 00:07:09
    2. Manage Cluster Using Ambari 00:04:53
    3. Hadoop Upgrade 00:10:12
  6. Chapter 6 : Scaling Cluster
    1. Scaling Cluster – Part 1 00:09:27
    2. Scaling Cluster – Part 2 00:06:52
  7. Chapter 7 : High Availability
    1. HDFS Masters 00:06:15
    2. HA Configuration 00:18:48
    3. YARN Masters 00:06:20
  8. Chapter 8 : HDFS Security
    1. Linux ACLs 00:10:27
    2. HDFS ACLs Security – Part 1 00:03:22
    3. HDFS ACLs Security – Part 2 00:04:37
    4. Hadoop Users and Groups 00:03:47
  9. Chapter 9 : Monitoring and Logging
    1. NameNode UI 00:05:22
    2. Apache Hadoop Auditing 00:05:30
    3. Hadoop Metrics 00:06:45
    4. Hadoop Logs and Monitoring 00:06:45
  10. Chapter 10 : Cluster Troubleshooting
    1. Hadoop Troubleshooting – Part 1 00:07:11
    2. Hadoop Troubleshooting – Part 2 00:09:16