Video description
In this Introduction to the Hadoop Technology Stack training course, expert author Justin Watkins will teach you about the concepts and benefits of Apache Hadoop, and how it can help you meet your business goals. This course is designed for the absolute beginner, meaning no previous experience with the Hadoop Technology Stack is required.
You will start by learning about Hadoop, including HDFS architecture, parallel performance, and YARN. From there, Justin will teach you about options for data input, including Sqoop, Flume, and other important tools. Finally, this video tutorial also covers Hadoop tools, such as Pig, Hive, HCatalog, and Apache Storm.
Once you have completed this computer based training course, you will have gained a solid understanding of the Hadoop Technology Stack and how it can help you meet your business goals.
Publisher resources
Table of contents
-
Introduction
- Introduction And Course Overview 00:00:56
- About The Author 00:00:48
- Getting Started With A Hadoop Installation 00:01:32
-
What Is Hadoop?
- What Is Hadoop? 00:04:56
- What Is HDFS? - Scalable Storage 00:02:02
- Understanding Block Storage 00:00:56
- Block Replication And Resilience 00:02:50
- HDFS Architecture - The Name Node And The Data Nodes 00:02:56
- Parallel Performance 00:01:12
- What Is Yarn? - Scalable Compute 00:04:30
- Yarn: Plug-In Processing Engines 00:02:01
- Overview Of MapReduce 00:06:13
- Using Different Languages 00:02:34
-
Options For Data Input
- Importing Data 00:02:59
- The Hadoop Client 00:02:26
- Overview Of Sqoop 00:02:33
- Overview Of Flume 00:02:07
- Other Import Tools 00:02:55
-
Hadoop Tools
- What Is Pig? 00:03:40
- What Is Hive? 00:04:35
- Comparing Hive To SQL 00:02:34
- Hive Architecture 00:02:25
- What Is HCatalog? 00:01:37
- Hive Interfaces 00:02:32
- Apache Storm 00:02:00
- Apache Spark 00:05:53
- Hadoop Security 00:01:46
- Overview Of Oozie 00:01:43
- Mahout 00:01:58
- HBase And Other Data Stores: Hbase, Accumulo, Etc. 00:05:02
- Apache Kafka 00:01:21
- Cluster Management 00:02:32
-
Conclusion
- Distributions And Where To Go From Here 00:03:42
- Conclusion 00:00:29
Product information
- Title: Introduction to the Hadoop Technology Stack
- Author(s):
- Release date: April 2016
- Publisher(s): Infinite Skills
- ISBN: 9781771376150
You might also like
book
Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data Computing in the Apache Hadoop 2 Ecosystem
Get Started Fast with Apache Hadoop ® 2, YARN, and Today’s Hadoop Ecosystem With Hadoop 2.x …
video
Introduction to Apache HBase Operations
HBase master Jonathan Hsieh provides a complete overview of Apache HBase operations in this course designed …
video
Learning Apache Hadoop
In this Introduction to Hadoop training course, expert author Rich Morrow will teach you the tools …
video
Building Apache HBase Applications
In this Building Apache HBase Applications training course, expert author Jonathan Hsieh will teach you how …