Video description
Apache Kudu is an entirely new storage manager for the Hadoop ecosystem. It addresses many of the most difficult architectural issues in Big Data, including the Hadoop "storage gap" problem common when building near real-time analytical applications. This vexing issue has prevented many applications from transitioning to Hadoop-based architectures.
In this course, you'll learn why Kudu exists, when to use it, the key concepts of Kudu's design, and how it enables simple, real-time analytics without the need for separate batch and speed layers. Designed for developers, architects, and engineers with some limited experience using Hadoop ecosystem components like HDFS, Hive, Spark, or Impala, the course describes how to architect Kudu applications that are low-risk, fast, scalable, and reliable.
- Explore skills critical to the "big data" toolbox of any developer, architect, or engineer
- Learn how Kudu solves the Hadoop storage gap problem
- Understand Kudu's design goals, strengths, and weaknesses
- Discover how Kudu reads and writes data
- Master the concepts that make Kudu-based applications low-risk, scalable, and fast
Publisher resources
Table of contents
- Welcome To The Course 00:00:25
- About The Author 00:01:27
- HDFS History 00:06:04
- HBase History 00:03:08
- Kudu And The Changing Hardware Landscape 00:02:02
- Why Kudu 00:07:10
- General Kudu Architecture 00:06:16
- Kudu Read And Write Operations 00:08:56
- Tables And Schemas In Kudu 00:07:07
- Kudu Partitioning 00:06:57
Product information
- Title: Introducing Kudu and Kudu Architecture
- Author(s):
- Release date: February 2017
- Publisher(s): Infinite Skills
- ISBN: 9781491985670
You might also like
video
Building a Near Real-Time Analytical Application with Kudu
Building near real-time analytical applications that combine real-time data inserts, updates, and fast analytics is almost …
video
Basic Kudu Installation, API Usage, and SQL Integration
Apache Kudu is a required skill in the Big Data world because it addresses problems that …
video
Using Kudu with Apache Spark and Apache Flume
Apache Kudu, the breakthrough storage technology, is often used in conjunction with other Hadoop ecosystem frameworks …
book
Getting Started with Kudu
Fast data ingestion, serving, and analytics in the Hadoop ecosystem have forced developers and architects to …