Chapter 6. Hive Extensibility Features

In previous chapters, we learned about different ways to load data in Hive along with recently added updates and deletes in Hive.

In this chapter, we will cover the following recipes in detail:

  • Serialization and deserialization formats and data types
  • Exploring views
  • Exploring indexes
  • Hive partitioning
  • Creating buckets in Hive
  • Analytics functions in Hive
  • Windowing in Hive
  • File formats

Introduction

In this chapter, we are going to cover some of the key features of Hive including partitions, bucketing, windowing, and analytics functions. In the practical demonstration, we have used the following sales data set:

Schema : id, fname, state, zip, ip, pid

Get Apache Hive Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.