Summary

In this chapter, we covered how to aggregate data using the basic aggregation functions. We then introduced advanced aggregation with GROUPING SETS, ROLLUP, and CUBE, and showed how to filter aggregated results with HAVING. We also covered the analytic functions and windowing clauses. Finally, we introduced three ways of sampling data in Hive. After working through this chapter, you should be able to perform basic and advanced aggregations and sample data in Hive.

In the next chapter, we'll talk about performance considerations in Hive.
