Partitioning a managed table

Managed tables can be partitioned using the PARTITIONED BY clause. In a managed table, if you delete a table, then the data of that table will also get deleted. Similarly, if you delete a partition, then the data of that partition will also get deleted.

How to do it…

Let's take an example of the customer table data and imagine that we have the data of different customers of different country. Now if we don't enable any partitioning, then by default, all data will go into one directory. Let's assume that data size is around 1 TB. Now if we query for customers belonging to India, then this query will be executed on entire data of 1 TB size and this query will take more time. By enabling partitioning this query, execution ...

Get Apache Hive Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.