Job engine

Hive supports running jobs on different engines. The choice of engine will also impact the overall performance. However, this is a bigger change compared to the other settings. Also, this change requires a service restart rather than temporarily make it effective in command-line session. Here is the syntax to set the engine as well as details for each of them:

SET hive.execution.engine=<engine>; -- <engine> = mr|tez|spark 
  • mr: This is the default engine, MapReduce. It was deprecated after Hive v2.0.0.
  • tez: Tez (http://tez.apache.org/) is an application framework built on Yarn that can execute complex Directed Acyclic Graphs (DAGs) for general data-processing tasks. Tez further splits map and reduce jobs into smaller tasks and ...

Get Apache Hive Essentials now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.