Chapter 8. A Relational View on Data with Hive

MapReduce is a powerful paradigm which enables complex data processing that can reveal valuable insights. However, it does require a different mindset and some training and experience on the model of breaking processing analytics into a series of map and reduce steps. There are several products that are built atop Hadoop to provide higher-level or more familiar views on the data held within HDFS. This chapter will introduce one of the most popular of these tools, Hive .

In this chapter, we will cover:

  • What Hive is and why you may want to use it
  • How to install and configure Hive
  • Using Hive to perform SQL-like analysis of the UFO data set
  • How Hive can approximate common features of a relational database ...

Get Hadoop: Data Processing and Modelling now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.