Chapter 4. Data Analysis Using Hive, Pig, and Hbase

  • Storing and processing Hive data in a sequential file format
  • Storing and processing Hive data in the RC file format
  • Storing and processing Hive data in the ORC file format
  • Storing and processing Hive data in the Parquet file format
  • Performing FILTER By queries in Pig
  • Performing Group By queries in Pig
  • Performing Order By queries in Pig
  • Performing JOINS in Pig
  • Writing a user-defined function in Pig
  • Analyzing web log data using Pig
  • Performing the Hbase operation in CLI
  • Performing Hbase operations in Java
  • Executing the MapReduce programming Hbase Table

Introduction

In the previous chapter, we discussed how to write MapReduce programs in various ways in order to analyze data. Earlier, MapReduce was the only means ...

Get Hadoop: Data Processing and Modelling now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.