Chapter 3. Mastering Map Reduce Programs

In this chapter, we'll cover the following recipes:

  • Writing the Map Reduce program in Java to analyze web log data
  • Executing the Map Reduce program in a Hadoop cluster
  • Adding support for a new writable data type in Hadoop
  • Implementing a user-defined counter in a Map Reduce program
  • Map Reduce program to find the top X
  • Map Reduce program to find distinct values
  • Map Reduce program to partition data using a custom partitioner
  • Writing Map Reduce results to multiple output files
  • Performing Reduce side Joins using Map Reduce
  • Unit testing the Map Reduce code using MRUnit

Introduction

Hadoop consists of two important components: HDFS and Map Reduce. In the previous chapter, we talked about various recipes one can perform to ...

Get Hadoop Real-World Solutions Cookbook - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.