
Hadoop in Action

By Chuck Lam. Published by Manning Publications.

Chapter 4. Writing basic MapReduce programs

This chapter covers

  • Patent data as an example data set to process with Hadoop
  • Skeleton of a MapReduce program
  • Basic MapReduce programs to count statistics
  • Hadoop’s Streaming API for writing MapReduce programs using scripting languages
  • Combiner to improve performance

The MapReduce programming model is unlike most programming models you may have learned, and it takes time and practice to become familiar with it. To help you develop proficiency, we go through many example programs in the next couple of chapters. These examples illustrate various MapReduce programming techniques. By applying MapReduce in multiple ways, you’ll start to develop an intuition for, and a habit of, “MapReduce thinking.” The examples ...
