Writing our first Hadoop MapReduce job

In this recipe, we will write our first MapReduce job using the Hadoop MapReduce API and run it with the mongo-hadoop connector, which reads the input data from MongoDB. Refer to the Executing MapReduce in Mongo using a Java client recipe in Chapter 3, Programming Language Drivers, for the Java client implementation of MapReduce, the test data creation steps, and the problem statement.

Getting ready

Refer to the previous recipe, Executing our first sample MapReduce job using the mongo-hadoop connector, to set up the mongo-hadoop connector. The prerequisites of that recipe and of the Executing MapReduce in Mongo using a Java client recipe from Chapter 3, Programming Language Drivers, are all that we need for this recipe. This is a ...
