Part 5. Taming the elephant

Part 5 is called “Taming the elephant,” and it’s dedicated to examining languages, tools, and processes that make it easier to work with MapReduce. Why is elephant in the title? Hadoop was created by Doug Cutting, who named the framework after his son’s yellow stuffed elephant.

Chapter 10 dives into Hive, a SQL-like domain-specific language that’s one of the most accessible interfaces when you work with MapReduce.

Pig, covered in chapter 11, offers a good compromise as an abstracted Map-Reduce language that can bump down into Java code when you need a lower level of access to Hadoop.

Chapter 12 targets programmers who want to integrate MapReduce with existing Java applications. You’ll learn about Crunch and Cascading, ...

Get Hadoop in Practice now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.