Contents
Chapter 1: Big Data and the Hadoop Ecosystem
Developing Enterprise Applications with Hadoop
Chapter 2: Storing Data in Hadoop
Combining HDFS and HBase for Effective Data Storage
Managing Metadata with HCatalog
Choosing an Appropriate Hadoop Data Organization for Your Applications
Chapter 3: Processing Your Data with MapReduce
Your First MapReduce Application
Designing MapReduce Implementations
Chapter 4: Customizing MapReduce Execution
Controlling MapReduce Execution with InputFormat
Reading Data Your Way with Custom RecordReaders
Organizing Output Data with Custom Output Formats
Writing Data Your Way with Custom RecordWriters
Optimizing Your MapReduce Execution with a Combiner
Controlling Reducer Execution with Partitioners
Using Non-Java Code with Hadoop
Chapter 5: Building Reliable MapReduce Apps
Unit Testing MapReduce Applications
Local Application Testing with Eclipse
Using Logging for Hadoop Testing
Reporting Metrics with Job Counters
Defensive Programming in MapReduce
Chapter 6: Automating Data Processing with Oozie
Oozie Parameterization with Expression Language
Get Professional Hadoop Solutions now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.