Chapter 9. Deploying Storm on Hadoop for Advertising Analysis

In the previous two chapters, we saw how we might integrate Storm with a real-time analytics system. We then extended that implementation, supporting the real-time system with batch processing. In this chapter, we will explore the reverse.

We will examine a batch processing system that computes the effectiveness of an advertising campaign. We will take the system that was built on Hadoop and convert it into a real-time processing system.

To do this, we will leverage the Storm-YARN project out of Yahoo! The Storm-YARN project allows users to leverage YARN to deploy and run Storm clusters. The running of Storm on Hadoop allows enterprises to consolidate operations and utilize the same infrastructure ...

Get Storm Blueprints: Patterns for Distributed Real-time Computation now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.