Apache Storm

Apache Storm is one of the most popular open source stream-processing engines and provides real-time analytics on unbounded streams of data. It is a distributed framework that runs across multiple nodes, providing both fault tolerance and horizontal scalability. Another primary feature of Apache Storm is guaranteed event processing: every event that enters the system is processed, and none is lost. Storm applications can be written in any programming language of the developer's choice, which makes Storm an attractive option for low-latency analytics.
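To make the idea of guaranteed event processing concrete, here is a minimal sketch in plain Python. It is a toy model, not Storm's API: the names `process_at_least_once`, `flaky_handler`, and `max_attempts` are hypothetical and chosen only for illustration. It assumes the guarantee takes the at-least-once form usually attributed to Storm, in which an event that fails is replayed until it is acknowledged.

```python
from collections import deque


def process_at_least_once(events, handler, max_attempts=5):
    """Toy model of at-least-once delivery: replay an event until it is acked."""
    # Each pending entry carries the event and its attempt number.
    pending = deque((event, 1) for event in events)
    processed = []
    while pending:
        event, attempt = pending.popleft()
        try:
            handler(event, attempt)
            processed.append(event)  # "ack": the event is done
        except Exception:
            if attempt >= max_attempts:
                raise  # give up rather than silently drop the event
            # "fail": put the event back in the queue to be replayed
            pending.append((event, attempt + 1))
    return processed


def flaky_handler(event, attempt):
    # Hypothetical handler that fails the first attempt for even-numbered events.
    if event % 2 == 0 and attempt == 1:
        raise RuntimeError("transient failure")


result = process_at_least_once([1, 2, 3, 4], flaky_handler)
print(sorted(result))
```

In this model no event is lost, but replayed events finish out of order and a handler can see the same event more than once, so handlers in an at-least-once system generally need to tolerate duplicates.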

MapReduce provides the Map and Reduce function primitives to build batch applications. Similarly, Storm provides its own set of primitives to support ...
