Storm-YARN

Apache Storm is an open source distributed real-time computation system from Twitter.

Storm helps in processing unbounded streams of data in a reliable manner. Storm can be used with any programming language. Some of the most common use cases of Storm are real-time analytics, real-time machine learning, continuous computation, ETL, and many more.

Storm-YARN is a project from Yahoo that enables the Storm cluster to be deployed and managed by YARN. Earlier, a separate cluster was needed for Hadoop and Storm.

One major benefit that comes with this integration is elasticity. Batch processing (Hadoop MapReduce) is usually done on the basis of need, and real-time processing (Storm) is an ongoing processing. When the Hadoop cluster is idle, you ...

Get YARN Essentials now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.