Designing the topology for our use case

For this example, we will again use Trident and build on the topology that we constructed in the previous chapter. The Trident topology is depicted as follows:

[Figure: the Trident topology for our use case]

The TwitterSpout periodically runs the search against the Twitter API and emits the tweets it returns into a Trident stream. The TweetSplitterFunction then parses each tweet and emits one tuple per word. The WordFrequencyFunction enriches each tuple with the count for that word from a random sample of English-language text. Finally, we let Druid consume that information to perform the aggregations over time. Druid partitions ...
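To make the data flow concrete, here is a minimal sketch of the split and enrich steps in plain Java. It deliberately does not use the Storm or Trident APIs, and it is not the book's implementation of TweetSplitterFunction or WordFrequencyFunction. The class name `TopologySketch`, the `WordTuple` type, the whitespace tokenization, and the tiny `BASELINE` table (standing in for the English-language sample) are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class TopologySketch {

    /** Hypothetical output tuple: a word plus its baseline count. */
    static final class WordTuple {
        final String word;
        final int baseline;

        WordTuple(String word, int baseline) {
            this.word = word;
            this.baseline = baseline;
        }
    }

    // Made-up baseline counts; a real sample of English text would be far larger.
    static final Map<String, Integer> BASELINE = new HashMap<>();
    static {
        BASELINE.put("the", 1000);
        BASELINE.put("storm", 3);
    }

    /** Mirrors the splitter step: one output per word in the tweet. */
    static List<String> splitTweet(String tweet) {
        List<String> words = new ArrayList<>();
        // Assumption: lowercase and split on whitespace; real parsing may differ.
        for (String token : tweet.toLowerCase().split("\\s+")) {
            if (!token.isEmpty()) {
                words.add(token);
            }
        }
        return words;
    }

    /** Mirrors the enrichment step: attach the baseline count, or 0 if unknown. */
    static WordTuple enrich(String word) {
        return new WordTuple(word, BASELINE.getOrDefault(word, 0));
    }

    /** Chains the two steps in the order the text describes. */
    static List<WordTuple> process(String tweet) {
        List<WordTuple> out = new ArrayList<>();
        for (String word : splitTweet(tweet)) {
            out.add(enrich(word));
        }
        return out;
    }

    public static void main(String[] args) {
        for (WordTuple t : process("The storm arrives")) {
            System.out.println(t.word + " -> " + t.baseline);
        }
    }
}
```

In the actual topology, each step would be a Trident function applied to the stream, and the enriched tuples would be handed to Druid for aggregation over time rather than collected in a list.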

