Map/Reduce pattern

Whenever there is a need to migrate from Hadoop MapReduce or to develop new Map/Reduce kind of data processing applications, Cassandra can be used in conjunction with Spark and a preferred programming language such as Scala, Java, Python, or R. Divide the data processing task into Mapper and Reducer kind of transformations and actions. Choose the appropriate Spark Transformations and Spark Actions to complete the task.

Motivations/solutions

Many of the day-to-day data processing needs commonly seen in applications can be divided into multiple smaller tasks and executed in parallel. The results can be combined together to produce the final result. A clustered data processing infrastructure in conjunction with a parallel and distributed ...

Get Cassandra Design Patterns - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.