Understanding Lambda architecture

The Lambda architectural pattern attempts to combine the best of both worlds—batch processing and stream processing. This pattern consists of several layers: Batch Layer (ingests and processes data on persistent storage such as HDFS and S3), Speed Layer (ingests and processes streaming data, that has not been processed by the batch layer yet), and the Serving Layer that can combine outputs from the batch and speed layers to present merged results. This is a very popular architecture in Spark-based cloud environments because it can support both batch and speed layer implementations with minimal code differences between the two.

The following figure depicts the Lambda architecture as a combination of the batch ...

Get Learning AWS - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.