Samza on Mesos

Samza is an open source distributed stream processing framework originally developed at LinkedIn. It has the following features:

  • A simple API
  • State management
  • Fault tolerance
  • Durability
  • Scalability
  • Pluggability
  • Processor isolation

Important concepts of Samza

Some concepts in Samza are described in the following sections.

Streams

Samza processes streams of data—for example, website clickstreams, server logs, or any other event data. Messages can be added and read from a data stream. Multiple frameworks can access the same data stream and can partition the data based on the keys present in the message.

Streams

Jobs

A Samza job is the computation logic that ...

Get Mastering Mesos now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.