Kafka Streams API

Kafka Streams is a library for building streaming applications. Kafka Streams is a client library for building applications and microservices, where the input and output data are stored in Kafka Clusters. The Kafka Streams API transforms and enriches the data.

The following are the important features of the Kafka Streams API:

  • It is part of the open source Apache Kafka project.
  • It supports per record streams processing with a very low latency (milliseconds). There is no micro- batching concept in the Kafka Streams API. Every record that comes into the stream is processed on its own.
  • It supports stateless processing (filtering and mapping), stateful processing (joins and aggregations), and windowing operations (for example, ...

Get Modern Big Data Processing with Hadoop now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.