Apache Flink

Apache Flink's documentation describes Flink in the following way: Flink is an open-source framework for distributed stream processing.

Flink provides accurate results and supports out-of-order or late-arriving datasets. It is stateful and fault-tolerant and can seamlessly recover from failures while maintaining an exactly-once application state. It performs at a large scale, running on thousands of nodes with very good throughput and latency characteristics.

The following are the features of Apache Flink:

  • Flink guarantees exactly-once semantics for stateful computations
  • Flink supports stream processing and windowing with event time semantics
  • Flink supports flexible windowing based on time, count, or sessions, in addition to ...

Get Modern Big Data Processing with Hadoop now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.