Summary

In this chapter, we covered various interceptors shipped with Flume, including:

  • Timestamp: These are used to add a timestamp header, possibly overwriting an existing one.
  • Host: This is used to add the Flume agent hostname or IP as a header in the event.
  • Static: This is used to add static String headers.
  • Regular expression filtering: This is used to include or exclude events based on a matched regular expression.
  • Regular expression extractor: This is used to create headers from matched regular expressions. It's useful for routing with Channel Selectors.
  • Morphline: This is used to delegate transformation to a Morphline command chain.
  • Custom: This is used to create any custom transformations you need that you can't find elsewhere.

We also covered ...

Get Apache Flume: Distributed Log Collection for Hadoop - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.