Summary

This chapter explained the Data Intake layer in detail. We started by looking at the various zones in the Intake tier and the external sources from which data can be acquired for your use case. We then took a deep dive into the functions of the Source System Zone, the Transient Landing Zone, and the Raw Zone, and reviewed the best practices to consider while architecting the Data Intake tier.

In the subsequent sections, we looked at the various Big Data tools and technologies that can be used to acquire different types of data from various sources. The architectural guidance section then helped you decide which set of technologies to use for specific use cases.
