Summary

In this chapter, we started with a basic understanding of a customer 360-degree view and how it can be useful. We explored tools such as Hive and Sqoop, and worked with MySQL and HDFS to extract, load, and transform data. We used structured data, web logs, and tweets to build our customer 360-degree view, and performed batch processing on the various datasets that contribute to it.

We carried out the extract, load, and transform (ELT) tasks manually throughout this chapter, which is not practical in a real-life situation. With the help of schedulers and workflow tools, we can automate ELT tasks and build a data pipeline.
