Orchestrating Data using AWS Data Pipeline

In the previous chapter, we explored the AWS analytics suite of services by deep diving into Amazon EMR and Amazon Redshift services.

In this chapter, we will be continuing the trend and learning about an extremely versatile and powerful data orchestration and transformation service called AWS Data Pipeline.

Let's have a quick look at the various topics that we will be covering in this chapter:

  • Introducing AWS Data Pipeline along with a quick look at some of its concepts and terminologies
  • Getting started with Data Pipeline using a simple Hello World example
  • Working with the Data Pipeline definition file
  • Executing scripts and commands on remote EC2 instances using a data pipeline
  • Backing up data ...

Get AWS Administration : The AWS Definitive Guide to core AWS service offerings and implementing AWS in your own environment now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.