Backing up data using AWS Data Pipeline

One of the most widely used use cases for AWS Data Pipeline is its ability to synchronize and schedule backup jobs. You can use Data Pipeline to take backups of data stored within EC2 instances, EBS volumes, databases and even S3 buckets. In this section, we will walk through a simple, parameterized pipeline definition using which you can effectively schedule and perform backups of files stored within an Amazon S3 bucket.

First up, let's have a look at the pipeline definition file itself:

You can find the complete copy of code at https://github.com/yoyoclouds/Administering-AWS-Volume2.

To start with, we once again provide a list of objects that describe the pipeline components starting with a pipeline ...

Get AWS Administration : The AWS Definitive Guide to core AWS service offerings and implementing AWS in your own environment now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.