Cold backup

Cold backup becomes important for enterprises as their data ages. Although Hadoop is designed to scale to very large volumes of data, not all of that data needs to remain available for processing.

Older data must sometimes still be preserved for auditing or for historical reasons. In such cases, we can create a dedicated Hadoop cluster that runs only the HDFS (file system) component and periodically sync all the data into this cluster.
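One way to picture the periodic sync is a small wrapper around DistCp, Hadoop's bundled tool for copying data between clusters. The following is only a minimal sketch, not the book's method: the NameNode URIs and function names are hypothetical, and it assumes the `hadoop` command is on the PATH of the machine that runs it. By default it only prints the command, so it can be tried without a cluster.

```python
import subprocess
from typing import List


def build_distcp_command(source_uri: str, backup_uri: str) -> List[str]:
    """Build a DistCp command that copies new or changed files to the backup cluster."""
    # -update copies only files that are missing or differ at the destination,
    # which keeps repeated runs cheap.
    return ["hadoop", "distcp", "-update", source_uri, backup_uri]


def sync_to_cold_backup(source_uri: str, backup_uri: str, dry_run: bool = True) -> int:
    """Print the sync command (dry run) or execute it and return its exit code."""
    cmd = build_distcp_command(source_uri, backup_uri)
    if dry_run:
        print(" ".join(cmd))
        return 0
    # Requires a working Hadoop client configured for both clusters.
    return subprocess.run(cmd, check=False).returncode


if __name__ == "__main__":
    # Hypothetical NameNode addresses for the active and cold backup clusters.
    sync_to_cold_backup(
        "hdfs://active-nn:8020/data",
        "hdfs://coldbackup-nn:8020/data",
    )
```

A scheduler such as cron could invoke a script like this at whatever interval the retention policy calls for. How often to run it is a design choice that depends on how much recent data the business can afford to lose from the cold copy.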

The design of this system is similar to that of the data-redundant Hadoop cluster.
