Summary

Cassandra is a highly available, fault-tolerant, distributed database. However, sometimes data can get corrupted due to client application faults and other reasons. To handle such situations, Cassandra provides tools to back up and restore data to the last known state. Using the nodetool snapshot command, we can manually take snapshots of the data of a node. Restoring the snapshot from a node might not be consistents but restoring from all nodes' data will eventually become consistent. An incremental backup configuration allows the taking of automatic snapshots of node data. While loading SSTables in the bulk sstableloader is a great utility, using the sstableloader SSTables can be uploaded to different clusters with different ranges and ...

Get Apache Cassandra Essentials now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.