Time synchronization

As Cassandra's log-based storage is timestamp-driven, it is absolutely critical that all nodes in a cluster have synchronized clocks. To this end, instances running Cassandra should be using NTPD or another time synchronization application to keep all system clocks in a cluster in sync.

Issues related to time synchronization discrepancies can manifest themselves as obsolete data or deleted data ghosting itself.

Consider the following current times from instances in a three-node cluster:

Figure 4.1: A three-node cluster, where one node is behind by about 13 seconds

Let's say I run the following CQL to create a simple table: ...

Get Mastering Apache Cassandra 3.x - Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.