In both High Availability and Disaster Recovery, a significant percentage of the issues in software systems are caused by a human error. This does not refer to incorrect input or unexpected usage of the system, but rather the mistakes made in system code, setup, or configuration.
The human error horror stories account for much of IT folklore, and include incidents such as the following:
- Unintentionally deleting entire data stores holding business critical data
- Deploying test configuration to production systems
- Causing a vast hardware or network outage with a typo
- Making the entire company domain unusable by failing to renew an SSL certificate.
These errors usually occur during an introduction of a change (such ...