Prevention of human errors

In both High Availability and Disaster Recovery, a significant percentage of the issues in software systems are caused by a human error. This does not refer to incorrect input or unexpected usage of the system, but rather the mistakes made in system code, setup, or configuration.

The human error horror stories account for much of IT folklore, and include incidents such as the following:

  • Unintentionally deleting entire data stores holding business critical data
  • Deploying test configuration to production systems
  • Causing a vast hardware or network outage with a typo
  • Making the entire company domain unusable by failing to renew an SSL certificate.

These errors usually occur during an introduction of a change (such ...

Get Serverless computing in Azure with .NET now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.