11.2. High Availability and Fault Tolerance

High availability (HA) is a measure of the percentage of the time that a system is running. Saying a system has HA implies that a system is up and running nearly all the time. Fault tolerance (FT) means that a system is capable of surviving the loss of one or more critical components while still retaining functionality. The two terms often get confused, but they have entirely different meanings.

Take the Hubble Space Telescope as an example. Hubble shipped with six gyroscopes and needed three for normal observations. As gyroscopes failed, NASA found a way to maintain functionality by taking some of the operational gyros and moving them to stabilize the telescope to keep their observations running. ...

Get Microsoft® Windows Server® 2008: Implementation and Administration now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.