Chapter 13 Fault Tolerance

One of the most significant ways in which servers differ from desktop computers has to do with the importance placed on fault tolerance. Fault tolerance means that a problem with software or hardware does not shut down the system, which minimizes the risk of data loss. To achieve fault tolerance, servers use technologies such as the following:

  • Error correction for memory
  • Self-monitoring of critical components such as memory, disk storage, and network connections
  • Redundant memory, disk arrays, power supplies, and network connections
  • Management of network hardware

Although many of these topics are covered in detail in other chapters, this chapter is designed to show you how these server technologies help you achieve ...

Get Upgrading and Repairing Servers now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.