Types of Outages

After you've spent years as a system administrator and are looking back at all the service outages you've dealt with in your organization during that time, you'll probably discover that they can be divided into two groups: those that are preventable, and those that aren't. Preventable outages are ones that are either caused by human error or ones that you can see coming based on current monitoring data. Human error, although impossible to prevent, can be minimized with procedures that eliminate guessing on the part of an administrator. Creating procedures for routine tasks like removing a server from a rack or rebooting a production server can prevent the occasional human mishap.

Other outages can be prevented because you can ...

Get Unix® System Management Primer Plus now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.