4.1. Linux on pSeries RAS

The IBM pSeries hardware has been known for its RAS capabilities due to IBM’s knowledge and experience in developing mainframes and mission-critical servers. Much of the RAS design has been developed to analyze failures within the Central Electronic Complex (CEC) to either eliminate the errors or to contain and reduce them to avoid bringing the entire server down. Some of the RAS features that you see available for Linux on pSeries are:

  • Persistent deallocation for memory and processor during boot-time

  • Automatic First Failure Data Capture and diagnostics capability

  • ECC and chipkill correction in the real memory

  • Fault tolerance with N+1 redundancy of power and cooling

  • Dual line cords

  • Predictive failure analysis and diagnostics ...

Get Deploying Linux on IBM eServer pSeries Clusters now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.