Keep RDC Operational If a QP Goes Down

General

The EEC is a common resource utilized by its client RD QPs to perform message transfers. It is therefore very important that errors related to a QP rather than the EEC must not result in crashing the EEC. Examples of errors that must not crash the EEC would be a Q_Key miscompare, an R_Key violation, or the receipt of an RNR Nak. The sections that follow provide a detailed discussion of the various types of errors that may be encountered by an EEC during a message transfer and how they are handled.

Error Handling for Requester-Detected Conditions

Receipt of an RNR Nak Causes Suspend Followed by Restart

When the EEC Send Logic receives an RNR Nak in response to a request packet, the EEC decrements ...

Get InfiniBand Network Architecture now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.