If you measure all four golden signals and page a human when one signal is problematic (or, in the case of saturation, nearly problematic), your service will be at least decently covered by monitoring.


Cover of Site Reliability Engineering


Latency, Traffic, Errors, Saturation
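
The paging rule in the quote can be sketched in a few lines of Python. This is only an illustration, not anything taken from the book: the `Signals` fields, the threshold constants, and the `signals_to_page_on` helper are hypothetical names, and the numeric limits are made up. The one detail it tries to capture is that saturation pages early, when it is nearly problematic, while the other three signals page once they actually cross their limit.

```python
from dataclasses import dataclass


@dataclass
class Signals:
    latency_p99_ms: float  # latency: time taken to serve a request
    requests_per_s: float  # traffic: demand placed on the service
    error_rate: float      # errors: fraction of requests that fail (0.0 to 1.0)
    saturation: float      # saturation: fraction of capacity in use (0.0 to 1.0)


# Hypothetical limits, chosen only for illustration.
LATENCY_LIMIT_MS = 500.0
TRAFFIC_LIMIT_RPS = 10_000.0
ERROR_RATE_LIMIT = 0.01
SATURATION_LIMIT = 1.0
SATURATION_MARGIN = 0.8  # page at 80% of the limit, i.e. "nearly problematic"


def signals_to_page_on(s: Signals) -> list[str]:
    """Return the names of the signals that should page a human."""
    reasons = []
    if s.latency_p99_ms > LATENCY_LIMIT_MS:
        reasons.append("latency")
    if s.requests_per_s > TRAFFIC_LIMIT_RPS:
        reasons.append("traffic")
    if s.error_rate > ERROR_RATE_LIMIT:
        reasons.append("errors")
    # Saturation is the exception: page before the limit is reached.
    if s.saturation >= SATURATION_LIMIT * SATURATION_MARGIN:
        reasons.append("saturation")
    return reasons
```

With these made-up limits, a service at 85% of capacity would page on saturation alone even though latency, traffic, and errors all look healthy.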