Chapter 10. Monitoring: Houston, we have a problem

 

This chapter covers

  • Basics of writing Nagios health checks
  • Using both AMQP and the REST API to monitor Rabbit internals
  • Verifying that Rabbit is available and responding
  • Watching queue levels for early detection of consumer problems
  • Checking for undesirable configuration changes in the messaging fabric

 

Your RabbitMQ server is up and running and your snazzy dog walking app is bringing in thousands of orders nationwide. Everything seems to be going great when you suddenly get the call: customers are getting errors from your web app and the flow of orders has stopped completely. The RabbitMQ server has died, and to make matters worse it appears that it’s been down for hours. If only you’d ...

Get RabbitMQ in Action now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.