Monitoring the node status

As you can guess, perhaps the first thing that you always need to check is the status of each node—whether they are online or offline. Otherwise, there is little point in proceeding with further availability and performance analysis.

If you have a network management system (such as Zabbix or Nagios) server, you can easily monitor the status of your cluster members and receive alerts when they are unreachable. If not, you must come up with a supplementary solution of your own (which may not be as effective or errorproof) that you can use to detect when a node has gone offline.

One such solution is a simple bash script (we will name it pingreport.sh, save it inside /root/scripts, and make it executable with chmod +x /root/scripts/pingreport.sh ...

Get CentOS High Performance now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.