Skip to content

Host Monitoring

Host Monitoring in ClusterControl is designed to provide comprehensive insights into the health, performance, and availability of the hosts (servers) within the database cluster.

It can monitor status of the nodes and key metrics such as CPU usage, memory consumption, disk I/O, network activity, and many more.

Cluster Node Summary

The Cluster Node Summary is part of the Cluster Summary where is available after we login into ClusterControl. The Cluster Node Summary on the home page provides a quick overview of the current status of each node in database cluster.

Here is what it typically includes:

  • Node Name/Hostname – Identifies the server/node.
  • Status – Indicates if the node is Operational, Failed, Offline, Shutdown, Recovering or Unreachable/Unknown.
  • Role – Shows the role of the node (e.g., Master, Slave, Primary, Secondary, depending on the database type).
  • Type – Describe of the database engine running on the host.
  • Cluster name – Identifies the cluster name to which the node belongs.

To get the cluster node summary:

  1. Login into the ClusterControl UI.

  2. In the Home section and you will see the Nodes group which appears in the Status Overview and Nodes itself.

  3. The Status Overview inside the Nodes group shows the summary of the nodes. The green color indicates that the nodes status in operational while the red indicates the nodes is offline. If you rollover to one of the node, it will show more information about the node such as Hostname, Role, Type, Cluster Name, and Status.

  1. Check status of the nodes and there will be information about cluster name, hostname, port, and status of the nodes.

    s9s node --list --long
    
  2. Filter the nodes specific by clusterID.

    s9s node --cluster-id=<clusterID> --list --long
    

The clusterID (CID) is the unique identifier for each cluster.

Node Health Status

The node health status represents the current operational state of each individual server (or node) within a database cluster. It is a key part of monitoring and helps you quickly identify if a node is running as expected or experiencing issues.

CPU Utilization

  1. Login into the ClusterControl UI.

  2. Go to specific cluster, and choose the Dashboard.

  3. Choose the System Overview, and you will see information about CPU Usage.

  4. The CPU Usage metrics describe each status that is running in the nodes such as : Idle, User, System, Iowait, Irq, Streal.

Memory Utilization

  1. Login into the ClusterControl UI.

  2. Go to specific cluster, and choose the Dashboard.

  3. Choose the System Overview, and you will see information about memory Usage.

  4. The Memory Usage metrics show the Free Memory and Used Memory statistics.

Disk IO Utilization

  1. Login into the ClusterControl UI.

  2. Go to specific cluster, and choose the Dashboard.

  3. Choose the System Overview, and you will see information about disk I/O utilization.

  4. The Disk IO Util metrics show percentage usage on each storage and mount points.

Node Menu

The Node Menu in ClusterControl is a context-specific menu accessible from the ClusterControl UI, typically by clicking on a specific database node listed in the cluster view or in the Nodes menu on the left side of the interface. The Node Menu serves as a key interface for accessing monitoring capabilities related to the status of services on each host.

It also provides a set of actions and detailed options for managing and monitoring the selected node.

  1. Login into the ClusterControl UI.

  2. Go to the Nodes menu in the left side of ClusterControl UI.

  3. In the header, it appears the total of nodes in several categories (Operational, Failed, Offline, Shutdown, Recovering, Unkown State, All).

  4. Beside the summary header, it appears list of the host with information such as: Type, Status, Role, version, Cluster, Last seen, and last one is Action menu. The actions menu provide some menu for managing the selected nodes (for example: Restart Nodes, Reboot Host, Stop Node, Remove Nodes, Rebuild Replication, Change Replication)