Sun Microsystems
Products & Services
 
Support & Training
 
 

Previous Previous     Contents     Index     Next Next
Chapter 11

Node Management Agent

This chapter describes how the Node Management Agent (NMA) can be used to monitor and manipulate a cluster. This chapter contains the following sections:

Introduction to the Node Management Agent

The NMA is compliant with the Java Management Extensions (JMX) and based on the Java Dynamic Management Kit. The NMA provides access to cluster statistics through the Simple Network Management Protocol (SNMP) or through JMX clients using HTTP. The NMA supports the Internet Engineering Task Force standard RFC 2573.

The NMA retrieves statistics about the cluster membership, the reliable transport mechanism, the network file system, and the monitoring of process daemons. The NMA can be used to initiate a switchover, change the maximum number of times that it attempts to restart a daemon, reset the current retry count for a daemon, and listen for certain cluster notifications.

The following figure shows a remote client accessing nodes in a cluster.

Figure 11-1 Remote Access to the Cluster

Diagram shows  a remote client accessing each node in the cluster, through the NMA.

Monitoring Statistics With the NMA

The NMA collects two types of statistics: node-level statistics and cluster-level statistics.

There is an NMA on each peer node that collects node-level statistics, that is, statistics for that node. Each NMA collects statistics about CGTP, CMM, and the Daemon Monitor. The NMA on each master-eligible node collects node-level statistics about Reliable NFS.

The NMA on the master node collects cluster-level statistics, that is, statistics about the cluster.

The following table describes the statistics that are collected by the NMA on each peer node and on the master node.

Table 11-1 Statistics Collected by the NMA

Type of Statistics

Collected From Which Nodes

Statistics

Node-level statistics

All peer nodes, including the master node

CGTP statistics, CMM statistics, and Daemon Monitor statistics

Reliable NFS statistics on the master-eligible nodes

Cluster-level statistics

Master node only

CMM statistics, list of peer nodes, cluster Reliable NFS statistics, and high-level statistics for each peer node

The following statistics can be collected by the NMA:

  • CMM statistics

    Some cluster membership statistics are collected on each node. Other cluster membership statistics can be collected by the NMA on the master node only. Statistics collected from the master node include the following:

    • Information about mastership elections

    • Information about switchovers

    • Information about the direct link

    • Details of the individual cluster nodes

    • Activity of the nhcmmd and nhprobed daemons

  • CGTP statistics

    These statistics include a set of general CGTP statistics and a set of dedicated packet filtering statistics. Packet filtering statistics count the number of packets successfully received through each of the CGTP redundant links. In this way, packet filtering statistics measure the quality of the communication.

  • Reliable NFS statistics

    These statistics give the file status and disk replication status. These statistics can be collected for master-eligible nodes only.

  • Daemon Monitor statistics

    These statistics provide the following information:

    • Which daemons are monitored

    • Which monitored processes have failed

    • The number of times a failed daemon has been restarted

    • The maximum number of times a failed daemon is allowed to be restarted

The NMA running on the master node cascades the statistics from the NMA on each of the peer nodes into its namespace. By cascading, the NMA on the master node can see the statistics on all of the peer nodes. In this way, the NMA on the master node has a view of the entire cluster. The following figure illustrates the cascade of statistics from the peer nodes to the master node.

Figure 11-2 Cascading Data From Peer Nodes to the Master Node

Diagram shows the cascade of statistics from the peer nodes to the master node.

A set of NMA APIs can be used to develop applications that monitor and react to the statistics produced by a cluster. For information about developing applications with the NMA APIs, see the Netra High Availability Suite Foundation Services 2.1 6/03 NMA Programming Guide.

The statistics that a cluster generates depend on the type, size, and arrangement of the cluster. Each cluster has an individual set of statistics. Knowing the statistics that your cluster generates when it runs correctly can help you to interpret the statistics when the cluster is failing. Use the NMA and its APIs to establish a set of statistics or a benchmark for your cluster when it is working correctly.

Manipulating the Cluster With the NMA

The NMA can be configured to initiate a switchover or to change the following Daemon Monitor parameters:

  • The maximum number of times that the Daemon Monitor attempts to restart a daemon or group of daemons

  • The current retry count for a monitored daemon

For more information, see the Netra High Availability Suite Foundation Services 2.1 6/03 NMA Programming Guide.

Previous Previous     Contents     Index     Next Next