AlphaServer SC patch kit:
==========================

AlphaServer SC 2.5 UK1

Kit Name:     SCV25UK1100243
Release Date: 051403
PTR:          153-2-1100
IPMT Number:  CFS.100243

Abstract:
This patch addresses occurrences of an rmsd core dump when clu_get_info
supplies a NULL value as the hostname of a cluster member.

Description of Patch:
=====================
This fix modifies the behaviour of the rmsd running on the node
responsible for pushing the cluster status to the mmanager. It no longer
forwards any information for a node for which clu_get_info has supplied
a value of NULL as its hostname.

This patch also contains fixes concerning the CAA failover procedure.
After a failover, the value of rmshost is now re-read from the database
by RMS daemons before they attempt to reconnect. This ensures, for
example, that the node rmsd daemons forward stats to the mmanager, and
that prun processes reconnect to the pmanager, on the new rmshost.

Kit checksum:
=============
bash-2.02$ cksum SCV25UK1100243.tar.gz
1567703762 1679441 SCV25UK1100243.tar.gz

Updated files:
==============
/usr/opt/rms/lib/librms.so
/usr/shlib/librms.so

Dependencies:
=============
This patch should be installed over the RMS kit shipped with UK1.

Instructions:
=============
This patch is provided as a setld installable kit. Unpack it into a
directory that is NFS mounted on all domains, e.g. /usr/kits/, and
install it as follows:

1. Stop Partitions, e.g.:
   # rcontrol stop partition=parallel

2. Stop RMS on all nodes, e.g.:
   # sra command -domains all -m 1 -command "CluCmd /sbin/init.d/rms stop"

3. Stop RMS and msql on Management Server:
   # /sbin/init.d/rms stop
   # /sbin/init.d/msqld stop

4. Install on Management Server:
   # /usr/sbin/setld -l SCV25UK1100243

5. Start RMS and msql on Management Server:
   # /sbin/init.d/msqld start
   # /sbin/init.d/rms start

6. Install across all domains, e.g.:
   # sra command -domains all -m 1 -command "/usr/sbin/setld -l SCV25UK1100243"

7. Start RMS on all nodes, e.g.:
   # sra command -domains all -m 1 -command "CluCmd /sbin/init.d/rms start"

8. Restart Parallel partition:
   # rcontrol start partition=parallel

--------

To remove the patch use the following steps:

1. Stop Partitions, e.g.:
   # rcontrol stop partition=parallel

2. Stop RMS on all nodes, e.g.:
   # sra command -domains all -m 1 -command "CluCmd /sbin/init.d/rms stop"

3. Delete across all domains, e.g.:
   # sra command -domains all -m 1 -command "/usr/sbin/setld -d SCV25UK1100243"

4. Stop RMS and msql on Management Server:
   # /sbin/init.d/rms stop
   # /sbin/init.d/msqld stop

5. Delete from Management Server:
   # /usr/sbin/setld -d SCV25UK1100243

6. Start RMS and msql on Management Server:
   # /sbin/init.d/msqld start
   # /sbin/init.d/rms start

7. Start RMS on all nodes, e.g.:
   # sra command -domains all -m 1 -command "CluCmd /sbin/init.d/rms start"

8. Restart Parallel partition:
   # rcontrol start partition=parallel
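
--------

Before unpacking, the kit can be checked against the values published
under "Kit checksum". The following is only a minimal sketch, not part of
the kit: verify_kit is a hypothetical helper name, and the script assumes
a POSIX shell and a cksum that prints the CRC followed by the byte count.

```shell
#!/bin/sh
# Sketch: compare a file's cksum output with a published CRC and size.
# verify_kit is a hypothetical helper; it is not shipped with the patch.

verify_kit() {
    file=$1        # path to the kit tarball
    want_crc=$2    # first field of the published cksum line
    want_size=$3   # second field of the published cksum line

    # Reading from stdin makes cksum print only "<crc> <size>".
    out=$(cksum < "$file") || return 2

    got_crc=${out%% *}       # text before the first space
    got_size=${out#* }       # text after the first space
    got_size=${got_size%% *} # drop anything after the size

    if [ "$got_crc" = "$want_crc" ] && [ "$got_size" = "$want_size" ]; then
        echo "OK: $file"
        return 0
    fi
    echo "MISMATCH: $file (got $got_crc $got_size)" >&2
    return 1
}
```

With the values listed in this note, a call would look like:

    verify_kit SCV25UK1100243.tar.gz 1567703762 1679441

A non-zero return status means the file is missing or does not match, in
which case the kit should not be installed.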