| Title and Copyright Information |
| About This Manual |
| Audience |
| New and Changed Features |
| Organization |
| Related Documents |
| Reader's Comments |
| Conventions |
| 1 | Managing Clusters Overview |
| 1.1 | Commands and Utilities for Clusters |
| 1.2 | Commands and Features That Are Different in a Cluster |
| 2 | Tools for Managing Clusters |
| 2.1 | Introduction |
| 2.1.1 | Cluster Tools Quick Start |
| 2.1.2 | A Word About Compaq Insight Manager Integration |
| 2.1.3 | A Word About Compaq Insight Manager XE Integration |
| 2.2 | Available Management Tools and Interfaces |
| 2.3 | Cluster Configuration Tools and Related User Interfaces |
| 2.4 | The Many Faces of SysMan |
| 2.4.1 | Introduction to SysMan Menu |
| 2.4.2 | Introduction to SysMan Station |
| 2.4.3 | Introduction to the SysMan Command Line |
| 2.5 | Using SysMan Menu in a Cluster |
| 2.5.1 | Getting in Focus |
| 2.5.2 | Specifying a Focus on the Command Line |
| 2.5.3 | Invoking SysMan Menu |
| 2.6 | Using SysMan Station in a Cluster |
| 2.6.1 | Invoking SysMan Station |
| 2.7 | Using the SysMan Java Applets in a Cluster |
| 2.7.1 | Invoking the SysMan Java Applets |
| 2.8 | Using the SysMan Java PC Applications in a Cluster |
| 2.8.1 | Invoking the SysMan Java Applications on a PC |
| 2.8.2 | SysMan Compatibility Issues |
| 2.9 | Using the SysMan Command-Line Interface in a Cluster |
| 2.10 | Using Compaq Insight Manager in a Cluster |
| 2.10.1 | Invoking Compaq Insight Manager |
| 2.11 | Using Tru64 UNIX Configuration Report |
| 3 | Managing the Cluster Alias Subsystem |
| 3.1 | Summary of Alias Features |
| 3.2 | Configuration Files |
| 3.3 | Planning for Cluster Aliases |
| 3.4 | Preparing to Create Cluster Aliases |
| 3.5 | Specifying and Joining a Cluster Alias |
| 3.6 | Modifying Cluster Alias and Service Attributes |
| 3.7 | Leaving a Cluster Alias |
| 3.8 | Monitoring Cluster Aliases |
| 3.9 | Load Balancing |
| 3.10 | Extending Clusterwide Port Space |
| 3.11 | Enabling Cluster Alias vMAC Support |
| 3.12 | Routing Configuration Guidelines |
| 3.13 | Cluster Alias and NFS |
| 3.14 | Cluster Alias and Cluster Application Availability |
| 4 | Managing Cluster Membership |
| 4.1 | Connection Manager |
| 4.2 | Quorum and Votes |
| 4.2.1 | How a System Becomes a Cluster Member |
| 4.2.2 | Expected Votes |
| 4.2.3 | Current Votes |
| 4.2.4 | Node Votes |
| 4.2.5 | Quorum Disk Votes |
| 4.3 | Calculating Cluster Quorum |
| 4.4 | A Connection Manager Example |
| 4.5 | Using a Quorum Disk |
| 4.5.1 | Replacing a Failed Quorum Disk |
| 4.6 | Using the clu_quorum Command to Display Cluster Vote Information |
| 4.7 | Cluster Vote Assignment Examples |
| 4.8 | Monitoring the Connection Manager |
| 4.9 | Connection Manager Panics |
| 4.10 | Troubleshooting Unfortunate Expected Vote and Node Vote Settings |
| 4.10.1 | Joining a Cluster After a Cluster Member or Quorum Disk Fails and Cluster Loses Quorum |
| 4.10.2 | Forming a Cluster When Members Do Not Have Enough Votes to Boot and Form a Cluster |
| 5 | Managing Cluster Members |
| 5.1 | Managing Configuration Variables |
| 5.2 | Managing Kernel Attributes |
| 5.3 | Managing Remote Access Within and From the Cluster |
| 5.4 | Shutting Down the Cluster |
| 5.5 | Shutting Down and Starting One Cluster Member |
| 5.5.1 | Identifying a Critical Voting Member |
| 5.5.2 | Preparing to Halt or Delete a Critical Voting Member |
| 5.5.3 | Halting a Noncritical Member |
| 5.5.4 | Shutting Down a Hosting Member |
| 5.6 | Shutting Down a Cluster Member to Single-User Mode |
| 5.7 | Deleting a Cluster Member |
| 5.8 | Removing a Cluster Member and Restoring It as a Standalone System |
| 5.9 | Changing the Cluster Name or IP Address |
| 5.9.1 | Changing the Cluster IP Address |
| 5.10 | Changing the Member Name, IP Address, or Cluster Interconnect Address |
| 5.11 | Managing Software Licenses |
| 5.12 | Installing and Deleting Layered Applications |
| 5.13 | Managing Accounting Services |
| 6 | Managing Networks in a Cluster |
| 6.1 | Providing Failover for Network Interfaces |
| 6.2 | Running IP Routers |
| 6.3 | Configuring the Network |
| 7 | Managing Network Services |
| 7.1 | Configuring DHCP |
| 7.2 | Configuring NIS |
| 7.2.1 | Configuring an NIS Master in a Cluster with Enhanced Security |
| 7.3 | Configuring Printing |
| 7.4 | Configuring DNS/BIND |
| 7.5 | Managing Time Synchronization |
| 7.5.1 | Configuring NTP |
| 7.5.2 | All Members Should Use the Same External NTP Servers |
| 7.5.2.1 | Time Drift |
| 7.6 | Managing NFS |
| 7.6.1 | Configuring NFS |
| 7.6.2 | Considerations for Using NFS in a Cluster |
| 7.6.2.1 | Clients Must Use a Cluster Alias |
| 7.6.2.2 | Using CDSLs to Mount NFS File Systems |
| 7.6.2.3 | Loopback Mounts Not Supported |
| 7.6.2.4 | Do Not Mount Non-NFS File Systems on NFS-Mounted Paths |
| 7.6.2.5 | Using AutoFS in a Cluster |
| 7.6.2.6 | Forcibly Unmounting File Systems |
| 7.6.2.6.1 | Determining Whether a Forced Unmount is Required |
| 7.6.2.6.2 | Correcting the Problem |
| 7.7 | Managing inetd Configuration |
| 7.8 | Managing Mail |
| 7.8.1 | Configuring Mail |
| 7.8.1.1 | Mail Files |
| 7.8.1.2 | The Cw Macro (System Nicknames List) |
| 7.8.1.3 | Configuring Mail at Cluster Creation |
| 7.8.1.4 | Configuring Mail After the Cluster Is Running |
| 7.8.2 | Distributing Mail Load Among Cluster Members |
| 7.9 | Configuring a Cluster for RIS |
| 7.10 | Displaying X Window Applications Remotely |
| 8 | Managing Highly Available Applications |
| 8.1 | Learning the Status of a Resource |
| 8.1.1 | Learning the State of a Resource |
| 8.1.2 | Learning Status of All Resources on One Cluster Member |
| 8.1.3 | Learning Status of All Resources on All Cluster Members |
| 8.1.4 | Getting Number of Failures and Restarts and Target States |
| 8.2 | Relocating Applications |
| 8.2.1 | Manual Relocation of All Applications on a Cluster Member |
| 8.2.2 | Manual Relocation of a Single Application |
| 8.2.3 | Manual Relocation of Dependent Applications |
| 8.3 | Starting and Stopping Application Resources |
| 8.3.1 | Starting Application Resources |
| 8.3.2 | Stopping Application Resources |
| 8.3.3 | No Multiple Instances of an Application Resource |
| 8.3.4 | Using caa_stop to Reset UNKNOWN State |
| 8.4 | Registering and Unregistering Resources |
| 8.4.1 | Registering Resources |
| 8.4.2 | Unregistering Resources |
| 8.4.3 | Updating Registration |
| 8.5 | Network, Tape, and Media Changer Resources |
| 8.6 | Using SysMan to Manage CAA |
| 8.6.1 | Managing CAA with SysMan Menu |
| 8.6.1.1 | CAA Management Dialog Box |
| 8.6.1.2 | Start Dialog Box |
| 8.6.1.3 | Setup Dialog Box |
| 8.6.2 | Managing CAA with SysMan Station |
| 8.6.2.1 | Starting an Application with SysMan Station |
| 8.6.2.2 | Resource Setup with SysMan Station |
| 8.7 | CAA Considerations for Startup and Shutdown |
| 8.8 | Managing caad |
| 8.8.1 | Determining Status of the Local CAA Daemon |
| 8.8.2 | Restarting the CAA Daemon |
| 8.8.3 | Monitoring CAA Daemon Messages |
| 8.9 | Using EVM to View CAA Events |
| 8.9.1 | Viewing CAA Events |
| 8.9.2 | Monitoring CAA Events |
| 8.10 | Troubleshooting with Events |
| 8.11 | Troubleshooting a Command-Line Message |
| 9 | Managing File Systems and Devices |
| 9.1 | Working with CDSLs |
| 9.1.1 | Making CDSLs |
| 9.1.2 | Maintaining CDSLs |
| 9.1.3 | Kernel Builds and CDSLs |
| 9.1.4 | Exporting and Mounting CDSLs |
| 9.2 | Managing Devices |
| 9.2.1 | Managing the Device Special File |
| 9.2.2 | Determining Device Locations |
| 9.2.3 | Adding a Disk to the Cluster |
| 9.2.4 | Managing Third-party Storage |
| 9.2.5 | Tape Devices |
| 9.2.6 | Formatting Floppy Disks in a Cluster |
| 9.2.7 | CD-ROM and DVD-ROM |
| 9.3 | Managing the Cluster File System |
| 9.3.1 | When File Systems Cannot Fail Over |
| 9.3.2 | Direct Access Cached Reads |
| 9.3.3 | Optimizing CFS Performance |
| 9.3.3.1 | CFS Load Balancing |
| 9.3.3.2 | Automatically Distributing CFS Server Load |
| 9.3.3.3 | Tuning the Block Transfer Size |
| 9.3.3.4 | Changing the Number of Read-Ahead and Write-Behind Threads |
| 9.3.3.5 | Taking Advantage of Direct I/O |
| 9.3.3.5.1 | Differences Between Cluster and Standalone AdvFS Direct I/O |
| 9.3.3.5.2 | Cloning a Fileset With Files Open in Direct I/O Mode |
| 9.3.3.5.3 | Gathering Statistics on Direct I/O |
| 9.3.3.6 | Adjusting CFS Memory Usage |
| 9.3.3.7 | Using Memory Mapped Files |
| 9.3.3.8 | Avoid Full File Systems |
| 9.3.3.9 | Other Strategies |
| 9.3.4 | MFS and UFS File Systems Supported |
| 9.3.5 | Partitioning File Systems |
| 9.3.6 | Block Devices and Cache Coherency |
| 9.4 | Managing the Device Request Dispatcher |
| 9.4.1 | Direct-Access I/O and Single-Server Devices |
| 9.4.1.1 | Devices Supporting Direct-Access I/O |
| 9.4.1.2 | Replacing RZ26, RZ28, RZ29, or RZ1CB-CA as Direct-Access I/O Disks |
| 9.4.1.3 | HSZ Hardware Supported on Shared Buses |
| 9.5 | Managing AdvFS in a Cluster |
| 9.5.1 | Integrating AdvFS Files from a Newly Added Member |
| 9.5.2 | Create Only One Fileset in Cluster Root Domain |
| 9.5.3 | Do Not Add a Volume to a Member's Root Domain |
| 9.5.4 | Using the addvol and rmvol Commands in a Cluster |
| 9.5.5 | User and Group File System Quotas Are Supported |
| 9.5.5.1 | Quota Hard Limits |
| 9.5.5.2 | Setting the |
| 9.5.6 | Storage Connectivity and AdvFS Volumes |
| 9.6 | Considerations When Creating New File Systems |
| 9.6.1 | Verifying Disk Connectivity |
| 9.6.2 | Looking for Available Disks |
| 9.6.2.1 | Looking for the Location of the Quorum Disk |
| 9.6.2.2 | Looking for the Location of Member Boot Disks and Clusterwide AdvFS File Systems |
| 9.6.2.3 | Looking for Member Swap Areas |
| 9.6.3 | Editing /etc/fstab |
| 9.7 | Managing CDFS File Systems |
| 9.8 | Backing Up and Restoring Files |
| 9.8.1 | Suggestions for Files to Back Up |
| 9.9 | Managing Swap Space |
| 9.9.1 | Locating Swap Device for Improved Performance |
| 9.10 | Fixing Problems with Boot Parameters |
| 9.11 | Using the verify Utility in a Cluster |
| 9.11.1 | Using the verify Utility on Cluster Root |
| 10 | Using Logical Storage Manager in a Cluster |
| 10.1 | Differences Between Managing LSM in Clusters and in Standalone Systems |
| 10.2 | Storage Connectivity and LSM Volumes |
| 10.3 | Configuring LSM for a Cluster |
| 10.3.1 | Configuring LSM Before Cluster Creation |
| 10.3.2 | Configuring LSM After Cluster Creation and Before Members Have Been Added |
| 10.3.3 | Configuring LSM in a Multimember Cluster |
| 10.4 | Adding Cluster Members with LSM Legacy Volumes |
| 10.5 | Moving LSM Disk Groups Between Standalone and Cluster Environments |
| 10.5.1 | Importing Tru64 UNIX Version 5.1A Standalone Disk Groups |
| 10.5.2 | Importing Tru64 UNIX Version 4.0 Standalone Disk Groups |
| 10.5.2.1 | Determining the Device Name, Media Name, and LSM Disk Types |
| 10.5.2.2 | Converting the Disk Group for Cluster Use |
| 10.5.2.3 | Converting Legacy Device Special Files |
| 10.6 | Dirty-Region Log Sizes for Clusters |
| 10.7 | Placing Cluster Domains into LSM Volumes |
| 10.7.1 | Encapsulating the /usr File System |
| 10.7.2 | Encapsulating Members' swap Devices |
| 10.7.3 | Migrating AdvFS Domains into LSM Volumes |
| 10.7.4 | Migrating Domains from LSM Volumes to Physical Storage |
| 10.7.5 | Unencapsulating Swap Volumes |
| 11 | Troubleshooting Clusters |
| 11.1 | Resolving Problems |
| 11.1.1 | Booting Systems Without a License |
| 11.1.2 | Shutdown Leaves Members Running |
| 11.1.3 | Dealing with CFS Errors at Boot |
| 11.1.4 | Backing Up and Repairing a Member's Boot Disk |
| 11.1.4.1 | Example of Recovering a Member's Boot Disk |
| 11.1.5 | Specifying cluster_root at Boot Time |
| 11.1.6 | Recovering the Cluster Root File System to a Disk Known to the Cluster |
| 11.1.7 | Recovering the Cluster Root File System to a New Disk |
| 11.1.8 | Dealing with AdvFS Problems |
| 11.1.8.1 | Responding to Warning Messages from addvol or rmvol |
| 11.1.8.2 | Resolving AdvFS Domain Panics Due to Loss of Device Connectivity |
| 11.1.8.3 | Forcibly Unmounting an AdvFS File System or Domain |
| 11.1.8.4 | Avoiding Domain Panics |
| 11.1.9 | Accessing Boot Partitions on Down Systems |
| 11.1.10 | Booting a Member While Its Boot Disk Is Already Mounted |
| 11.1.11 | Generating Crash Dumps |
| 11.1.12 | Fixing Network Problems |
| 11.1.13 | Running routed in a Cluster |
| 11.2 | Hints for Managing Clusters |
| 11.2.1 | Moving /tmp |
| 11.2.2 | Running the MC_CABLE Console Command |
| 11.2.3 | Korn Shell Does Not Record True Path to Member-Specific Directories |
| A | Cluster Events |
| B | Configuration Variables |
| C | clu_delete_member Log |
| Examples |
| 2-1 | Example |
| Figures |
| 2-1 | The SysMan Menu Hierarchy |
| 2-2 | The SysMan Menu Interfaces |
| 2-3 | SysMan Station Graphical Interface |
| 2-4 | SysMan Station Initial Cluster View |
| 2-5 | A Sample SysMan Station Cluster Hardware View |
| 2-6 | Displaying Available Actions in SysMan Station |
| 2-7 | The Compaq Insight Manager Display |
| 2-8 | Sample Configuration Report Display |
| 4-1 | The Three-Member deli Cluster |
| 4-2 | Three-Member deli Cluster Loses a Member |
| 4-3 | Two-Member deli Cluster Without a Quorum Disk |
| 4-4 | Two-Member deli Cluster with Quorum Disk Survives Member Loss |
| 8-1 | CAA Branch of SysMan Menu |
| 8-2 | CAA Management Dialog Box |
| 8-3 | Start Dialog Box |
| 8-4 | Setup Dialog Box |
| 8-5 | SysMan Station CAA_Applications_(active) View |
| 8-6 | SysMan Station CAA_Applications_(all) View |
| 8-7 | SysMan Station CAA Setup Screen |
| 9-1 | SysMan Station Display of Hardware Configuration |
| 9-2 | Cluster with Semi-private Storage |
| 9-3 | Four Node Cluster |
| Tables |
| 1-1 | Cluster Commands |
| 1-2 | File Systems and Storage Differences |
| 1-3 | Networking Differences |
| 1-4 | Printing Differences |
| 1-5 | Security Differences |
| 1-6 | General System Management Differences |
| 1-7 | Features Not Supported |
| 2-1 | Cluster Tools Quick Start |
| 2-2 | Available Management Tools and Interfaces |
| 2-3 | Cluster Management Tools |
| 2-4 | Invoking SysMan Menu |
| 2-5 | Invoking SysMan Station |
| 4-1 | Effects of Various Member cluster_expected_votes Settings and Vote Assignments in a Two- to Four-Member Cluster |
| 4-2 | Examples of Resolving Quorum Loss in a Cluster with Failed Members or Quorum Disk |
| 4-3 | Examples of Repairing a Quorum Deficient Cluster by Booting a Member with Sufficient Votes to Form the Cluster |
| 5-1 | /etc/rc.config* Files |
| 5-2 | Kernel Attributes Not to Decrease |
| 5-3 | Configurable TruCluster Server Subsystems |
| 8-1 | Target and State Combinations for Application Resources |
| 8-2 | Target and State Combinations for Network Resources |
| 8-3 | Target and State Combinations for Tape and Media Changer Resources |
| 9-1 | Sources of Information of Storage Device Management |
| 10-1 | Sizes of DRL Log Subdisks |
| 11-1 | File Systems and Storage Differences |
| B-1 | Cluster Configuration Variables |
| Index |