6 Using Fibre Channel Storage

This chapter provides an overview of Fibre Channel, Fibre Channel configuration examples, and information on Fibre Channel hardware installation and configuration in a Tru64 UNIX or TruCluster Server Version 5.1A configuration.

This chapter discusses the following topics:

An overview of Fibre Channel (Section 6.1).

A comparison of Fibre Channel topologies (Section 6.2).

Example cluster configurations using Fibre Channel storage (Section 6.3).

A brief discussion of Quickloop (Section 6.4).

A discussion of zoning (Section 6.5).

A discussion of cascaded switches (Section 6.6).

A procedure for Tru64 UNIX Version 5.1A or TruCluster Server Version 5.1A installation using Fibre Channel disks (Section 6.7).

The steps necessary to install and configure the Fibre Channel hardware (Section 6.8).

The steps necessary to install the base operating system and cluster software using disks accessible over the Fibre Channel hardware (Section 6.9).

How to convert the HSG80 from transparent to multiple-bus failover mode (Section 6.10).

A discussion on how you can use the emx manager (emxmgr) to display the presence of Fibre Channel adapters, target ID mappings for a Fibre Channel adapter, and the current Fibre Channel topology (Section 6.11).

The information includes an example storageset configuration, how to determine the /dev/disk/dskn value that corresponds to the Fibre Channel storagesets that have been set up as the Tru64 UNIX boot disk, cluster root (/), cluster /usr, cluster /var, cluster member boot, and quorum disks, and how to set up the bootdef_dev console environment variable to facilitate Tru64 UNIX Version 5.1A and TruCluster Server Version 5.1A installation.

Note

TruCluster Server Version 5.1A configurations require one or more disks to hold the Tru64 UNIX operating system. The disks are either private disks on the system that will become the first cluster member, or disks on a shared bus that the system can access.
Whether or not you install the base operating system on a shared disk, always shut down the cluster before booting the Tru64 UNIX disk.
TruCluster Server requires a cluster interconnect, which can be the Memory Channel, or a private LAN. (See the Cluster LAN Interconnect manual for more information on the LAN interconnect.) All the configurations in this chapter are shown with the Memory Channel interconnect. The illustrations could represent multiple standalone systems by removing the Memory Channel.

6.1 Fibre Channel Overview

Fibre Channel supports multiple protocols over the same physical interface. Fibre Channel is primarily a protocol-independent transport medium; therefore, it is independent of the function for which you use it.

TruCluster Server uses the Fibre Channel Protocol (FCP) for SCSI to use Fibre Channel as the physical interface.

Fibre Channel, with its serial transmission method, overcomes the limitations of parallel SCSI by providing:

Data rates of 100 MB/sec, 200 MB/sec, and 400 MB/sec

Support for multiple protocols

Better scalability

Improved reliability, serviceability, and availability

Fibre Channel uses an extremely high-transmit clock frequency to achieve the high data rate. Using optical fiber transmission lines allows the high-frequency information to be sent up to 40 kilometers (24.85 miles), which is the maximum distance between transmitter and receiver. Copper transmission lines may be used for shorter distances.

6.1.1 Basic Fibre Channel Terminology

The following list describes the basic Fibre Channel terminology:

AL_PA: The Arbitrated Loop Physical Address (AL_PA) is used to address nodes on the Fibre Channel loop. When a node is ready to transmit data, it transmits Fibre Channel primitive signals that include its own identifying AL_PA.
Arbitrated Loop: A Fibre Channel topology in which frames are routed around a loop set up by the links between the nodes in the loop. All nodes in a loop share the bandwidth, and bandwidth degrades slightly as nodes and cables are added.
Frame: All data is transferred in a packet of information called a frame. A frame is limited to 2112 bytes. If the information consists of more than 2112 bytes, it is divided up into multiple frames.
Node: The source and destination of a frame. A node may be a computer system, a redundant array of independent disks (RAID) array controller, or a disk device. Each node has a 64-bit unique node name (worldwide name) that is built into the node when it is manufactured.
N_Port: Each node must have at least one Fibre Channel port from which to send or receive data. This node port is called an N_Port. Each port is assigned a 64-bit unique port name (worldwide name) when it is manufactured. An N_Port is connected directly to another N_Port in a point-to-point topology. An N_Port is connected to an F_Port in a fabric topology.
NL_Port: In an arbitrated loop topology, information is routed around a loop. A node port that can operate on the loop is called an NL_Port (node loop port). The information is repeated by each NL_Port until it reaches its destination. Each port has a 64-bit unique port name (worldwide name) that is built into the node when it is manufactured.
Fabric: A switch, or multiple interconnected switches, that route frames between the originator node (transmitter) and destination node (receiver).
F_Port: The ports within the fabric (fabric port). This port is called an F_port. Each F_port is assigned a 64-bit unique node name and a 64-bit unique port name when it is manufactured. Together, the node name and port name make up the worldwide name.
FL_Port: An F_Port containing the loop functionality is called an FL_Port.
Link: The physical connection between an N_Port and another N_Port or an N_Port and an F_Port. A link consists of two connections, one to transmit information and one to receive information. The transmit connection on one node is the receive connection on the node at the other end of the link. A link may be optical fiber, coaxial cable, or shielded twisted pair.
E_Port interswitch expansion port: An expansion port on a switch used to make a connection between two switches in the fabric.

6.1.2 Fibre Channel Topologies

Fibre Channel supports three different interconnect topologies:

Point-to-point (Section 6.1.2.1)

Fabric (Section 6.1.2.2)

Arbitrated loop (Section 6.1.2.3)

Note

Although you can interconnect an arbitrated loop with fabric, hybrid configurations are not supported at the present time, and therefore are not discussed in this manual.

6.1.2.1 Point-to-Point

The point-to-point topology is the simplest Fibre Channel topology. In a point-to-point topology, one N_Port is connected to another N_Port by a single link.

Because all frames transmitted by one N_Port are received by the other N_Port, and in the same order in which they were sent, frames require no routing.

Figure 6-1 shows an example point-to-point topology.

Figure 6-1: Point-to-Point Topology

6.1.2.2 Fabric

The fabric topology provides more connectivity than point-to-point topology. The fabric topology can connect up to 2²⁴ ports.

The fabric examines the destination address in the frame header and routes the frame to the destination node.

A fabric may consist of a single switch, or there may be several interconnected switches (up to three interconnected switches are supported). Each switch contains two or more fabric ports (F_Port) that are internally connected by the fabric switching function, which routes the frame from one F_Port to another F_Port within the switch. Communication between two switches is routed between two expansion ports (E_Ports).

When an N_Port is connected to an F_Port, the fabric is responsible for the assignment of the Fibre Channel address to the N_Port attached to the fabric. The fabric is also responsible for selecting the route a frame will take, within the fabric, to be delivered to the destination.

When the fabric consists of multiple switches, the fabric can determine an alternate route to ensure that a frame gets delivered to its destination.

Figure 6-2 shows an example fabric topology.

Figure 6-2: Fabric Topology

6.1.2.3 Arbitrated Loop Topology

In an arbitrated loop topology, frames are routed around a loop set up by the links between the nodes. The hub maintains loop continuity by bypassing a node when the node or its cabling fails, when the node is powered down, or when the node is removed for maintenance. The hub is transparent to the protocol. It does not consume any Fibre Channel arbitrated loop addresses so it is not addressable by a Fibre Channel arbitrated loop port.

The nodes arbitrate to gain control (become master) of the loop. After a node becomes master, the nodes select (by way of setting bits in a bitmask) their own Arbitrated Loop Physical Address (AL_PA). The AL_PA is used to address nodes on the loop. The AL_PA is dynamic and can change each time the loop is initialized, a node is added or removed, or at any other time that an event causes the membership of the loop to change. When a node is ready to transmit data, it transmits Fibre Channel primitive signals that include its own identifying AL_PA.

In the arbitrated loop topology, a node port is called an NL_Port (node loop port), and a fabric port is called an FL_Port (fabric loop port).

Figure 6-3 shows an example of an arbitrated loop topology.

Figure 6-3: Arbitrated Loop Topology

6.2 Fibre Channel Topology Comparison

This section compares and contrasts the fabric and arbitrated loop topologies and describes why you might choose to use them.

When compared with the fabric (switched) topology, arbitrated loop is a lower cost, and lower performance, alternative. Arbitrated loop reduces Fibre Channel cost by substituting a lower-cost, often nonintelligent and unmanaged hub, for a more expensive switch. The hub operates by collapsing the physical loop into a logical star. The cables, associated connectors, and allowable cable lengths are similar to those of a fabric. Arbitrated loop supports a theoretical limit of 127 nodes in a loop. Arbitrated loop nodes are self-configuring and do not require Fibre Channel address switches.

Arbitrated loop provides reduced cost at the expense of bandwidth; all nodes in a loop share the bandwidth (100 MB/sec per loop), and bandwidth degrades slightly as nodes and cables are added. Nodes on the loop see all traffic on the loop, including traffic between other nodes. The hub can include port-bypass functions that manage movement of nodes on and off the loop. For example, if the port bypass logic detects a problem, the hub can remove that node from the loop without intervention. Data availability is then preserved by preventing the down time associated with node failures, cable disconnections, and network reconfigurations. However, traffic caused by node insertion and removal, errors, and so forth, can cause temporary disruption on the loop.

Although the fabric topology is more expensive, it provides both increased connectivity and higher performance; switches provide a full-duplex 100 (200) MB/sec point-to-point connection to the fabric. Switches also provide improved performance and scaling because nodes on the fabric see only data destined for themselves, and individual nodes are isolated from reconfiguration and error recovery of other nodes within the fabric. Switches can provide management information about the overall structure of the Fibre Channel fabric, which may not be the case for an arbitrated loop hub.

Table 6-1 compares the fabric and arbitrated loop topologies.

Table 6-1: Fibre Channel Fabric and Arbitrated Loop Comparison

When to use Arbitrated Loop	When to use Fabric
In clusters of up to two members	In clusters of more than two members
In applications where low total solution cost and simplicity are key requirements	In multinode cluster configurations when possible temporary traffic disruption due to reconfiguration or repair is a concern
In applications where the shared bandwidth of an arbitrated loop configuration is not a limiting factor	In high bandwidth applications where a shared arbitrated loop topology is not adequate
In configurations where expansion and scaling are not anticipated	In cluster configurations where expansion is anticipated and requires performance scaling

6.3 Example Fibre Channel Configurations Supported by TruCluster Server

This section provides diagrams of some of the configurations supported by TruCluster Server Version 5.1A. Diagrams are provided for both transparent failover mode and multiple-bus failover mode.

6.3.1 Fibre Channel Cluster Configurations for Transparent Failover Mode

With transparent failover mode:

The hosts do not know a failover has taken place (failover is transparent to the hosts).

The units are divided between an HSG80 port 1 and port 2.

If there are dual-redundant HSG80 controllers, controller A port 1 and controller B port 2 are normally active; controller A port 2 and controller B port 1 are normally passive.

If one controller fails, the other controller takes control and both its ports are active.

Figure 6-4 shows a typical Fibre Channel cluster configuration using transparent failover mode.

Figure 6-4: Fibre Channel Single Switch Transparent Failover Configuration

In transparent failover, units D00 through D99 are accessed through port 1 of both controllers. Units D100 through D199 are accessed through port 2 of both HSG80 controllers.

You cannot achieve a no-single-point-of-failure (NSPOF) configuration using transparent failover. The host cannot initiate failover, and if you lose a host bus adapter, switch or hub, or a cable, you lose the units behind at least one port.

You can, however, add the hardware for a second bus (another KGPSA, switch, and RA8000/ESA12000 with associated cabling) and use LSM to mirror across the buses. However, because you cannot use LSM to mirror the member boot partitions or the quorum disk you cannot obtain an NSPOF transparent failover configuration, even though you have increased availability.

Figure 6-5 shows a two-node Fibre Channel cluster with a single RA8000 or ESA12000 storage array with dual-redundant HSG80 controllers and an DS-SWXHB-07 Fibre Channel hub.

Figure 6-5: Arbitrated Loop Configuration with One Storage Array

6.3.2 Fibre Channel Cluster Configurations for Multiple-Bus Failover Mode

With multiple-bus failover:

The host controls the failover by accessing units over a different path or causing the access to the unit to be through the other HSG80 controller.

An active controller causes a failover to the other controller if the controller recognizes the loss of the switch, hub, or cable to a controller port.

Each cluster member system has two or more (fabric only) KGPSA host bus adapters (multiple paths to the storage units).

Normally, all available units (D0 through D199) are available at all host ports. Only one HSG80 controller will be actively doing I/O for any particular storage unit.
However, both controllers can be forced active by preferring units to one controller or the other (SET unit PREFERRED_PATH=THIS). By balancing the preferred units, you can obtain the best I/O performance using two controllers.

Note

If you have preferred units, and the HSG80 controllers restart because of an error condition or power failure, and one controller restarts before the other controller, the HSG80 controller restarting first will take all the units, whether they are preferred or not. When the other HSG80 controller starts, it will not have access to the preferred units, and will be inactive.
Therefore, you want to ensure that both HSG80 controllers start at the same time under all circumstances so that the controller sees its own preferred units.

Figure 6-6 and Figure 6-7 show two different recommended multiple-bus NSPOF cluster configurations. The only difference is the fiber-optic cable connection path between the switch and the HSG80 controller ports.

There is no difference in performance between these two configurations. It may be easier to cable the configuration shown in Figure 6-6 because the cables from one switch (or switch zone) both go to the ports on the same side of both controllers (for example, port 1 of both controllers).

Figure 6-6: Multiple-Bus NSPOF Configuration Number 1

Figure 6-7: Multiple-Bus NSPOF Configuration Number 2

The configuration that is shown in Figure 6-8 is a NSPOF configuration, but is not a recommended cluster configuration because of the performance loss during failure conditions. If a switch or cable failure causes a failover to the other switch, access to the storage units has to be moved to the other controller, and that takes time. In the configurations shown in Figure 6-6 and Figure 6-7, the failure would cause access to the storage unit to shift to the other port of the same controller. This is faster than a change of controllers, providing better overall performance.

Note

If you have a configuration like the one that is shown in Figure 6-8, change the switch to HSG80 cabling to match the configurations that are shown in Figure 6-6 or Figure 6-7.

The single-system configuration that is shown in Figure 6-9 is also a configuration that we do not recommend.

Figure 6-8: A Configuration That Is Not Recommended

Figure 6-9: Another Configuration That Is Not Recommended

Figure 6-10 shows the maximum supported arbitrated loop configuration of a two-node Fibre Channel cluster with two RA8000 or ESA12000 storage arrays, each with dual-redundant HSG80 controllers and two DS-SWXHB-07 Fibre Channel hubs. This provides a NSPOF configuration.

Figure 6-10: Arbitrated Loop Maximum Configuration

6.4 QuickLoop

QuickLoop supports Fibre Channel arbitrated loop (FC-AL) devices within a fabric. This logical private loop fabric attach (PLFA) consists of multiple private arbitrated loops (looplets) that are interconnected by a fabric. A private loop is formed by logically connecting ports on up to two switches.

Note

QuickLoop is not supported in a Tru64 UNIX Version 5.1A configuration or TruCluster Server Version 5.1A configuration.

6.5 Zoning

This section provides a brief overview of zoning.

A zone is a logical subset of the Fibre Channel devices that are connected to the fabric. Zoning allows partitioning of resources for management and access control. In some configurations, it may provide for more efficient use of hardware resources by allowing one switch to serve multiple clusters or even multiple operating systems. Zoning entails splitting the fabric into zones, where each zone is essentially a virtual fabric.

Zoning may be used:

When you want to set up barriers between systems of different operating environments or uses, for instance to allow two clusters to utilize the same switch.

To create test areas that are separate from the rest of the fabric.

To provide better utilization of a switch by reducing the number of unused ports.

Note

Any initial zoning must be made before connecting the host bus adapters and the storage to the switches, but after zoning is configured, changes can be made dynamically.

6.5.1 Switch Zoning Versus Selective Storage Presentation

Switch zoning and the selective storage presentation (SSP) feature of the HSG80 controllers have similar functions.

Switch zoning controls which servers can communicate with each other and each storage controller host port. SSP controls which servers will have access to each storage unit.

Switch zoning controls access at the storage system level, whereas SSP controls access at the storage unit level.

The following configurations require zoning or selective storage presentation:

When you have a TruCluster Server cluster in a storage array network (SAN) with other stand-alone systems (UNIX or non-UNIX), or other clusters.

Any time you have Windows NT or Windows 2000 in the same SAN with Tru64 UNIX. (Windows NT or Windows 2000 must be in a separate switch zone.)

The SAN configuration has more than 64 connections to an RA8000, ESA12000, MA6000, MA8000, or EMA12000.

The use of selective storage presentation is the preferred way to control access to storage (so zoning is not required).

6.5.2 Types of Zoning

There are two types of zoning, soft and hard:

Soft zoning is a software implementation that is based on the Simple Name Server (SNS) enforcing a zone. Zones are defined by either the node or port World Wide Names (WWN), or the domain and port numbers in the form of D,P, where D is the domain and P is the physical port number on the switch.
A host system requests a list of all adapters and storage controllers that are connected to the fabric. The name service provides a list of all ports that are in the same zone or zones as the requesting host bus adapter.
Soft zoning only works if all hosts honor it; it does not work if a host is not programmed to allow for soft zoning. For instance, if a host tries to access a controller that is outside the zone, the switch does not prevent the access.
Tru64 UNIX honors soft zoning and does not attempt to access devices outside the zone.
If you have used the WWN to define the zone and replace a KGPSA host bus adapter, you must modify the zone configuration and SSP because the node World Wide Name has changed.

With hard zoning, zones are enforced at the physical level across all fabric switches by hardware blocking of Fibre Channel frames. Hardware zone definitions are in the form of D,P, where D is the domain and P is the physical port number on the switch. An example might be 1,2 for switch 1, port 2.
If a host attempts to access a port that is outside its zone, the switch hardware blocks the access.
You must modify the zone configuration when you move any cables from one port to another within the zone.

If you want to guarantee that there is no access outside any zone, either use hard zoning, or use operating systems that state that they support soft zoning.

Table 6-2 lists the types of zoning that are supported on each of the supported Fibre Channel switches.

Table 6-2: Type of Zoning Supported by Switches

Switch Type	Type of Zoning Supported
DS-DSGGA	Soft
DS-DSGGB	Soft and Hard
DS-DSGGC	Soft and Hard

6.5.3 Zoning Example

Figure 6-11 provides an example configuration using zoning. This configuration consists of two independent zones with each zone containing an independent cluster.

Figure 6-11: A Simple Zoned Configuration

For information on setting up zoning, see the SAN Switch Zoning documentation that is provided with the switch.

6.6 Cascaded Switches

Multiple switches may be connected to each other to form a network of switches, or cascaded switches.

A cascaded switch configuration, which allows for network failures up to and including the switch without losing a data path to a SAN connected node, is called a mesh or meshed fabric. Figure 6-12 shows an example meshed fabric with three cascaded switches. This is not a no-single-point-of-failure (NSPOF) configuration.

Figure 6-12: Meshed Fabric with Three Cascaded Switches

Figure 6-13 shows an example meshed resilient fabric with four cascaded interconnected switches. This configuration will tolerate multiple data path failures, and is an NSPOF configuration.

Figure 6-13: Meshed Resilient Fabric with Four Cascaded Switches

Note

If you lose an ISL, the communication can be routed through another switch to the same port on the other controller. This can constitute the maximum allowable two hops.

You can find the following information about storage array networks (SAN) in the Compaq StorageWorks Heterogeneous Open SAN Design Reference Guide located at:


http://www5.compaq.com/products/storageworks/techdoc/san/AA-RMPNA-TE.html

Supported SAN topologies

SAN fabric design rules

SAN platform and operating system restrictions (including the number of switches supported)

6.7 Procedure for Installation Using Fibre Channel Disks

Use the following procedure to install Tru64 UNIX Version 5.1A and TruCluster Server Version 5.1A using Fibre Channel disks. If you are only installing Tru64 UNIX Version 5.1A, complete the first eight steps. Complete all the steps for a TruCluster Server Version 5.1A installation. See the Tru64 UNIX Installation Guide, TruCluster Server Cluster Installation manual, and other hardware manuals as appropriate for the actual installation procedures.

Install the Fibre Channel switch or hub (Section 6.8.1 or Section 6.8.2).

Install the KGPSA PCI-to-Fibre Channel host bus adapter (Section 6.8.3).

Set up the HSG80 RAID array controllers for a fabric or loop configuration (Section 6.8.4).

Configure the HSG80 disks to be used for installation of the base operating system and cluster. Be sure to set the identifier for each storage unit you will use for operating system or cluster installation (Section 6.9.1.1 and Section 6.9.1.2).

If the system is not already powered on, power on the system where you will install Tru64 UNIX Version 5.1A. If this is a cluster installation, this system will also be the first cluster member.
Use the console WWID manager (wwidmgr) utility to set the device unit number for the Fibre Channel Tru64 UNIX Version 5.1A disk and first cluster member system boot disks (Section 6.9.1.3).

Use the show wwid* and show n* console commands to show the disk devices that are currently reachable, and the paths to the devices (Section 6.9.1.4).

Use the WWID manager to set the bootdef_dev console environment variable for the system where you will install the Tru64 UNIX operating system (Section 6.9.1.5).

See the Tru64 UNIX Installation Guide and install the base operating system from the CD-ROM. The installation procedure will recognize the disks for which you set the device unit number. Select the disk that you have chosen as the Tru64 UNIX operating system installation disk from the list of disks that is provided (Section 6.9.2).
After the new kernel has booted to multi-user mode, complete the operating system installation.
If you will not be installing TruCluster Server software, reset the bootdef_dev console environment variable to provide multiple boot paths to the boot disk (Section 6.9.6), then boot the operating system.

Determine the /dev/disk/dskn values to be used for cluster installation (Section 6.9.3).

Use the disklabel utility to label the disks that were used to create the cluster (Section 6.9.4).

See the TruCluster Server Cluster Installation manual and install the TruCluster Server software subsets, then run the clu_create command to create the first cluster member. Do not allow clu_create to boot the system. Shut down the system to the console prompt (Section 6.9.5).

Reset the bootdef_dev console environment variable to provide multiple boot paths to the cluster member boot disk (Section 6.9.6). Boot the first cluster member.

See the Cluster Installation manual and add subsequent cluster member systems (Section 6.9.7). As with the first cluster member, you will have to:
- Use the wwidmgr command to set the device unit number for the member system boot disk.
- Set the bootdef_dev environment variable.
- Reset the bootdef_dev environment variable after building a kernel on the new cluster member system.

6.8 Installing and Configuring Fibre Channel Hardware

This section provides information about installing the Fibre Channel hardware that is needed to support Tru64 UNIX or a TruCluster Server configuration using Fibre Channel storage.

Ensure that the member systems, the Fibre Channel switches or hubs, and the HSG80 array controllers are placed within the lengths of the optical cables that you will be using.

Note

The maximum length of the optical cable between the KGPSA and the switch (or hub), or the switch (or hub) and the HSG80 array controller, is 500 meters (1640.4 feet) via shortwave multimode Fibre Channel cable. The maximum distance between switches in a cascaded switch configuration is 10 kilometers (6.2 miles) using longwave single-mode fiber.

6.8.1 Installing and Setting Up the Fibre Channel Switch

The Fibre Channel switches support up to 8 (DS-DSGGA-AA, DS-DSGGB-AA, and DS-DSGGC-AA) or 16 (DS-DSGGA-AB, DS-DSGGB-AB, or DS-DSGGC-AB) full-duplex 1.6025 Gb/sec ports. Each switch port can be connected to a KGPSA-BC or KGPSA-CA PCI-to-Fibre Channel host bus adapter, an HSG60 or HSG80 array controller, or another switch.

Each switch, except the DS-DSGGB-AA and DS-DSGGC-AA/AB, has a front panel display and four push buttons that you use to manage the switch. Four menus allow you to configure, operate, obtain status, or test the switch. The DS-DSGGB-AA and DS-DSGGC-AA/AB are managed through a telnet session after the IP address has been set (from a PC or terminal).

All switches have a 10Base-T Ethernet (RJ45) port, and after the IP address is set, the Ethernet connection allows you to manage the switch:

Remotely using a telnet TCP/IP connection

With the Simple Network Management Protocol (SNMP)

Using Web management tools

Note

You have to set the IP address and subnet mask from the front panel (or from a PC or terminal with the DS-DSGGB-AA or DS-DSGGC-AA/AB) before you can manage the switch via a telnet session, SNMP, or the Web.
The DS-DSGGC-AA/AB Fibre Channel switches have a default IP address of 10.77.77.77. You may need to change this IP address before you connect the switch to the network.

The DSGGA switch has slots to accommodate up to four (DS-DSGGA-AA) or eight (DS-DSGGA-AB) plug-in interface modules. Each interface module in turn supports two Gigabit Interface Converter (GBIC) modules. The GBIC module is the electrical-to-optical converter.

The shortwave GBIC supports 50-micron multimode fiber (MMF) using the standard subscriber connector (SC) connector. The longwave GBIC supports 9-micron, single-mode fiber-optic cables. Only the 50-micron MMF optical cable is supported between the host bus adapters and switches or switches and HSG80 controllers for the TruCluster Server product. Longwave single-mode fiber-optic cables are supported between switches in a cascaded switch configuration.

Note

If you need to install additional interface modules in the DSGGA switch, do so before you place the switch in a relatively inaccessible location because you have to remove the top cover to install the interface modules.

The DSGGB switch accommodates up to 8 (DS-DSGGB-AA) or 16 (DS-DSBBG-AB) GBIC modules.

The DS-DSGGC-AA switch has seven fixed shortwave optical transceivers and one removable GBIC.

The DSGGC-AB switch accommodates up to 16 GBIC modules.

6.8.1.1 Installing the Switch

Place the switch within 500 meters (1640.4 feet) of the member systems (with KGPSA PCI-to-Fibre Channel adapter) and the HSG80 array controllers.

You can mount the switches in a 48.7-cm (19-in) rackmount installation or place the switch on a flat solid surface.

When you plan the switch location, ensure that you provide access to the front of the switch. All cables plug into the front of the switch. Also, for those switches with a control panel, the display and switches are on the front of the switch.

For an installation, at a minimum, you have to complete the following steps. Some of the steps are explained in more detail in the following sections.

Place the switch or install it in the rack.

If you are using a DS-DSGGB-AA or DS-DSGGC, connect the switch to a terminal or PC (Section 6.8.1.2.3).

Connect the Ethernet cable between the Fibre Channel switch and the Ethernet switch or hub.

Connect the fiber-optic cables between the switch and host bus adapters and RAID array controllers.

Connect power to the switch.

Turn on the switch power. The switch runs a series of power-on self test (POST) tests. The DS-DSGGC Fibre Channel switches have no power switch; power is applied when the unit is plugged in.

Set the switch IP address and subnet mask (Section 6.8.1.2.2 or Section 6.8.1.2.3). You can also set the switch name if desired (Section 6.8.1.2.5). The switch IP address and subnet mask must be set from the front panel, except for the DS-DSGGB-AA or DS-DSGGC-AA/AB Fibre Channel switches. In this case you have to set the IP address from the PC or terminal. You may have to use a telnet session to set the switch name (Section 6.8.1.2.4).

Reboot the switch to enable the change in IP address and subnet mask to take effect.

For more information on the individual switches, see the following documentation:

Compaq StorageWorks Fibre Channel Storage Switch User's Guide

Compaq StorageWorks SAN Switch 8 Installation and Hardware Guide

Compaq StorageWorks SAN Switch 16 Installation and Hardware Guide

Compaq StorageWorks Fibre Channel SAN Switch 8-EL Installation and Hardware Guide

Compaq StorageWorks Fibre Channel SAN Switch 16-EL Installation and Hardware Guide

For more information on managing a Fibre Channel switch, see the Compaq StorageWorks Fibre Channel SAN Switch Management Guide.

For more information on the SAN switch fabric operating system, see the Compaq StorageWorks SAN Switch Fabric Operating System Management Guide.

6.8.1.2 Managing the Fibre Channel Switches

You can manage the DS-DSGGA-AA, DS-DSGGA-AB, and DS-DSGGB-AB switches, and obtain switch status from the front panel, by making a telnet connection or by accessing the Web. The DS-DSGGB-AA and DS-DSGGC-AA/AB Fibre Channel switches do not have a front panel, so you must use a telnet connection or use Web access.

Before you can make a telnet connection or access the switch via the Web, you must assign an IP address and subnet mask to the Ethernet connection using the front panel or from a PC or terminal (DS-DSGGB-AA and DS-DSGGC-AA/AB). You may have to modify the DS-DSGGC-AA/AB IP address from the default of 10.77.77.77.

6.8.1.2.1 Using the Switch Front Panel

The switch front panel consists of a display and four buttons. The display is normally not active, but it lights up when any of the buttons are pressed. The display has a timer. After approximately 30 seconds of inactivity, the display will go out.

The four front panel buttons are:

Up -- Upward triangle: Scrolls the menu up (which effectively moves down the list of commands) or increases the value that is being displayed.

Down -- Downward triangle: Scrolls the menu down (which effectively moves up the list of commands) or decreases the value that is being displayed.

Note

When the up or down buttons are used to increase or decrease a numerical display, the number changes slowly at first, but changes to fast mode if the button is held down. The maximum number displayed is 255. An additional increment at a count of 255 resets the count to 0.

Tab/Esc -- Leftward triangle: Allows you to tab through multiple optional functions, for example, the fields in an IP address. You can use this button to abort an entry, which takes you to the previous menu item. If pressed repeatedly, the front panel display will turn off.

Enter -- Rightward triangle: Causes the switch to accept the input you have made and move to the next function.

6.8.1.2.2 Setting the Ethernet IP Address and Subnet Mask from the Front Panel

Before you telnet to the switch, you must connect the Ethernet cable and then set the Ethernet IP address and subnet mask.

To use the front panel to set the Ethernet address and subnet mask, follow these steps:

Press any of the switch front panel buttons to activate the display for the top-level menu. If the Configuration Menu is not displayed, press the down button repeatedly until it is displayed:
```
Select Menu:
Configuration Menu
 
```
Note

Pressing the down button selects the next lower top-level menu. The top-level menus are:
```
Configuration Menu
Operation Menu
Status Menu
Test Menu
 
```

Press Enter to display the first submenu item in the configuration menu, Ethernet IP address:
```
Ethernet IP address:
10.00.00.10
--
 
```
The underline cursor denotes the selected address field.
Use the up or down button to increase or decrease the displayed number. Use the Tab/Esc button to select the next field. Modify the address fields until you have the address set correctly.

Use Enter to accept the value and step to the next submenu item (Ethernet Submask), and then repeat step 2 to set the Ethernet subnet mask.

Press Enter to accept the Ethernet subnet mask.

Press the Tab/Esc button repeatedly to get back to the top-level menu.

Press the down button to select the Operation Menu:
```
Select Menu:
Operation Menu
 
```

If the switch is operational, place the switch off line before rebooting or you will lose any transmission in progress.
Press Enter to display the first submenu in the Operation Menu, Switch Offline:
```
Operation Menu:
Switch Offline
 
```

Press the down button until the Reboot submenu item is displayed:
```
Operation Menu:
Reboot
 
```

Press Enter. You can change your mind and not reboot:
```
Reboot
Accept?    Yes  No
 
```

Use the Tab/Esc button to select Yes. Press Enter to reboot the switch and execute the POST tests.

Note

After changing any configuration menu settings, you must reboot the switch for the change to take effect.

Refer to the switch documentation for information on other switch configuration settings.

6.8.1.2.3 Setting the DS-DSGGB-AA and DS-DSGGC-AA/AB Ethernet IP Address and Subnet Mask from a PC or Terminal

For the DS-DSGGB-AA and DS-DSGGC-AA/AB switches, which do not have a front panel, you must use a connection to a Windows 95/98/NT/2000 PC or video terminal to set the Ethernet IP address and subnet mask.

To set the Ethernet IP address and subnet mask for the DS-DSGGB-AA or DS-DSGGC-AA/AB switches, follow these steps:

Connect the switch serial port to a terminal or PC COM port with a standard serial cable with a DB9 connector. Note that the serial port is only used for initial power-on self-test (POST) verification, IP address configuration, or for resetting the factory/default settings. Ensure that you make the connection to the switch serial port and not the Ethernet port.

If you are using a PC, start a remote communication program, for example, HyperTerminal.

Set the port settings to 9600 bits per second, 8 bits per character, no parity, 1 stop bit, and no flow control.

Turn on power to the switch and log in. If the connection is correct, the self-test results will be displayed. It takes 2 to 3 minutes for self-tests to complete.
- DS-DSGGB-AA: The switch automatically connects to the host and logs the user on to the switch as admin when the self-tests terminate. For subsequent logons, the default password is password.
- DS-DSGGC-AA/AB: Plugging in the DS-DSGGC-AA/AB switch turns the power on. (There is no on/off power switch.) Log in as the admin user. The password is password.

Enter the ipAddrSet command, then enter the IP address, subnet mask, and gateway address (if necessary). For example:

admin> ipAddrSet
Ethernet IP Address [10.77.77.77]: 16.142.72.54
Ethernet Subnetmask [255.255.255.0]: 255.255.255.0
Fibre Channel IP Address [none]: none
Fibre Channel Subnetmask [none]: none
Gateway Address [172.17.1.1]: none
[ y = set now, n = next reboot ]: y
Fabric OS (tm) Release v2.1.7
login:

When you have completed setting the IP address and subnet mask, disconnect the PC or terminal from the DS-DSGGB-AA or DS-DSGGC-AA/AB switch.

6.8.1.2.4 Logging In to the Switch with a Telnet Connection

Before you telnet to a Fibre Channel switch, you must set the Ethernet IP address and subnet mask.

Note

A serial port connection and a telnet session cannot both be active (at the same time) with the DS-DSGGB-AA and DS-DSGGC-AA/AB switch. The telnet session takes precedence and the serial port session is aborted when the telnet session is started.

You can use a telnet session to log in to the switch at one of three security levels. The default user names, which are listed from lowest security level to highest security level, are listed in Table 6-3.

Table 6-3: Telnet Session Default User Names for Fibre Channel Switches

DSGGA	DSGGB or DSGGC	Description
`other`	n/a	Allows you to execute commands ending in `Show`, such as `dateShow` and `portShow`.
`user`	`user`	Allows you to execute all commands ending in `Show`, plus any commands from the help menu that do not change the state of the switch, for example, `version` and `errDump`. You can change the passwords for all users up to and including the current user's security level.
`admin`	`admin`	Provides access to all the commands that show up in the help menu. Most switch administration is done when logged in as `admin`.
`n/a`	`root`	Gives users access to an extensive command set that can significantly alter system performance. Use root commands only at the request of Compaq customer service.

You can set the user names and passwords for users at or below the security level of the present login level by executing the passwd command. Enter a new user name (if desired) and a new password for the user.

Notes

Use Ctrl/H to correct typing errors.
Use the logout command to log out from any telnet connection.

6.8.1.2.5 Setting the Switch Name via Telnet Session

After you set the IP address and subnet mask, you can use a telnet session to log in to the switch to complete other switch management functions or monitor switch status. For example, if a system's /etc/hosts file contains an alias for the switch's IP address, set the switch name to the alias. This allows you to telnet to the switch name from that system. Telnet from a system that has the IP address in its /etc/hosts file and set the switch address as follows:

# telnet 132.25.47.146 
User admin
Passwd
:Admin> switchName fcsw1
:Admin> switchName
fcsw1
:Admin>

Note

When you telnet to the switch the next time, the prompt will include the switch name, for example:
fcsw1:Admin>
 

6.8.2 Installing and Setting Up the DS-SWXHB-07 Hub

The DS-SWXHB-07 hub supports up to seven 1.6025 Gb/sec ports. The ports can be connected to the KGPSA-CA PCI-to-Fibre Channel host bus adapter or to an HSG80 array controller.

Unlike the DSGGA switch, the DS-SWXHB-07 hub does not have any controls or even a power-on switch. Simply plug in the hub to power it on. The hub has a green power indicator on the front panel.

The DS-SWXHB-07 hub has slots to accommodate up to seven plug-in interface converters. Each interface converter in turn supports two 1-gigabit Gigabit Interface Converter (GBIC) modules. The GBIC module is the electrical-to-optical converter, and supports both 50-micron and 62.5-micron multi-mode fiber (MMF) using the standard SC connector. Only the 50-micron MMF optical cable is supported for the TruCluster Server products.

The GBIC modules and MMF optical cables are not provided with the hub. To obtain them, contact your authorized Compaq Service Representative.

6.8.2.1 Installing the Hub

Ensure that you place the hub within 500 meters (1640.4 feet) of the member systems (with KGPSA-CA PCI-to-Fibre Channel adapter) and the HSG80 array controllers.

The DS-SWXHB-07 hub can be placed on a flat, solid surface or, when configured in the DS-SWXHX-07 rack mount kit, part number 242795-B21, the hub can be mounted in a 48.7-cm (19-in) rackmount installation. (One rack kit holds two hubs.) The hub is shipped with rubber feet to prevent marring the surface.

When you plan the hub location, ensure that you provide access to the GBIC connectors on the back of the hub. All cables plug into the back of the hub.

Caution

Static electricity can damage modules and electronic components. We recommend using a grounded antistatic wrist strap and a grounded work surface when handling modules.

For an installation, at a minimum, you have to:

Place the hub on an acceptable surface or install it in the rackmount.

Install one or more GBIC modules. Gently push the GBIC module into an available port on the hub until you feel the GBIC module click into place. The GBIC module has a built-in guide key that prevents you from inserting it incorrectly. Do not use excessive force.

Connect the optical fiber cables. To do this, plug one end of an MMF cable into one of the GBIC modules installed in the hub. Attach an MMF cable for all active port connections. Unused ports or improperly seated GBIC modules remain in loop bypass and do not affect the operation of the loop.

Attach the other end of the MMF cable to either the KGPSA-CA adapter or to the HSG80.

Connect power to the hub using a properly grounded outlet. Look at the power indicator on the front of the hub to make sure that it powered on.

For more installation information, see the Fibre Channel Storage Hub 7 Installation Guide.

6.8.2.2 Determining the Hub Status

Because the DS-SWXHB-07 hub is not a manageable unit, examine the status of the LED indicators to make sure that the hub is operating correctly. The LED indicators will be particularly useful after you have connected the hub to the KGPSA-CA host adapters and the HSG80 controller. However, at this time you can use the LEDs to verify that the GBIC connectors are installed correctly.

At power on, with no optical cables attached, the green and amber LEDs should both be on, indicating that the port is active but that the connection is invalid. The other possible LED states are as follows:

Both off: Not active. Make sure that the GBIC is installed correctly.

Solid green: Indicates presence and proper functionality of a GBIC.

Green off: Indicates a fault condition (GBIC transmitter fault, improperly seated GBIC, no GBIC installed, or other failed device). The port is in bypass mode. This is the normal status for ports without GBICs installed.

Solid amber: Indicates that a loss of signal or poor signal integrity has put the port in bypass mode. Make sure that a GBIC is installed, that a cable is attached to the GBIC, and that the other end of the cable is attached to a KGPSA-CA or HSG80.

Amber off (and green on): Indicates that the port and device are fully operational.

For more information on determining the hub status, see the Fibre Channel Storage Hub 7 Installation Guide.

6.8.3 Installing and Configuring the KGPSA PCI-to-Fibre Channel Adapter Module

The following sections discuss KGPSA installation and configuration.

6.8.3.1 Installing the KGPSA PCI-to-Fibre Channel Adapter Module

To install the KGPSA-BC or KGPSA-CA PCI-to-Fibre Channel adapter modules, follow these steps. For more information, see the following documentation:

KGPSA-BC PCI-to-Optical Fibre Channel Host Adapter User Guide

64-Bit PCI-to-Fibre Channel Host Bus Adapter User Guide

Caution

Static electricity can damage modules and electronic components. We recommend using a grounded antistatic wrist strap and a grounded work surface when handling modules.

If necessary, install the mounting bracket on the KGPSA-BC module. Place the mounting bracket tabs on the component side of the board. Insert the screws from the solder side of the board.

The KGPSA-BC should arrive with the Gigabit Link Module (GLM) installed. If not, close the GLM ejector mechanism. Then, align the GLM alignment pins, alignment tabs, and connector pins with the holes, oval openings, and board socket. Press the GLM into place.
The KGPSA-CA does not use a GLM, it uses an embedded optical shortwave multimode Fibre Channel interface.

Install the KGPSA in an open 32-bit or 64-bit PCI slot.

Insert the optical cable SC connectors into the KGPSA-BC GLM or KGPSA-CA SC connectors. The SC connectors are keyed to prevent their being plugged in incorrectly. Do not use unnecessary force. Remember to remove the transparent plastic covering on the extremities of the optical cable.

Connect the fiber-optic cables to the shortwave Gigabit Interface Converter (GBIC) modules in the DSGGA, DSGGB, or DSGGC Fibre Channel switch.

6.8.3.2 Setting the KGPSA-BC or KGPSA-CA to Run on a Fabric

The KGPSA host bus adapter defaults to the fabric mode, and can be used in a fabric without taking any action. However, if you install a KGPSA that has been used in the loop mode on another system, you will need to reformat the KGPSA nonvolatile RAM (NVRAM) and configure it to run on a Fibre Channel fabric configuration.

Use the wwidmgr utility to determine the mode of operation of the KGPSA host bus adapter, and to set the mode if it needs changing (for example, from loop to fabric).

Notes

You must set the console to diagnostic mode to use the wwidmgr utility for the following AlphaServer systems: AS1200, AS4x00, AS8x00, GS60, GS60E, and GS140. Set the console to diagnostic mode as follows:
P00>>> set mode diag
Console is in diagnostic mode
P00>>>
 
The console remains in wwid manager mode (or diagnostic mode for the AS1200, AS4x00, AS8x00, GS60, GS60E, and GS140 systems), and you cannot boot until the system is re-initialized. Use the init command or a system reset to re-initialize the system after you have completed using the wwid manager.
If you try to boot the system and receive the following error, initialize the console to get out of WWID manager mode, then reboot:
P00>>> boot
warning -- main memory zone is not free
P00>>> init
       
.
.
.
P00>>> boot
 
If you have initialized and booted the system, then shut down the system and try to use the wwidmgr utility, you may be prevented from doing so. If you receive the following error, initialize the system and retry the wwidmgr command:
P00>>> wwidmgr -show adapter
wwidmgr available only prior to booting.
Reinit system and try again.
P00>>> init
       
.
.
.
P00>>> wwidmgr -show adapter
       
.
.
.
For more information on the wwidmgr utility, see the Wwidmgr User's Manual, which is on the Alpha Systems Firmware Update CD-ROM in the DOC directory.

Use the worldwide ID manager to show all KGPSA adapters:

P00>>> wwidmgr -show adapter
Link is down.
item       adapter           WWN                Cur. Topo  Next Topo
pga0.0.0.3.1 - Nvram read failed
[ 0] pga0.0.0.3.1         1000-0000-c920-eda0    FABRIC     UNAVAIL
pgb0.0.0.4.0 - Nvram read failed
[ 1] pgb0.0.0.4.0         1000-0000-c920-da01    FABRIC     UNAVAIL
pgc0.0.0.5.1 - Nvram read failed.
[ 2] pgc0.0.0.5.1         1000-0000-c920-cd9c    FABRIC     UNAVAIL    
[9999] All of the above.

The Link is down message indicates that one of the adapters is not available, probably due to its not being plugged into a switch. The warning message Nvram read failed indicates that the KGPSA NVRAM has not been initialized and formatted. The next topology will always be UNAVAIL for the host bus adapter that has an unformatted NVRAM. Both messages are benign and can be ignored for the fabric mode of operation. To correct the Nvram read failed situation, use the wwidmgr -set adapter command.

The previous display shows that all three KGPSA host bus adapters are set for fabric topology as the current topology, the default. When operating in a fabric, if the current topology is FABRIC, it does not matter if the next topology is Unavail, or that the NVRAM is not formatted (Nvram read failed).

If, however, the current topology is LOOP, you have to change the topology to FABRIC to operate in a fabric. You will never see the Nvram read failed message if the current topology is LOOP. The NVRAM has to have been formatted to change the current mode to LOOP.

Consider the case where the KGPSA current topology is LOOP as follows:

P00>>> wwidmgr -show adapter
item       adapter           WWN                Cur. Topo  Next Topo
[ 0] pga0.0.0.3.1         1000-0000-c920-eda0    LOOP       LOOP   
[ 1] pgb0.0.0.4.0         1000-0000-c920-da01    LOOP       LOOP   
[9999] All of the above.

If the current topology for an adapter is LOOP, set an individual adapter to FABRIC by using the item number for that adapter (for example, 0 or 1). Use 9999 to set all adapters:

P00>>> wwidmgr -set adapter -item 9999 -topo fabric
Reformatting nvram
Reformatting nvram

Note

The qualifier in the previous command is -topo and not -topology. You will get an error if you use -topology.

Displaying the adapter information again will show the topology that the adapters will assume after the next console initialization:

P00>>> wwidmgr -show adapter
item       adapter           WWN                Cur. Topo  Next Topo
[ 0]    pga0.0.0.4.1    1000-0000-c920-eda0       LOOP       FABRIC
[ 1]    pgb0.0.0.3.0    1000-0000-c920-da01       LOOP       FABRIC
[9999] All of the above.
P00>>> init

This display shows that the current topology for both KGPSA host bus adapters is LOOP, but will be FABRIC after the next initialization.

The system initialization configures the KGPSAs to run on a fabric.

6.8.3.3 Setting the KGPSA-CA Adapter to Run in a Loop

If you do not want to use the KGPSA-CA adapter in loop mode, you can skip this section.

Before you can use the KGPSA adapter in loop mode, you must set the link type of the adapter to LOOP. You use the wwidmgr to accomplish this task.

Version 5.8 of the SRM console is required for boot support.

The version of the wwidmgr utility included with the SRM console can set the KGPSA to run in arbitrated loop mode or in fabric mode. Specifically, the wwidmgr -set adapter command stores the selected topology into the nonvolatile random-access memory (NVRAM) storage on the KGPSA adapter. The adapter retains this setting even if the adapter is later moved to another system.

Link Type

If a KGPSA in loop mode is connected to a Fibre Channel switch, the results are unpredictable. The same is true for a KGPSA in fabric mode that is connected to a loop. Therefore, determine the topology setting before using the adapter.

The wwidmgr utility is documented in the Wwidmgr User's Manual, which is located in the DOC subdirectory of the Alpha Systems Firmware CD-ROM.

The steps required to set the link type are summarized here; see the Wwidmgr User's Manual for complete information and additional examples.

Assuming that you have the required console firmware, use the wwidmgr utility to set the link type, as follows:

Display the adapter on the system to determine its configuration:
```
POO>>> wwidmgr -show adapter
 
item       adapter           WWN                Cur. Topo   Next Topo
kgpsaa0.0.0.4.6 - Nvram read failed.
[ 0]    kgpsaa0.0.0.4.6    1000-0000-c920-05ab    FABRIC     UNAVAIL
[9999] All of the above.
 
 
```
The warning message Nvram read failed indicates that the NVRAM on the KGPSA adapter has not been initialized and formatted. This is expected and is corrected when you set the adapter link type.

Set the link type on the adapter using the following values:
- loop : sets the link type to loop (FC-AL)
- fabric : sets the link type to fabric (point to point)
You use the item number to indicate which adapter you wanted to change. For example, to configure adapter 0 (zero) for loop, use the following command:
```
POO>>> wwidmgr -set adapter -item 0 -topo loop
 
```
The item number 9999 refers to all adapters. If you have KGPSA adapters configured for both arbitrated loop and fabric topologies, selecting 9999 will set them all to loop mode.

Verify the adapter settings:

POO>>> wwidmgr -show adapter
  item       adapter           WWN                Cur. Topo  Next Topo 
 [ 0]    kgpsaa0.0.0.4.6   1000-0000-c920-05ab     FABRIC     LOOP

After making the change, reinitialize the console:
```
POO>>> init
```

Boot the system. The emx driver (Version 1.12 or higher is required) displays a message at boot when it recognizes the console setting, and configures the link accordingly.

Repeat this process for the other cluster member if this is a two-node TruCluster configuration.

6.8.3.4 Obtaining the Worldwide Names of KGPSA Adapters

A worldwide name is a unique number assigned to a subsystem by the Institute of Electrical and Electronics Engineers (IEEE) and set by the manufacturer prior to shipping. The worldwide name assigned to a subsystem never changes. We recommend that you obtain and record the worldwide names of Fibre Channel components in case you need to verify their target ID mappings in the operating system.

Fibre Channel devices have both a node name and a port name worldwide name, both of which are 64-bit numbers. Most commands that you use with Fibre Channel only show the port name.

There are multiple ways to obtain the KGPSA port name worldwide name:

You can obtain the worldwide name from a label on the KGPSA module before you install it.

You can use the show dev command as follows:

P00>>> show dev
       
.
.
.
pga0.0.0.1.0       PGA0    WWN 1000-0000-c920-eda0
pgb0.0.0.2.0       PGB0    WWN 1000-0000-c920-da01

You can use the wwidmgr -show adapter command as follows:

P00>>> wwidmgr -show adapter
item       adapter           WWN                Cur. Topo  Next Topo 
[ 0]    pga0.0.0.4.1    1000-0000-c920-eda0       FABRIC     FABRIC     
[ 1]    pgb0.0.0.3.0    1000-0000-c920-da01       FABRIC     FABRIC     
[9999] All of the above.

If the operating system is installed, the worldwide name of a KGPSA adapter is also displayed in the boot messages generated when the emx driver attaches to the adapter when the adapter's host system boots. Or, you can use the grep utility and obtain the worldwide name from the /var/adm/messages file as follows:

# grep wwn /var/adm/messages
F/W Rev 2.20X2(1.12): wwn 1000-0000-c920-eda0
F/W Rev 2.20X2(1.12): wwn 1000-0000-c920-eda0
F/W Rev 2.20X2(1.12): wwn 1000-0000-c920-eda0
       
.
.
.

Record the worldwide name of each KGPSA adapter for later use.

6.8.4 Setting Up the HSG80 Array Controller for Tru64 UNIX Installation

This section covers setting up the HSG80 controller for operation with Tru64 UNIX Version 5.1A and TruCluster Server Version 5.1A.

The steps described here apply to both fabric and arbitrated loop configurations. However, arbitrated loop requires specific settings for the port topology and AL_PA values. If this is an arbitrated loop configuration, follow the steps described here, taking note of the difference in the port topology setting. Then see Section 6.8.4.1 for additional information.

For more information on installing the HSG80, see the Compaq StorageWorks HSG80 Array Controller ACS Version 8.5 Configuration Guide or Compaq StorageWorks HSG80 Array Controller ACS Version 8.5 CLI Reference Guide.

To set up an HSG80 for TruCluster Server operation, follow these steps:

If they are not already installed, install the HSG80 controllers into the RA8000 or ESA12000 storage arrays.

If the external cache battery (ECB) is used, ensure that it is connected to the controller cache modules.

Install the fiber-optic cables between the KGPSA and the switch or hub.

Set the power verification and addressing (PVA) ID. Use PVA ID 0 for the enclosure that contains the HSG80 controllers. Set the PVA ID to 2 and 3 on expansion enclosures (if present).

Note

Do not use PVA ID 1.
With Port-Target-LUN (PTL) addressing, the PVA ID is used to determine the target ID of the devices on ports 1 through 6 (the LUN is always zero). Valid target ID numbers are 0 through 15, excluding numbers 4 through 7. Target IDs 6 and 7 are reserved for the controller pair, and target IDs 4 and 5 are never used.
The enclosure with PVA ID 0 will contain devices with target IDs 0 through 3; with PVA ID 2, target IDs 8 through 11; with PVA ID 3, target IDs 12 through 15. Setting a PVA ID of an enclosure to 1 would set target IDs to 4 through 7, generating a conflict with the target IDs of the controllers.

Remove the program card ESD cover and insert the controller's program card. Replace the ESD cover.

Install disks into storage shelves.

Connect a terminal to the maintenance port on one of the HSG80 controllers. You need a local connection to configure the controller for the first time. The maintenance port supports serial communication with the following default values:
- 9600 bits/sec
- 8 data bits
- 1 stop bit
- No parity

Connect the RA8000 or ESA12000 to the power source and apply power.

Note

For the HSG80 to see the connection to the KGPSAs, the KGPSA host bus adapters must be cabled to the switch or hub, with the system power applied before you turn power on to the RA8000/ESA12000.

If an uninterruptible power supply (UPS) is used instead of the external cache battery, to prevent the controller from periodically checking the cache batteries after power is applied, enter the following command:
```
HSG80> set this CACHE_UPS
 
```
Note

Setting the controller variable CACHE_UPS for one controller sets it for both controllers.

From the maintenance terminal, use the show this and show other commands to verify that controllers have the current firmware version. See the Compaq StorageWorks HSG80 Array Controller ACS Version 8.5 CLI Reference Guide for information on upgrading the firmware.

To ensure proper operation of the HSG80 with Tru64 UNIX and TruCluster Server, set the the controller values as follows. Note that the port topology setting of fabric is specific to fabric configurations.
```
HSG80> set nofailover  [1]
HSG80> clear cli  [2]
HSG80> set this mirrored_cache  [3]
HSG80> set multibus copy = this   [4]
HSG80> clear cli  [5]
HSG80> set this port_1_topology = offline  [6]
HSG80> set this port_2_topology = offline  [6]
HSG80> set other port_1_topology = offline  [6] 
HSG80> set other port_2_topology = offline  [6] 
HSG80> set this port_1_topology = fabric  [7]
HSG80> set this port_2_topology = fabric  [7]
HSG80> set other port_1_topology = fabric  [7] 
HSG80> set other port_2_topology = fabric  [7]
HSG80> set this time=dd-mmm-yyyy:hh:mm:ss  [8]
HSG80> set this scsi_version = scsi-3  [9]
HSG80> set other scsi_version = scsi-3  [9]
HSG80> restart other  [10]
HSG80> restart this  [10]
 
```
1. Removes any failover mode that may have been previously configured. [Return to example]
2. Prevents the command line interpreter (CLI) from reporting a misconfiguration error resulting from not having a failover mode set. [Return to example]
3. Sets up mirrored cache, if desired, for the controller pair. [Return to example]
4. Puts the controller pair into multiple-bus failover mode. Ensure that you copy the configuration information from the controller known to have a good array configuration.
  
  Note
  
  Use the set failover copy = this_controller command to set transparent failover mode.
  [Return to example]
5. When the command is entered to set multiple-bus failover and copy the configuration information to the other controller, the other controller will restart. The restart may set off the audible alarm (which is silenced by pressing the button on the EMU). The CLI will display an event report, and continue reporting the condition until cleared with the clear cli command. [Return to example]
6. Takes the ports off line and resets the topology to prevent an error message when setting the port topology. [Return to example]
7. Sets fabric as the switch topology. If this is an arbitrated loop configuration, see Section 6.8.4.1 for configuration differences. [Return to example]
8. Sets the date and time on this controller. In a dual-redundant configuration, the command sets the time on both controllers. The value takes effect immediately. [Return to example]
9. Specifies the host protocol to use. You can use either SCSI-2 or SCSI-3.
  Setting the SCSI_VERSION to SCSI-2 allows a disk unit to be at LUN 0, and specifies that the command console LUN (CCL) is not fixed at a particular location, but floats to the first available LUN.
  If SCSI_VERSION is set to SCSI-3, the CCL is presented at LUN 0 for all connection offsets. Do not assign unit 0 at any connection offset because the unit would be masked by the CCL at LUN 0 and would not be available.
  Setting SCSI_VERSION to SCSI-3 is preferred because the CCL is fixed and it is much easier to manage a fixed CCL than a CCL that can change. [Return to example]
10. Prompts you to restart both controllers after you set the SCSI version. [Return to example]

Enter the show connection command as shown in Example 6-1 to determine the HSG80 connection names for the connections to the KGPSA host bus adapters. For an RA8000/ESA12000 with dual-redundant HSG80s in multiple-bus failover mode, there will be four connections for each KGPSA in the cluster (as long as all four HSG80 ports are connected to the same fabric).

For example, in a two-node cluster with two KGPSAs in each member system, and an RA8000 or ESA12000 with dual-redundant HSG80s, there will be 16 connections for the cluster. If you have other systems or clusters connected to the switches in the fabric, there will be other connections for the other systems. (Arbitrated loop configurations support a maximum of two systems.) In Example 6-1, note that the ! (exclamation mark) is part of the connection name. The HOST_ID is the KGPSA host name worldwide name and the ADAPTER_ID is the port name worldwide name.

Example 6-1: Determine HSG80 Connection Names

HSG80 show connection
Connection                                                                Unit
   Name      Operating system    Controller  Port   Address  Status      Offset
 
 
!NEWCON49      TRU64_UNIX           THIS       2    230813   OL this       0
           HOST_ID=1000-0000-C920-DA01         ADAPTER_ID=1000-0000-C920-DA01
 
!NEWCON50      TRU64_UNIX           THIS       1    230813   OL this       0
           HOST_ID=1000-0000-C920-DA01         ADAPTER_ID=1000-0000-C920-DA01
 
!NEWCON51      TRU64_UNIX           THIS       2    230913   OL this       0
           HOST_ID=1000-0000-C920-EDEB         ADAPTER_ID=1000-0000-C920-EDEB
 
!NEWCON52    TRU64_UNIX           THIS         1    230913   OL this       0
           HOST_ID=1000-0000-C920-EDEB         ADAPTER_ID=1000-0000-C920-EDEB
 
!NEWCON53      TRU64_UNIX           OTHER      1    230913   OL other      0
           HOST_ID=1000-0000-C920-EDEB         ADAPTER_ID=1000-0000-C920-EDEB
 
!NEWCON54      TRU64_UNIX           OTHER      1    230813   OL other      0
           HOST_ID=1000-0000-C920-DA01         ADAPTER_ID=1000-0000-C920-DA01
 
!NEWCON55      TRU64_UNIX           OTHER      2    230913   OL other      0
           HOST_ID=1000-0000-C920-EDEB         ADAPTER_ID=1000-0000-C920-EDEB
 
!NEWCON56      TRU64_UNIX           OTHER      2    230813   OL other      0
           HOST_ID=1000-0000-C920-DA01         ADAPTER_ID=1000-0000-C920-DA01

.
.
.
!NEWCON61      TRU64_UNIX           THIS       2      210513   OL this       0
           HOST_ID=1000-0000-C921-086C         ADAPTER_ID=1000-0000-C921-086C
 
!NEWCON62      TRU64_UNIX           OTHER      1      210513   OL other      0
           HOST_ID=1000-0000-C921-086C         ADAPTER_ID=1000-0000-C921-086C
 
!NEWCON63      TRU64_UNIX           OTHER      1               offline       0
           HOST_ID=1000-0000-C921-0943         ADAPTER_ID=1000-0000-C921-0943
 
!NEWCON64      TRU64_UNIX           OTHER      1      210413   OL other      0
           HOST_ID=1000-0000-C920-EDA0         ADAPTER_ID=1000-0000-C920-EDA0
 
!NEWCON65      TRU64_UNIX           OTHER      2      210513   OL other      0
           HOST_ID=1000-0000-C921-086C         ADAPTER_ID=1000-0000-C921-086C

.
.
.
!NEWCON74      TRU64_UNIX           THIS       2      210413   OL this       0
           HOST_ID=1000-0000-C920-EDA0         ADAPTER_ID=1000-0000-C920-EDA0
 
!NEWCON75      TRU64_UNIX           THIS       2               offline       0
           HOST_ID=1000-0000-C921-0A75         ADAPTER_ID=1000-0000-C921-0A75
 
!NEWCON76      TRU64_UNIX           THIS       1      210413   OL this       0
           HOST_ID=1000-0000-C920-EDA0         ADAPTER_ID=1000-0000-C920-EDA0
 
!NEWCON77      TRU64_UNIX           THIS       1    210513   OL this       0
           HOST_ID=1000-0000-C921-086C         ADAPTER_ID=1000-0000-C921-086C
 
!NEWCON78      TRU64_UNIX           THIS       2             offline       0
           HOST_ID=1000-0000-C920-CB77         ADAPTER_ID=1000-0000-C920-CB77
 
!NEWCON79      TRU64_UNIX           OTHER      1             offline       0
           HOST_ID=1000-0000-C920-CB77         ADAPTER_ID=1000-0000-C920-CB77

.
.
.

Note

You can change the connection name with the HSG80 CLI RENAME command. The new connection name is limited to nine characters. You cannot use a comma (,) or backslash (\) in the connection name, and you cannot rename the connection to a name of the form used by the HSG80 (!NEWCON50). For example, assume that member system pepicelli has two KGPSA Fibre Channel host bus adapters, and that the worldwide name for KGPSA pga is 1000-0000-C920-DA01. Example 6-1 shows that the connections for pga are !NEWCON49, !NEWCON50, !NEWCON54, and !NEWCON56. You can change the name of !NEWCON49 to indicate that it is the first connection (of four) to pga on member system pepicelli as follows:
HSG80> rename !NEWCON49 pep_pga_1
 

For each connection to your cluster, verify that the operating system is TRU64_UNIX and the unit offset is 0. Search the show connection display for the worldwide name of each of the KGPSA adapters in your cluster member systems. If the operating system and offsets are incorrect, set them, then restart both controllers as follows:
```
HSG80> set !NEWCON49 unit_offset = 0  [1]
HSG80> set !NEWCON49 operating_system = TRU64_UNIX  [2]
HSG80> restart other  [3]
HSG80> restart this  [3]
       
.
.
.
HSG80> show connection  [4]
 
```
1. Sets the relative offset for LUN numbering to 0. You can set the unit_offset to nonzero values, but use caution. You may not be able to access storage units if you set the unit_offset improperly. [Return to example]
2. Specifies that the host environment that is connected to the Fibre Channel port is TRU64_UNIX. You must change each connection to TRU64_UNIX. This is very important.
  
  Caution
  
  Failure to set this to TRU64_UNIX will prevent your system from booting correctly, recovering from run-time errors, or from booting at all. (In this release, arbitrated loop configurations cannot boot from storage connected via Fibre Channel.) The default operating system is Windows NT, which uses a different SCSI dialect to talk to the HSG80 controller.
  [Return to example]
3. Restarts both controllers to cause all changes to take effect. [Return to example]
4. Verifies that all connections have the offsets set to 0 and the operating system is set to TRU64_UNIX. [Return to example]
Note

If the fiber-optic cables are not properly installed, there will be inconsistencies in the connections shown.

Set up the storage sets as required for the applications to be used. An example is provided in Section 6.9.1.1.

6.8.4.1 Setting Up the HSG80 Array Controller for Arbitrated Loop

Section 6.8.4 describes settings that are common to both fabric and arbitrated loop configurations. This section describes settings that are unique to setting up the HSG80 controller for the arbitrated loop topology.

For more information on installing the HSG80 in an arbitrated loop topology, see the Compaq StorageWorks HSG80 Array Controller ACS Version 8.5 Configuration Guide.

To set up an HSG80 for TruCluster arbitrated loop operation, follow steps 1 through 10 in Section 6.8.4. Then, in step 11, use the maintenance terminal to set the controller values as follows:

Set the PORT_x_TOPOLOGY value to LOOP_HARD. For example:

HSG80> set nofailover                      
HSG80> clear cli                           
HSG80> set multibus copy = this            
HSG80> clear cli                           
HSG80> set this port_1_topology = offline  
HSG80> set this port_2_topology = offline  
HSG80> set other port_1_topology = offline 
HSG80> set other port_2_topology = offline 
HSG80> set this port_1_topology = LOOP_HARD   
HSG80> set this port_2_topology = LOOP_HARD   
HSG80> set other port_1_topology = LOOP_HARD  
HSG80> set other port_2_topology = LOOP_HARD

The PORT_x_TOPOLOGY value of LOOP_HARD enables arbitrated loop operation. Although the HSG80 controller also permits a topology setting of LOOP_SOFT, this is not supported in Tru64 UNIX.

Set PORT_x_AL_PA to unique values. PORT_x _AL_PA specifies the hexadecimal arbitrated loop physical address (AL_PA) for the HSG80 host ports.
This is the preferred address, but the HSG80 controller is free to use whatever AL_PA it obtains during loop initialization. However, the address you specify must be valid and must not be used by another port. If the controller is unable to obtain the address you specify (for example, because two ports are configured for the same address), the controller cannot come up on the loop.
In particular, if you do not set PORT_x_AL_PA, multiple ports might attempt to use the default address, thus causing a conflict.
The valid AL_PA addresses are within the range of 0-EF (hexadecimal), but not all addresses within this range are valid; the default value is 69 (hexadecimal).
The list of valid AL_PA addresses is as follows:
```
           0x01, 0x02, 0x04, 0x08, 0x0F, 0x10, 0x17, 0x18, 0x1B, 0x1D,
           0x1E, 0x1F, 0x23, 0x25, 0x26, 0x27, 0x29, 0x2A, 0x2B, 0x2C,
           0x2D, 0x2E, 0x31, 0x32, 0x33, 0x34, 0x35, 0x36, 0x39, 0x3A,
           0x3C, 0x43, 0x45, 0x46, 0x47, 0x49, 0x4A, 0x4B, 0x4C, 0x4D,
           0x4E, 0x51, 0x52, 0x53, 0x54, 0x55, 0x56, 0x59, 0x5A, 0x5C,
           0x63, 0x65, 0x66, 0x67, 0x69, 0x6A, 0x6B, 0x6C, 0x6D, 0x6E,
           0x71, 0x72, 0x73, 0x74, 0x75, 0x76, 0x79, 0x7A, 0x7C, 0x80,
           0x81, 0x82, 0x84, 0x88, 0x8F, 0x90, 0x97, 0x98, 0x9B, 0x9D,
           0x9E, 0x9F, 0xA3, 0xA5, 0xA6, 0xA7, 0xA9, 0xAA, 0xAB, 0xAC,
           0xAD, 0xAE, 0xB1, 0xB2, 0xB3, 0xB4, 0xB5, 0xB6, 0xB9, 0xBA,
           0xBC, 0xC3, 0xC5, 0xC6, 0xC7, 0xC9, 0xCA, 0xCB, 0xCC, 0xCD,
           0xCE, 0xD1, 0xD2, 0xD3, 0xD4, 0xD5, 0xD6, 0xD9, 0xDA, 0xDC,
           0xE0, 0xE1, 0xE2, 0xE4, 0xE8, 0xEF
 
```
In multiple-bus failover mode, each port must have a unique AL_PA address because all of the ports can be active at the same time.
The convention in transparent failover mode is to use the same AL_PA address for Port 1 on both controllers and the same AL_PA address for Port 2 on both controllers. This allows the standby port on the alternate controller to have the same AL_PA address in the event of a failover. Because the ports are not active at the same time, the AL_PA addresses do not conflict. Make sure that the AL_PA address assigned to Port 1 is not the same as that assigned to Port 2, because they are distinct devices on the Fibre Channel loop.
The following example sets the PORT_x_AL_PA value for ports on two HSG80 controllers in multiple-bus failover mode:
```
HSG80> set this PORT_1_AL_PA  = 01   
HSG80> set this PORT_2_AL_PA  = 02   
HSG80> set other PORT_1_AL_PA  = 04   
HSG80> set other PORT_2_AL_PA  = 08   
 
```
The following example sets the PORT_x_AL_PA value for ports on two HSG80 controllers in transparent failover mode:
```
HSG80> set this PORT_1_AL_PA  = 01   
HSG80> set this PORT_2_AL_PA  = 02   
HSG80> set other PORT_1_AL_PA  = 01   
HSG80> set other PORT_2_AL_PA  = 02   
 
```

After you have done this, continue with steps 12 through 14 in Section 6.8.4.

6.8.4.2 Obtaining the Worldwide Names of HSG80 Controller

The RA8000 or ESA12000 is assigned a worldwide name when the unit is manufactured. The worldwide name (and checksum) of the unit appears on a sticker placed above the controllers. The worldwide name ends in zero (0), for example, 5000-1FE1-0000-0D60. You can also use the SHOW THIS_CONTROLLER Array Controller Software (ACS) command.

For HSG80 controllers, the controller port IDs are derived from the RA8000/ESA12000 worldwide name as follows:

In a subsystem with two controllers in transparent failover mode, the controller port IDs increment as follows:
- Controller A and controller B, port 1 -- worldwide name + 1
- Controller A and controller B, port 2 -- worldwide name + 2
For example, using the worldwide name of 5000-1FE1-0000-0D60, the following port IDs are automatically assigned and shared between the ports as a REPORTED PORT_ID on each port:
- Controller A and controller B, port 1 -- 5000-1FE1-0000-0D61
- Controller A and controller B, port 2 -- 5000-1FE1-0000-0D62

In a configuration with dual-redundant controllers in multiple-bus failover mode, the controller port IDs increment as follows:
- Controller A port 1 -- worldwide name + 1
- Controller A port 2 -- worldwide name + 2
- Controller B port 1 -- worldwide name + 3
- Controller B port 2 -- worldwide name + 4
For example, using the worldwide name of 5000-1FE1-0000-0D60, the following port IDs are automatically assigned and shared between the ports as a REPORTED PORT_ID on each port:
- Controller A port 1 -- 5000-1FE1-0000-0D61
- Controller A port 2 -- 5000-1FE1-0000-0D62
- Controller B port 1 -- 5000-1FE1-0000-0D63
- Controller B port 2 -- 5000-1FE1-0000-0D64

Because the HSG80 controller's configuration information and worldwide name is stored in nonvolatile random-access memory (NVRAM) on the controller, there are different procedures for replacing HSG80 controllers in an RA8000 or ESA12000:

If you replace one controller of a dual-redundant pair, the NVRAM from the remaining controller retains the configuration information (including worldwide name). When you install the replacement controller, the existing controller transfers configuration information to the replacement controller.

If you have to replace the HSG80 controller in a single controller configuration, or if you must replace both HSG80 controllers in a dual-redundant configuration simultaneously, you have two options:
- If the configuration has been saved to disk (with the INITIALIZE DISKnnnn SAVE_CONFIGURATION or INITIALIZE storageset-name SAVE_CONFIGURATION option), you can restore it from disk with the CONFIGURATION RESTORE command.
- If you have not saved the configuration to disk, but the label containing the worldwide name and checksum is still intact, or you have recorded the worldwide name and checksum (Section 6.8.4.2) and other configuration information, you can use the command-line interpreter (CLI) commands to configure the new controller and set the worldwide name. Set the worldwide name as follows:
```
SET THIS NODEID=nnnn-nnnn-nnnn-nnnn checksum
 
```

6.9 Installing Tru64 UNIX and TruCluster Server Software Using Fibre Channel Storage

After the hardware has been installed and configured, there are preliminary steps that must be completed before you install Tru64 UNIX and TruCluster Server on Fibre Channel disks.

You may already know the physical devices on which you want to install Tru64 UNIX and TruCluster Server. That is, you probably know the location of the devices within the storage cabinet, and are aware that the HSG80 show unit command displays the unit number of each of these devices. For example, DISK10200 might be unit number D1.

You need to equate the unit number that is identified by the HSG80 controller with device names that the AlphaServer console can use. That is, the AlphaServer console must know about the devices before it can boot from, or dump to them, and it must have a valid Fibre Channel connection to each of those devices. For example, to boot from storage unit D1 as presented by the HSG80 controller, the AlphaServer console requires a device name such as dga133.1002.0.1.0 that identifies the storage unit. In addition, dga133.1002.0.1.0 must be reachable via a valid Fibre Channel connection.

This section describes how to perform the following tasks:

Before the installation:
1. Configure HSG80 storagesets -- In this manual, example storagesets are configured for both Tru64 UNIX and TruCluster Server on Fibre Channel storage. Modify the storage configuration to meet your needs (Section 6.9.1.1).
2. Set a user-defined identifier (UDID) for each storage unit -- Although Tru64 UNIX does not use this identifier directly, you use the UDID as input to the wwidmgr -quickset command in a subsequent step (Section 6.9.1.2).
3. Use the UDID as input to the wwidmgr -quickset command to set the device unit number -- The device unit number is a subset of the device name (as shown in a show device display). For example, in the device name DKA100.1001.0.1.0, the device unit number is 100 (DKA100). The Fibre Channel worldwide name (which is often referred to as the worldwide ID or WWID) is too long (64 bits) to be used as the device unit number. Therefore, you set a device unit number that is an alias for the Fibre Channel worldwide name (Section 6.9.1.3).
4. Display available Fibre Channel boot devices -- When you set the device unit number, you also set the wwidn and Nn console environment variables. These variables indicate which Fibre channel devices the console can access and which HSG80 ports can be used to access the devices. The wwidn variables also show which devices are displayed by the show dev console command, indicating that the devices can be used for booting or dumping (Section 6.9.1.4).
5. Set the bootdef_dev console environment variable -- Before you install the operating system (or cluster software), you must set the bootdef_dev console environment variable to ensure that you boot from the correct disk over a reachable path (Section 6.9.1.5).

Installing the base operating system and TruCluster Server software:
1. Install the base operating system (Section 6.9.2).
2. Determine the /dev/disk/dskn values to use as TruCluster Server disks (Section 6.9.3).
3. Label the disks that were used to create the cluster (Section 6.9.4).
4. Install the TruCluster Server software and create the first cluster member (Section 6.9.5).
5. Reset the bootdef_dev console environment variable (Section 6.9.6).
6. Add additional systems to the cluster (Section 6.9.7).

If you are installing either the Tru64 UNIX operating system or TruCluster Server software, follow the procedure in Section 6.7.

6.9.1 Before You Install

The following sections cover the preliminary steps that must be completed before you install Tru64 UNIX and TruCluster Server on Fibre Channel disks.

6.9.1.1 Configure the HSG80 Storagesets

After the hardware has been installed and configured, storagesets must be configured for software installation. The following disks/disk partitions are needed for base operating system and cluster installation:

Tru64 UNIX disk

Cluster root (/)

Cluster /usr

Cluster /var

Member boot disk (one for each cluster member system)

Quorum disk (if used)

If you are installing only the operating system, you need only the Tru64 UNIX disk (and any disks that are needed for applications). In this document we assume that both the base operating system and cluster software are to be installed on Fibre Channel disks.

If you are installing a cluster, you need one or more disks to hold the Tru64 UNIX operating system. The disks are either private disks on the system that will become the first cluster member, or disks on a shared bus that the system can access. Whether the Tru64 UNIX disk is on a private disk or a shared disk, shut down the cluster before booting a cluster member system standalone from the Tru64 UNIX disk.

An example configuration will show the procedure necessary to set up disks for base operating system and cluster installation. Modify the procedure according to your own disk needs. You can use any supported RAID level.

The example is based on four 4-GB disks that are used to create two mirrorsets (RAID level 1) to provide reliability. The mirrorsets are partitioned to provide partitions of appropriate sizes. Disks 30200, 30300, 40000, and 40100 are used for the mirrorsets.

Table 6-4 contains the necessary information to convert from the HSG80 unit numbers to /dev/disk/dskn and device names for the example configuration. A blank table (Table A-1) is provided in Appendix A for use in an actual installation.

One mirrorset, the BOOT-MIR mirrorset, is used for the Tru64 UNIX and cluster member system boot disks. The other mirrorset, CROOT-MIR, is used for the cluster root (/), cluster /usr, cluster /var, and quorum disks.

To set up the example disks for operating system and cluster installation, follow the steps in Example 6-2.

Example 6-2: Setting Up the Mirrorset

HSG80> RUN CONFIG  [1]
Config Local Program Invoked
 
Config is building its table and determining what devices exist
on the system. Please be patient.
 
add disk DISK30200  3 2 0
add disk DISK30300  3 3 0
add disk DISK40000  4 0 0
add disk DISK40100  4 1 0
...
 
Config - Normal Termination
 
HSG80> ADD MIRRORSET BOOT-MIR DISK30200 DISK40000  [2]
HSG80> ADD MIRRORSET CROOT-MIR DISK30300 DISK40100  [2]
HSG80> INITIALIZE BOOT-MIR  [3]
HSG80> INITIALIZE CROOT-MIR  [3]
HSG80> SHOW BOOT-MIR  [4]
Name          Storageset                     Uses             Used by
--------------------------------------------------------------------
 
BOOT-MIR      mirrorset                      DISK30200        
                                             DISK40000        
        Switches:
          POLICY (for replacement) = BEST_PERFORMANCE
          COPY (priority) = NORMAL
          READ_SOURCE = LEAST_BUSY
          MEMBERSHIP = 2, 2 members present
        State:
          UNKNOWN -- State only available when configured as a unit
        Size:             8378028 blocks
HSG80> SHOW CROOT-MIR  [4]
Name          Storageset                     Uses             Used by
--------------------------------------------------------------------
 
CROOT-MIR     mirrorset                      DISK30300        
                                             DISK40100        
        Switches:
          POLICY (for replacement) = BEST_PERFORMANCE
          COPY (priority) = NORMAL
          READ_SOURCE = LEAST_BUSY
          MEMBERSHIP = 2, 2 members present
        State:
          UNKNOWN -- State only available when configured as a unit
        Size:             8378028 blocks
HSG80> CREATE_PARTITION BOOT-MIR SIZE=25  [5]
HSG80> CREATE_PARTITION BOOT-MIR SIZE=25  [5]
HSG80> CREATE_PARTITION BOOT-MIR SIZE=LARGEST  [5]
HSG80> CREATE_PARTITION CROOT-MIR SIZE=5  [6]
HSG80> CREATE_PARTITION CROOT-MIR SIZE=15  [6]
HSG80> CREATE_PARTITION CROOT-MIR SIZE=40  [6]
HSG80> CREATE_PARTITION CROOT-MIR SIZE=LARGEST  [6]
HSG80> SHOW BOOT-MIR  [7]
Name          Storageset                     Uses             Used by
---------------------------------------------------------------------
 
BOOT-MIR      mirrorset                      DISK30200        
                                             DISK40000        
        Switches:
          POLICY (for replacement) = BEST_PERFORMANCE
          COPY (priority) = NORMAL
          READ_SOURCE = LEAST_BUSY
          MEMBERSHIP = 2, 2 members present
        State:
          UNKNOWN -- State only available when configured as a unit
        Size:             8378028 blocks
        Partitions:
          Partition number        Size               Starting Block     Used by
          ---------------------------------------------------------------------
            1                  2094502 (   1072.38 MB)            0         [8]
            2                  2094502 (   1072.38 MB)      2094507         [9]
            3                  4189009 (   2144.77 MB)      4189014         [10]
HSG80> 
 
 
HSG80> SHOW CROOT-MIR  [11]
Name          Storageset                     Uses             Used by
------------------------------------------------------------------------------
 
CROOT-MIR     mirrorset                      DISK30300        
                                             DISK40100        
        Switches:
          POLICY (for replacement) = BEST_PERFORMANCE
          COPY (priority) = NORMAL
          READ_SOURCE = LEAST_BUSY
          MEMBERSHIP = 2, 2 members present
        State:
          UNKNOWN -- State only available when configured as a unit
        Size:             8378028 blocks
        Partitions:
          Partition number        Size               Starting Block     Used by
          ---------------------------------------------------------------------
            1                   418896 (    214.47 MB)            0         [12]
            2                  1256699 (    643.42 MB)       418901         [13]
            3                  3351206 (   1715.81 MB)      1675605         [14]
            4                  3351207 (   1715.81 MB)      5026816         [15]

Configures the devices on the device side buses and add them to the controller configuration. The CONFIG utility takes about 2 minutes to complete. You can use the ADD DISK command to add disk drives to the configuration manually. [Return to example]

Creates the BOOT-MIR mirrorset using disks 30200 and 30300 and the CROOT-MIR mirrorset using disks 40000 and 40100. [Return to example]

Initializes the BOOT-MIR and CROOT-MIR mirrorsets. If you want to set any initialization switches, you must do so in this step. The BOOT-MIR mirrorset will be used for the Tru64 UNIX and cluster member system boot disks. The CROOT-MIR mirrorset will be used for the cluster root (/), cluster /usr and cluster /var file systems, and the quorum disk. [Return to example]

Verifies the mirrorset configuration and switches. Ensure that the mirrorsets use the correct disks. [Return to example]

Creates appropriately sized partitions in the BOOT-MIR mirrorset using the percentage of the storageset that each partition will use. These partitions will be used for the two member system boot disks (25 percent or 1 GB each) and the Tru64 UNIX disk. For the last partition, the controller assigns the largest free space available to the partition (which will be close to 50 percent, or 2 GB). [Return to example]

Creates appropriately sized partitions in the CROOT-MIR mirrorset using the percentage of the storageset that each partition will use. These partitions will be used for the quorum disk (5 percent), cluster root partition (15 percent), /usr (40 percent), and /var file systems. For the last partition, /var, the controller assigns the largest free space available to the partition (which will be close to 40 percent). See the TruCluster Server Cluster Installation manual to obtain partition sizes. [Return to example]

Verifies the BOOT-MIR mirrorset partitions. Ensure that the partitions are of the desired size. The partition number is in the first column, followed by the partition size and starting block. [Return to example]

Partitions for member system 1 boot disk. [Return to example]

Partitions for member system 2 boot disk. [Return to example]

Partitions for Tru64 UNIX operating system disk. [Return to example]

Verifies the CROOT-MIR mirrorset partitions. Ensure that the partitions are of the desired size. The partition number is in the first column, followed by the partition size and starting block. [Return to example]

Partitions for the quorum disk. [Return to example]

Partitions for cluster root (/) filesystem. [Return to example]

Partitions for cluster /usr filesystem. [Return to example]

Partitions for cluster /var filesystem. [Return to example]

6.9.1.2 Adding Units and Identifiers to the HSG80 Storagesets

After you have created the storagesets and partitions, assign a unit number to each partition and set a unique identifier as shown in Example 6-3.

Note

A storageset must reside on one controller or the other. All the partitions of a storageset must be on the same controller because all the partitions of a storageset fail over as a unit.

The steps performed in Example 6-3 include:

Assigns a unit number to each storage unit and disables all access to the storage unit.

Sets an identifier for each storage unit.

Enable selective access to the storage unit.

Example 6-3: Adding Units and Identifiers to the HSG80 Storagesets

HSG80> ADD UNIT D131 BOOT-MIR PARTITION=1 DISABLE_ACCESS_PATH=ALL  [1]
HSG80> ADD UNIT D132 BOOT-MIR PARTITION=2 DISABLE_ACCESS_PATH=ALL
HSG80> ADD UNIT D133 BOOT-MIR PARTITION=3 DISABLE_ACCESS_PATH=ALL
HSG80> ADD UNIT D141 CROOT-MIR PARTITION=1 DISABLE_ACCESS_PATH=ALL
HSG80> ADD UNIT D142 CROOT-MIR PARTITION=2 DISABLE_ACCESS_PATH=ALL
HSG80> ADD UNIT D143 CROOT-MIR PARTITION=3 DISABLE_ACCESS_PATH=ALL
HSG80> ADD UNIT D144 CROOT-MIR PARTITION=4 DISABLE_ACCESS_PATH=ALL
 
HSG80> SET D131 IDENTIFIER=131  [2]
HSG80> SET D132 IDENTIFIER=132
HSG80> SET D133 IDENTIFIER=133
HSG80> SET D141 IDENTIFIER=141
HSG80> SET D142 IDENTIFIER=142
HSG80> SET D143 IDENTIFIER=143
HSG80> SET D144 IDENTIFIER=144
HSG80> set d131 ENABLE_ACCESS_PATH = !NEWCON49,!NEWCON50,!NEWCON51,!NEWCON52  [3]
HSG80> set d131 ENABLE_ACCESS_PATH = !NEWCON53,!NEWCON54,!NEWCON55,!NEWCON56
Warning 1000: Other host(s) in addition to the one(s) specified can still
              access this unit.  If you wish to enable ONLY the host(s)
              specified, disable all access paths (DISABLE_ACCESS=ALL), then
              again enable the ones specified.
HSG80> set d131 ENABLE_ACCESS_PATH = !NEWCON61,!NEWCON62,!NEWCON64,!NEWCON65
Warning 1000: Other host(s) in addition to the one(s) specified can still
              access this unit.  If you wish to enable ONLY the host(s)
              specified, disable all access paths (DISABLE_ACCESS=ALL), then
              again enable the ones specified.
HSG80> set d131 ENABLE_ACCESS_PATH = !NEWCON68,!NEWCON74,!NEWCON76,!NEWCON77
Warning 1000: Other host(s) in addition to the one(s) specified can still
              access this unit.  If you wish to enable ONLY the host(s)
              specified, disable all access paths (DISABLE_ACCESS=ALL), then
              again enable the ones specified.
HSG80> set d132 ENABLE_ACCESS_PATH = !NEWCON49,!NEWCON50,!NEWCON51,!NEWCON52

.
.
.
HSG80> set d144 ENABLE_ACCESS_PATH = !NEWCON49,!NEWCON50,!NEWCON51,!NEWCON52
HSG80> set d144 ENABLE_ACCESS_PATH = !NEWCON53,!NEWCON54,!NEWCON55,!NEWCON56
Warning 1000: Other host(s) in addition to the one(s) specified can still
              access this unit.  If you wish to enable ONLY the host(s)
              specified, disable all access paths (DISABLE_ACCESS=ALL), then
              again enable the ones specified.
HSG80> set d144 ENABLE_ACCESS_PATH = !NEWCON61,!NEWCON62,!NEWCON64,!NEWCON65
Warning 1000: Other host(s) in addition to the one(s) specified can still
              access this unit.  If you wish to enable ONLY the host(s)
              specified, disable all access paths (DISABLE_ACCESS=ALL), then
              again enable the ones specified.
HSG80> set d144 ENABLE_ACCESS_PATH = !NEWCON68,!NEWCON74,!NEWCON76,!NEWCON77
Warning 1000: Other host(s) in addition to the one(s) specified can still
              access this unit.  If you wish to enable ONLY the host(s)
              specified, disable all access paths (DISABLE_ACCESS=ALL), then
              again enable the ones specified.
HSG80> show d131  [4]
    LUN                                      Uses             Used by
------------------------------------------------------------------------------
 
  D131                                       BOOT-MIR     (partition)
        LUN ID:      6000-1FE1-0000-0D60-0009-8080-0434-002F
        IDENTIFIER = 131
        Switches:
          RUN                    NOWRITE_PROTECT        READ_CACHE            
          READAHEAD_CACHE        WRITEBACK_CACHE       
          MAXIMUM_CACHED_TRANSFER_SIZE = 32
        Access:
          !NEWCON49, !NEWCON50, !NEWCON51, !NEWCON52, !NEWCON53, !NEWCON54,
          !NEWCON55, !NEWCON56, !NEWCON61, !NEWCON62, !NEWCON64, !NEWCON65,
          !NEWCON68, !NEWCON74, !NEWCON76, !NEWCON77
        State:
          ONLINE to the other controller
          NOPREFERRED_PATH
        Size:             2094502 blocks
        Geometry (C/H/S): ( 927 / 20 / 113 )

.
.
.
HSG80> show d144  [4]
    LUN                                      Uses             Used by
------------------------------------------------------------------------------
 
  D144                                       CROOT-MIR    (partition)
        LUN ID:      6000-1FE1-0000-0D60-0009-8080-0434-0028
        IDENTIFIER = 144
        Switches:
          RUN                    NOWRITE_PROTECT        READ_CACHE            
          READAHEAD_CACHE        WRITEBACK_CACHE       
          MAXIMUM_CACHED_TRANSFER_SIZE = 32
        Access:
          !NEWCON49, !NEWCON50, !NEWCON51, !NEWCON52, !NEWCON53, !NEWCON54,
          !NEWCON55, !NEWCON56, !NEWCON61, !NEWCON62, !NEWCON64, !NEWCON65,
          !NEWCON68, !NEWCON74, !NEWCON76, !NEWCON77
        State:
          ONLINE to the other controller
          NOPREFERRED_PATH
        Size:             3351207 blocks
        Geometry (C/H/S): ( 1483 / 20 / 113 )

Assigns a unit number to each partition. When the unit is created by the ADD UNIT command, disables access to all hosts. This allows selective access in case there are other systems or clusters that are connected to the same switch as the cluster.
Record the unit name of each partition with the intended use for that partition (Table 6-4).

Note

In a multiple-bus failover, dual-redundant configuration, you can balance the I/O load by specifying the controller through which the unit will be accessed. Use the PREFERRED_PATH switch to the add unit or set unit ACL commands (PREFERRED_PATH = this_controller or PREFERRED_PATH = other_controller).
All partitions of a container must be addressed through the same controller. If you set PREFERRED_PATH for one partition, all partitions of that container inherit the same path.
[Return to example]

Sets an identifier for each storage unit. Use any number between 1 and 9999.
To keep your storage naming as consistent and simple as possible, consider using the unit number of the unit as its UDID. For instance, if the unit number is D133, use 133 as the UDID. Note, however, that the identifier must be unique. If you have multiple RA8000 or ESA12000 storage arrays, a unit number such as D133 might not be unique across the storage arrays.
The identifier you select shows up as the UDID in the wwidmgr -show wwid display. The WWID manager uses it when setting the device unit number and boot_def_dev console environment variable. The identifier also shows up during the Tru64 UNIX installation to allow you to select the Tru64 UNIX installation disk.
The identifier is also used with the hardware manager view devices command (hwmgr -view devices) to locate the /dev/disk/dskn value.

Note

We recommend that you set the identifier for all Fibre Channel storagesets. It provides a sure method of identifying the storagesets. Make the identifiers unique numbers within the domain (or within the cluster at a minimum). In other words, do not use the same identifier on more than one HSG80. The identifiers should be easily recognized. Ensure that you record the identifiers (Table 6-4).
[Return to example]

Enables access to each unit for those hosts that you want to be able to access this unit. Because access was initially disabled to all hosts, you can ensure selective access to the units. If you do not remember the connection names, use the HSG80 show connection command as shown in Example 6-1 to determine the HSG80 connection names for the connection to the KGPSA host bus adapters. Many of the connections that are used here are shown in Example 6-1.
Use the connection name for each connection to the KGPSA host bus adapter on the host for which you want access enabled. Many of the connections that are used here are shown in Example 6-1. [Return to example]

Using the SHOW unit command (where unit is D131 through D133 and D141 through 144 in the example), verifies the identifier and that access to each unit is correct. Ensure that there is no connection to an unwanted system. Record the identifier and worldwide name for later use. Table 6-4 is a sample table filled in for the example. Table A-1 in Appendix A is a blank table for your use in an actual installation. [Return to example]

Note

At this point, even though the table is filled in, we do not yet know the device names or dskn numbers.

Table 6-4: Converting Storageset Unit Numbers to Disk Names

File System or Disk	HSG80 Unit	Worldwide Name	UDID	Device Name	dsk`n`
Member 1 boot disk	D131	6000-1FE1-0000-0D60-0009-8080-0434-002F	131	`dga131.1001.0.1.0`	`dsk17`
Member 2 boot disk	D132	6000-1FE1-0000-0D60-0009-8080-0434-0030	132	`dga132.1001.0.1.0`	`dsk16`
Tru64 UNIX disk	D133	6000-1FE1-0000-0D60-0009-8080-0434-002E	133	`dga133.1001.0.1.0`	`dsk15`
Quorum disk	D141	6000-1FE1-0000-0D60-0009-8080-0434-0029	141	N/A ^{[Footnote 22]}	`dsk21`
Cluster root (`/`)	D142	6000-1FE1-0000-0D60-0009-8080-0434-002A	142	N/A ^{[Footnote 22]}	`dsk20`
`/usr`	D143	6000-1FE1-0000-0D60-0009-8080-0434-002B	143	N/A ^{[Footnote 22]}	`dsk19`
`/var`	D144	6000-1FE1-0000-0D60-0009-8080-0434-0028	144	N/A ^{[Footnote 22]}	`dsk18`

6.9.1.3 Set the Device Unit Number

The device unit number is a subset of the device name as shown in a show device console display. For example, in the device name dga133.1001.0.1.0, the device unit number is 133. The console uses this device unit number to identify a storage unit. When you set a device unit number, you are really setting an alias for the device worldwide name. The 64-bit worldwide name is too large to be used as the device unit number, so an alias is used instead.

This section describes how to use the wwidmgr -quickset command to set the device unit number for the Fibre Channel disks to be used as the Tru64 UNIX Version 5.1A installation disk or cluster member system boot disks. Setting the device unit number allows the installation scripts to recognize a Fibre Channel disk.

To set the device unit number for a Fibre Channel device, follow these steps:

Use the HSG80 show unit command to obtain the user-defined identifier (UDID) for the HSG80 storageset to be used as the Tru64 UNIX Version 5.1A installation disk or cluster member system boot disks. Record the UDIDs. The show unit command also displays the HSG80 disk name, so you can be sure you are using the correct device.
In the example in Table 6-4, the Tru64 UNIX disk is unit D133 with a UDID 133. The UDID for the cluster member 1 boot disk is 131, and the cluster member 2 boot disk is 132.

From the AlphaServer console, use the wwidmgr -clear all command to clear the stored Fibre Channel wwid1, wwid2, wwid3, wwid4, N1, N2, N3, and N4 console environment variables. You want to start with all wwidn and Nn variables clear.
```
P00>>> wwidmgr -clear all
P00>>> show wwid*
wwid0                                   
wwid1                                   
wwid2                                   
wwid3
P00>>> show n*
N1                                      
N2                                      
N3                                      
N4
 
```
Note

The console only creates devices for which the wwidn console environment variable has been set, and are accessible through an HSG80 N_Port as specified by the Nn console environment variable also being set. These console environment variables are set with the wwidmgr -quickset or wwidmgr -set wwid commands. The use of the wwidmgr -quickset command is shown later in Example 6-5.

Example 6-4 shows the use of the wwidmgr -show wwid command to display the UDID and worldwide names of all devices known to the console. At this time, if you issue the show device console command, it will not detect the Fibre Channel devices connected to the HSG80.

Example 6-4: Displaying the UDID and Worldwide Names of Devices Known to the Console

P00>>> wwidmgr -show wwid
[0] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-0008 (ev:none)
[1] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-0007 (ev:none)
[2] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-0009 (ev:none)
[3] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-000a (ev:none)
[4] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-000b (ev:none)
[5] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-000c (ev:none)
[6] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-000d (ev:none)
[7] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-000e (ev:none)
[8] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-000f (ev:none)
[9] UDID:-1 WWID:01000010:6000-1fe1-0001-4770-0009-9171-3579-0010 (ev:none)
[10] UDID:131 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002f (ev:none)
[11] UDID:132 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-0030 (ev:none)
[12] UDID:133 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002e (ev:none)
[13] UDID:141 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-0029 (ev:none)
[14] UDID:142 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002a (ev:none)
[15] UDID:143 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002b (ev:none)
[16] UDID:144 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-0028 (ev:none)
[17] UDID:-1 WWID:01000010:6000-1fe1-0000-0ca0-0009-8090-0708-002b (ev:none)
[18] UDID:-1 WWID:01000010:6000-1fe1-0000-0ca0-0009-8090-0708-002c (ev:none)
[19] UDID:-1 WWID:01000010:6000-1fe1-0000-0ca0-0009-8090-0708-002d (ev:none)
[20] UDID:-1 WWID:01000010:6000-1fe1-0000-0ca0-0009-8090-0708-002e (ev:none)
 [1] [2]    [3]                                                   [4]

The number within the brackets ([ ]) is the item number of the device shown on any particular line. [Return to example]

The UDID is assigned at the HSG80 with the set Dn IDENTIFIER = xxx command, and is not used by the Tru64 UNIX operating system, but may be set (as we have done with the SET D131 IDENTIFIER=131 group of commands). When the identifier is not set at the HSG80, a value of -1 is displayed. [Return to example]

The worldwide name for the device. It is prefixed with the value WWID:01000010:. The most significant 64 bits of the worldwide name resembles the HSG80 worldwide name, and is assigned when the unit is manufactured. The least significant 64 bits is a volume serial number generated by the HSG80. You can use the HSG80 SHOW unit command to determine the worldwide name for each storage unit (as shown in Example 6-2). [Return to example]

The console environment variable set for this worldwide name. Only 4 wwidn console environment variables (wwid0, wwid1, wwid2, and wwid3) can be set. The console show dev command only shows those disk devices for which a wwidn console environment variable has been set using the wwidmgr -quickset or wwidmgr -set command. In this example, none of the wwidn environment variables is set. [Return to example]

Look through the wwidmgr -show wwid display (Example 6-4) and locate the UDID for the Tru64 UNIX disk (133) and each member system boot disks (131, 132) to ensure the storage unit is seen. As a second check, compare the worldwide name values.

Use the wwidmgr command with the -quickset option to set a device unit number for the Tru64 UNIX Version 5.1A installation disk and the first cluster member system boot disk.
Example 6-5 shows the use of the wwidmgr command with the -quickset option to define a device unit number, based on the UDID, as an alias for the worldwide name for the Tru64 UNIX installation disk and the first cluster member system boot disk. The wwidmgr -quickset utility sets the device unit number and also provides a display of the device names and how the disk is reachable (reachability display).
The wwidmgr -quickset command generates multiple device unit numbers based on the UDID. The device unit number is an alias for the worldwide name, in a format the console can use to identify the storage unit. The wwidmgr -quickset command may generate multiple device names for a given device unit number because each possible path to a storage unit is given its own device name.
Example 6-5 shows:
- The use of the wwidmgr -quickset command to set the device unit number for the Tru64 UNIX Version 5.1A installation disk to 133, and the first cluster member system boot disk to 131.
- The wwidmgr -quickset command provides a reachability display equivalent to execution of the wwidmgr reachability command. The reachability part of the display provides the following:
  - The worldwide name for the storage unit that is to be accessed.
  - The new device name for the storage unit.
  - The KGPSA adapters through which a connection to the storage unit is potentially available.
  - The WWID of the HSG80 port(s) (N_Ports) that will be used to access the storage unit.
  - The connected column indicates whether the storage unit is current available through the KGPSA to HSG80 controller port connection. The HSG80 controllers are in multiple-bus failover so each storage unit is presented by only one controller at a time.

Example 6-5: Setting the Device Unit Number with the wwidmgr quickset Command

P00>>> wwidmgr -quickset -udid 133
 
Disk assignment and reachability after next initialization:
 
6000-1fe1-0000-0d60-0009-8080-0434-002e
                          via adapter:         via fc nport:        connected: 
dga133.1001.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d64      No 
dga133.1002.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d62      Yes
dga133.1003.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d63      No 
dga133.1004.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d61      Yes
dgb133.1001.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d64      No 
dgb133.1002.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d62      Yes
dgb133.1003.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d63      No 
dgb133.1004.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d61      Yes
 
P00>>> wwidmgr -quickset -udid 131
 
Disk assignment and reachability after next initialization:
 
6000-1fe1-0000-0d60-0009-8080-0434-002e
                          via adapter:         via fc nport:        connected: 
dga133.1001.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d64      No 
dga133.1002.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d62      Yes
dga133.1003.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d63      No 
dga133.1004.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d61      Yes
dgb133.1001.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d64      No 
dgb133.1002.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d62      Yes
dgb133.1003.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d63      No 
dgb133.1004.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d61      Yes
 
6000-1fe1-0000-0d60-0009-8080-0434-002f
                          via adapter:         via fc nport:        connected: 
dga131.1001.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d64      No 
dga131.1002.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d62      Yes
dga131.1003.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d63      No 
dga131.1004.0.1.0        pga0.0.0.1.0         5000-1fe1-0000-0d61      Yes
dgb131.1001.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d64      No 
dgb131.1002.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d62      Yes
dgb131.1003.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d63      No 
dgb131.1004.0.2.0        pgb0.0.0.2.0         5000-1fe1-0000-0d61      Yes
P00>>> init

Notes

The wwidmgr -quickset command can take up to a minute to complete on the AlphaServer 8x00, GS60, GS60E, and GS140 systems.
You must reinitialize the console after running the WWID manager (wwidmgr), and keep in mind that the AS1200, AS4x00, AS8x00, GS60, GS60E, and GS140 consoles are in diagnostic mode. The disks are not reachable and you cannot boot until after the system is initialized.
If you have not set the UDID, you cannot set the device unit number as shown in Example 6-5. You have to use the quickset command with the item number displayed by the wwidmgr -show wwid command (see Example 6-4). For instance, to set the device unit number of item number 17 of Example 6-4 to 770, use the following command:
P00>>> wwidmgr -quickset -item 17 -unit 770
 

The device names have also been set for the Tru64 UNIX disk and first cluster member system boot disks. Record these device names for later use.

In the reachability portion of the display, the storagesets are reachable from KGPSA pga through two HSG80 ports and from KGPSA pgb through two HSG80 ports. Also, the device unit number has been set for each KGPSA to HSG80 controller port connection, even if the storage unit is not currently reachable via that connection.

6.9.1.4 Displaying the Available Boot Devices

The only Fibre Channel devices that are displayed by the console show dev command are those devices that have been assigned to a wwidn environment variable with the wwidmgr -quickset command.

The devices that are shown in the reachability display of Example 6-5 are available for booting and the setting of the bootdef_dev console environment variable during normal console mode.

If you execute the show wwid* console command now, it will show that the environment variable wwidn is set for two disks. Also, the show n* command shows that the units are accessible through four HSG80 N_Ports as follows:

P00>>> show wwid*
wwid0  133 1 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002e
wwid1  131 1 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002f
wwid2
wwid3   
P00>>> show n*
N1                      50001fe100000d64
N2                      50001fe100000d62
N3                      50001fe100000d63
N4                      50001fe100000d61

Caution

If you reuse a UDID, ensure that you clear the wwidn console environment variable set for the original storage unit. If you do not clear the wwidn console environment variable, the wwidmgr command used to quickset the UDID the second time sets a second wwidn console environment variable for the same UDID. The original wwidn console environment variable has a lower number, and therefore is the console environment variable that will be used, but it points to a storage unit that is not reachable. Therefore, you will not be able to boot the system.
If you have already used the wwidmgr -quickset command the second time on the UDID and set a second wwidn console environment variable for the same UDID, and the system has failed to boot, use the wwidmgr command to clear the wwidn console environment variable for the unreachable storage unit. For example, consider the following state of the wwidn console environment variables where wwid1 and wwid2 are both for UDID 150, and wwid1 is the original console environment variable:
P00>>> show wwid*
wwid0    5 1 WWID:01000010:6000-1fe1-0006-3f10-0009-0270-0619-0006
wwid1    150 1 WWID:01000010:6000-1fe1-0006-3f10-0009-0270-0619-000d
wwid2    150 1 WWID:01000010:6000-1fe1-0006-3f10-0009-0270-0619-0011
wwid3
 
Clear the wwid1 console environment variable as follows:
P00>>> wwidmgr -clear wwid1
 
Then, reboot the system.

Example 6-6 provides sample device names as displayed by the show dev command after using the wwidmgr -quickset command to set the device unit numbers.

Example 6-6: Sample Fibre Channel Device Names

 P00>>> show dev
dga131.1001.0.1.0          $1$DGA131                      HSG80  V8.5F
dga131.1002.0.1.0          $1$DGA131                      HSG80  V8.5F
dga131.1003.0.1.0          $1$DGA131                      HSG80  V8.5F
dga131.1004.0.1.0          $1$DGA131                      HSG80  V8.5F
dga133.1001.0.1.0          $1$DGA133                      HSG80  V8.5F
dga133.1002.0.1.0          $1$DGA133                      HSG80  V8.5F
dga133.1003.0.1.0          $1$DGA133                      HSG80  V8.5F
dga133.1004.0.1.0          $1$DGA133                      HSG80  V8.5F
dgb131.1001.0.2.0          $1$DGA131                      HSG80  V8.5F
dgb131.1002.0.2.0          $1$DGA131                      HSG80  V8.5F
dgb131.1003.0.2.0          $1$DGA131                      HSG80  V8.5F
dgb131.1004.0.2.0          $1$DGA131                      HSG80  V8.5F
dgb133.1001.0.2.0          $1$DGA133                      HSG80  V8.5F
dgb133.1002.0.2.0          $1$DGA133                      HSG80  V8.5F
dgb133.1003.0.2.0          $1$DGA133                      HSG80  V8.5F
dgb133.1004.0.2.0          $1$DGA133                      HSG80  V8.5F
dka0.0.0.1.1               DKA0              COMPAQ BB00911CA0  3B05
dqa0.0.0.15.0              DQA0                COMPAQ CDR-8435  0013    
dva0.0.0.1000.0            DVA0                               
ewa0.0.0.5.1               EWA0              08-00-2B-C4-61-11
pga0.0.0.1.0               PGA0        WWN 1000-0000-c920-eda0
pgb0.0.0.2.0               PGB0        WWN 1000-0000-c920-da01
pka0.7.0.1.1               PKA0                  SCSI Bus ID 7  5.57

Note

The only Fibre Channel devices that are displayed by the console show dev command are those devices that have been assigned to a wwidn environment variable.

Before you start the Tru64 UNIX installation, you must set the bootdef_dev console environment variable.

6.9.1.5 Set the bootdef_dev Console Environment Variable for Tru64 UNIX Installation

When booting from Fibre Channel devices, you must set the bootdef_dev console environment variable to ensure that the installation procedure is able to boot the system after building the new kernel.

Note

The bootdef_dev environment variable values must point to the same HSG80.

To set the bootdef_dev console environment variable for the Tru64 UNIX installation when booting from a Fibre Channel device, follow these steps:

Obtain the device name for the Fibre Channel storage unit where you will install the Tru64 UNIX operating system. The device name shows up in the reachability display as shown in Example 6-5 with a Yes under the connected column. You can also use the wwidmgr -show reachability command to determine reachability. Example 6-6 provides the display for a show dev command, which shows the device names of devices that may be assigned to the bootdef_dev console environment variable. Example 6-5 and Example 6-6 show that the following device names can be used in the bootdef_dev console environment variable as possible boot devices:
- dga131.1002.0.1.0
- dga131.1004.0.1.0
- dga133.1002.0.1.0
- dga133.1004.0.1.0
- dgb131.1002.0.2.0
- dgb131.1004.0.2.0
- dgb133.1002.0.2.0
- dgb133.1004.0.2.0
Each of the storage units are reachable through four different paths, two for each host bus adapter (the Yes in the connected column).

Set the bootdef_dev console environment variable to one of the boot paths that show up as connected to the disk you will use for the Tru64 UNIX installation. For the example disk configuration on the system where you are installing the Tru64 UNIX operating system (which will eventually be the first cluster member), set bootdef_dev as follows:
```
P00>>> set bootdef_dev dga133.1002.0.1.0
 
```

You must initialize the system to use any of the device names in the bootdef_dev variable:
```
P00>>> init
       
.
.
.
```
After the initialization, the bootdef_dev will show up as follows:
```
P00>>> show bootdef_dev
bootdef_dev        dga133.1002.0.1.0
 
```

You are now ready to install the Tru64 UNIX operating system.

6.9.2 Install the Base Operating System

After you read the TruCluster Server Cluster Installation manual, and use the Tru64 UNIX Installation Guide as a reference, boot from the CD-ROM and perform a full installation of the Tru64 UNIX Version 5.1A operating system.

When the installation procedure displays the list of disks that are available for operating system installation as shown here, look for the identifier in the Location column. Verify the identifier from the table that you have been preparing (Table 6-4).

To visually locate a disk, enter "ping <disk>",
where <disk> is the device name (for example, dsk0) of the disk you
want to locate.  If that disk has a visible indicator light, it will
blink until you are ready to continue.
 
      Device    Size  Controller  Disk
      Name     in GB  Type        Model       Location
1)    dsk0       4.0  SCSI        RZ2CA-LA    bus-0-targ-0-lun-0
2)    dsk15      1.0  SCSI        HSG80       IDENTIFIER=133
3)    dsk17      2.0  SCSI        HSG80       IDENTIFIER=131

If you flash the light on a storage unit (logical disk) that is a mirrorset, stripeset, or RAIDset, the lights on all disks in the storageset will blink.

Record the /dev/disk/dskn value (dsk15) for the Tru64 UNIX disk that matches the UDID (133) (Table 6-4).

Complete the installation, following the instructions in the Tru64 UNIX Installation Guide.

6.9.3 Determining /dev/disk/dskn to Use for a Cluster Installation

Before you can install the TruCluster Server software, you must determine which /dev/disk/dskn to use for the various TruCluster Server disks.

To determine the /dev/disk/dskn to use for the cluster disks, follow these steps:

With the Tru64 UNIX Version 5.1A operating system at single-user or multi-user mode, use the hardware manager (hwmgr) utility with the -view devices option to display all devices on the system. Pipe the command through the grep utility to search for any items with the IDENTIFIER qualifier.

# hwmgr -view dev | grep IDENTIFIER
 HWID: Device Name          Mfg      Model            Location
 -----------------------------------------------------------------------
   62: /dev/disk/dsk15c     DEC      HSG80            IDENTIFIER=133
   63: /dev/disk/dsk16c     DEC      HSG80            IDENTIFIER=132
   64: /dev/disk/dsk17c     DEC      HSG80            IDENTIFIER=131
   65: /dev/disk/dsk18c     DEC      HSG80            IDENTIFIER=141
   66: /dev/disk/dsk19c     DEC      HSG80            IDENTIFIER=142
   67: /dev/disk/dsk20c     DEC      HSG80            IDENTIFIER=143
   68: /dev/disk/dsk21c     DEC      HSG80            IDENTIFIER=144

If you know that you have set the UDID for a large number of disks, you can also grep for the UDID:

# hwmgr -view dev | grep IDENTIFIER | grep 131

Note

If you have not set the UDID, you can use hwmgr to determine the /dev/disk/dskn name by using the hardware manager to display device attributes and searching for the storage unit worldwide name as follows:
# hwmgr -get attribute -a name -a dev_base_name | more
 
Use the more search utility (/) to search for the worldwide name of the storageset you have set up for the particular disk in question. The following example shows the format of the command output:
# hwmgr -get attribute -a name -a dev_base_name
1:
  name = Compaq AlphaServer ES40
2:
  name = CPU0
       
.
.
.
62:
  name = SCSI-WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002e
  dev_base_name = dsk15
63:
  name = SCSI-WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-0030
  dev_base_name = dsk16
64:
  name = SCSI-WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002f
  dev_base_name = dsk17
65:
  name = SCSI-WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-0028
  dev_base_name = dsk18
66:
  name = SCSI-WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002b
  dev_base_name = dsk19
67:
  name = SCSI-WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002a
  dev_base_name = dsk20
68:
  name = SCSI-WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-0029
  dev_base_name = dsk21
69:
  name = SCSI-WWID:0710002c:"COMPAQ CDR-8435 :d05b003t00000l00000"
  dev_base_name = cdrom0
       
.
.
.
For more information on the hardware manager, see hwmgr(8).

Search the display for the UDIDs (or worldwide names) for each of the cluster installation disks and record the /dev/disk/dskn values.
If you used the grep utility to search for a specific UDID, for example hwmgr -view dev | grep "IDENTIFIER=131" repeat the command to determine the /dev/disk/dskn for each of the remaining cluster disks. Record the information for use when you install the cluster software.

You must label the disks before you install cluster software.

6.9.4 Label the Disks to Be Used to Create the Cluster

Before you run clu_create to create the first cluster member or clu_add_member to add subsequent cluster members, you must label the disks to be used for cluster software.

On the system where you installed the Tru64 UNIX operating system, if you have not already done so, boot the system. Determine the /dev/disk/dskn values to use for cluster installation (Section 6.9.3).

Initialize disklabels for all disks needed to create the cluster. The example (Table 6-4) uses disks dsk18 (/var), dsk19 (/usr), dsk20 [cluster root (/)], and dsk21 (Quorum). For instance:

# disklabel -rw dsk20 HSG80

6.9.5 Install the TruCluster Server Software and Create the First Cluster Member

After labeling the disks, use the TruCluster Server Cluster Installation procedures and install the TruCluster Server software on the first cluster member (the system where you just installed Tru64 UNIX).

After installing the TruCluster Server software subsets, run the clu_create command to create the first cluster member using the procedures in the Cluster Installation manual.

When clu_create terminates, do not reboot the system. Shut down the system and reset the bootdef_dev console environment variable to provide multiple boot paths to the member system boot disk before booting (Section 6.9.6). Then boot the first cluster member.

6.9.6 Reset the bootdef_dev Console Environment Variable

If you set the bootdef_dev console environment variable to multiple paths in Section 6.9.1.5, the base operating system installation or clu_create procedures modify the variable and you should reset it to provide multiple boot paths.

To reset the bootdef_dev console environment variable, follow these steps:

Obtain the device name and worldwide name for the Fibre Channel unit from where you will boot cluster member system 1 (Table 6-4).

Examine the reachability display (Example 6-5) provided by the wwidmgr -quickset or the wwidmgr -reachability commands for the device names that can access the storage unit from which you are booting. Examine the show dev command output to ensure the device name may be assigned to the bootdef_dev console environment variable.
Notes

Choose device names that show up as both Yes and No in the reachability display connected column. Keep in mind, that for multiple-bus failover, only one controller is normally active for a storage unit. You must ensure that the unit is reachable if the controllers have failed over.
If you have multiple Fibre Channel host bus adapters, use device names for at least two host bus adapters.
For example, to ensure that you have a connected boot path in case of a failed host bus adapter or controller failover, choose device names for multiple host bus adapters and each controller port. For example, if you use the reachability display shown in Example 6-5, you can choose the following device names when setting the bootdef_dev console environment variable for the first cluster member system:
```
dga131.1004.0.1.0    [1]
dgb131.1002.0.2.0    [2]
dgb131.1003.0.2.0    [3]
dga131.1001.0.1.0    [4]
 
```
1. Path from host bus adapter A to controller A port 1 [Return to example]
2. Path from host bus adapter B to controller A port 2 [Return to example]
3. Path from host bus adapter B to controller B port 1 [Return to example]
4. Path from host bus adapter A to controller B port 2 [Return to example]
You can set units preferred to a specific controller, in which case both controllers will be active.
If the bootdef_dev console environment variable ends up with all boot paths in an unconnected state, you can use the ffauto or ffnext console environment variables to force a boot device from a not connected to a connected state.
The ffauto console environment variable is effective only during autoboots (boots other than manual boots). Use the set ffauto on console command to enable ffauto. (The default for ffauto is off.) It is stored in non-volatile memory and therefore persists across system resets and power cycles.
During an autoboot, the console attempts to boot from each connected device listed in the bootdef_dev console environment variable. If ffauto is on, and if the end of devices listed in bootdef_dev is reached without successfully booting, the console starts again at the beginning of devices listed in the bootdef_def console environment variable. This time, devices that are not connected are changed to connected and an attempt is made to boot from that device.
The ffnext console environment variable is a one-time variable. It does not persist across a system reset, power cycle, or reboot. This variable may be used (set ffnext on) to cause the next command to a not connected device to change the state to connected. After the command has been executed, the ffnext variable is automatically set to off, so it has no further effect.
For more information on using the ffauto and ffnext console environment variables, see the Wwidmgr User's Manual.

Set the bootdef_dev console environment variable for member system 1 boot disk to a comma-separated list of several of the boot paths that show up as connected in the reachability display (wwidmgr -quickset or wwidmgr -show reachability). You must initialize the system to use any of the device names in the bootdef_dev variable as follows:
```
P00>>> set bootdef_dev \
dga131.1001.0.1.0,dga131.1004.0.1.0,\
dgb131.1002.0.2.0,dgb131.1003.0.2.0
POO>>> init
 
```

Note

The console System Reference Manual (SRM) software guarantees that you can set the bootdef_dev console environment variable to a minimum of four device names. You may be able to set it to five, but only four are guaranteed.

6.9.7 Add Additional Systems to the Cluster

To add additional systems to the cluster, follow this procedure:

On the system where you installed the Tru64 UNIX operating system, boot the system into the cluster (single-member cluster), and referring to the TruCluster Server Cluster Installation procedures, use clu_add_member to add subsequent cluster members.
Before you boot the system being added to the cluster, on the newly added cluster member:
1. Use the wwidmgr utility with the -quickset option to set the device unit number for the member system boot disk as shown in Example 6-7. For member system 2 in the example configuration, it is the storage unit with UDID 132 (Table 6-4):
  Example 6-7: Setting Device Unit Number for Additional Member System
```
P00>>> wwidmgr -quickset -udid 132
 
Disk assignment and reachability after next initialization:
 
6000-1fe1-0000-0d60-0009-8080-0434-0030
                     via adapter:     via fc nport:       connected: 
dga132.1001.0.1.0    pga0.0.0.1.0     5000-1fe1-0000-0d64      No 
dga132.1002.0.1.0    pga0.0.0.1.0     5000-1fe1-0000-0d62      Yes
dga132.1003.0.1.0    pga0.0.0.1.0     5000-1fe1-0000-0d63      No 
dga132.1004.0.1.0    pga0.0.0.1.0     5000-1fe1-0000-0d61      Yes
dgb132.1001.0.2.0    pgb0.0.0.2.0     5000-1fe1-0000-0d64      No 
dgb132.1002.0.2.0    pgb0.0.0.2.0     5000-1fe1-0000-0d62      Yes
dgb132.1003.0.2.0    pgb0.0.0.2.0     5000-1fe1-0000-0d63      No 
dgb132.1004.0.2.0    pgb0.0.0.2.0     5000-1fe1-0000-0d61      Yes
 
P00>>> init
 
```
2. Set the bootdef_dev console environment variable to one reachable path (Yes in the connected column of Example 6-7) to the member system boot disk:
```
P00>>> set bootdef_dev dga132.1002.0.1.0
 
```
3. Boot genvmunix on the newly added cluster member system. Each installed subset will be configured and a new kernel will be built and installed. After the new kernel is built, do not reboot the new cluster member system.

Shut down the system and reset the bootdef_dev console environment variable to provide multiple boot paths to the member system boot disk as follows:
1. Obtain the device name and worldwide name for the Fibre Channel unit from where you will boot (Table 6-4). Ensure that you choose the correct device name for the entity that you are booting (cluster member system 2, cluster member system 3, and so on).
2. Examine the reachability display (Example 6-7) that is provided by the wwidmgr -quickset command for the device names that can access the storage unit from where you are booting. Examine the show dev command output to ensure that the device name may be assigned to the bootdef_dev console environment variable.
  Notes
  
  Choose device names that show up as both Yes and No in the reachability display connected column. Keep in mind that, for multiple-bus failover, only one controller is normally active for a storage unit. You must ensure that the unit is reachable if the controllers have failed over.
  If you have multiple Fibre Channel host bus adapters, use device names for at least two host bus adapters.
  For example, to ensure that you have a connected boot path in case of a failed host bus adapter or controller failover, choose device names for multiple host bus adapters and each controller port. For example, if you use the reachability display shown in Example 6-7, you can choose the following device names when setting the bootdef_dev console environment variable for the second cluster member system:
```
dga132.1004.0.1.0    [1]
dgb132.1002.0.2.0    [2]
dgb132.1003.0.2.0    [3]
dga132.1001.0.1.0    [4]
 
```
  1. Path from host bus adapter A to controller A port 1 [Return to example]
  2. Path from host bus adapter B to controller A port 2 [Return to example]
  3. Path from host bus adapter B to controller B port 1 [Return to example]
  4. Path from host bus adapter A to controller B port 2 [Return to example]
3. Set the bootdef_dev console environment variable for member system 2 boot disk to a comma-separated list of several of the boot paths that show up as connected in the reachability display (wwidmgr -quickset or wwidmgr -show reachability). You must initialize the system to use any of the device names in the bootdef_dev variable as follows:
```
P00>>> set bootdef_dev \
dga132.1001.0.1.0,dga132.1004.0.1.0,\
dgb132.1002.0.2.0,dgb132.1003.0.2.0
POO>>> init
 
```

Boot the new cluster member system into the cluster and complete the cluster installation.

Repeat steps 1 through 3 for other cluster member systems.

6.10 Converting the HSG80 from Transparent to Multiple-Bus Failover Mode

If you are migrating from Tru64 UNIX Version 4.0F or Version 4.0G and TruCluster Software Products Version 1.6 to Tru64 UNIX Version 5.1A and TruCluster Server Version 5.1A, you may want to change from transparent failover to multiple-bus failover to take advantage of multibus support in Tru64 UNIX Version 5.1A and multiple-bus failover mode and the ability to create a no-single-point-of-failure (NSPOF) cluster.

Or, you may be using transparent failover mode with Tru64 UNIX Version 5.1A and TruCluster Server Version 5.1A and want to take advantage of the ability to create an NSPOF configuration, and the availability that multiple-bus failover provides over transparent failover mode.

6.10.1 Overview

The change in failover modes cannot be accomplished with a simple SET MULTIBUS COPY=THIS HSG80 CLI command because:

Unit offsets are not changed by the HSG80 SET MULTIBUS_FAILOVER COPY=THIS command.
Each path between a Fibre Channel host bus adapter in a host computer and an active host port on an HSG80 controller is a connection. During Fibre Channel initialization, when a controller becomes aware of a connection to a host bus adapter through a switch or hub, it adds the connection to its table of known connections. The unit offset for the connection depends on the failover mode in effect at the time that the connection is discovered. In transparent failover mode, host connections to port 1 default to an offset of 0; host connections on port 2 default to an offset of 100. Host connections on port 1 can see units 0 through 99; host connections on port 2 can see units 100 through 199.
In multiple-bus failover mode, host connections on either port 1 or 2 can see units 0 through 199. In multiple-bus failover mode, the default offset for both ports is 0.
If you change the failover mode from transparent failover to multiple-bus failover, the offsets in the table of known connections remain the same as if they were for transparent failover mode; the offset on port 2 remains 100. With an offset of 100 on port 2, a host cannot see units 0 through 99 on port 2. This reduces the availability. Also, if you have only a single HSG80 controller and lose the connection to port 1, you lose access to units 0 through 99.
Therefore, if you want to change from transparent failover to multiple-bus failover mode, you must change the offset in the table of known connections for each connection that has a nonzero offset.

Note

Disconnecting and then reconnecting the cables does no good because a connection that is added to the table remains in the table until you delete the connection.

The system can access a storage device through only one HSG80 port. The system's view of the storage device is not changed when the HSG80 is placed in multiple-bus failover mode.
In transparent failover mode, the system accesses storage units D0 through D99 through port 1 and units D100 through D199 through port 2. In multiple-bus failover mode, you want the system to be able to access all units through all four ports.

6.10.2 Procedure to Convert from Transparent to Multiple-bus Failover Mode

To change from transparent failover to multiple-bus failover mode by resetting the unit offsets and modifying the systems' view of the storage units, follow these steps:

Shut down the operating systems on all host systems that are accessing the HSG80 controllers that you want to change from transparent failover to multiple-bus failover mode.

At the HSG80, set multiple-bus failover as follows. Before putting the controllers in multiple-bus failover mode, you must remove any previous failover mode:
```
HSG80> SET NOFAILOVER
HSG80> SET MULTIBUS_FAILOVER COPY=THIS
 
```
Note

Use the controller that you know has the good configuration information.

If this HSG80 is being used in an arbitrated loop topology (port topology is set to LOOP_HARD), you need to set a unique AL_PA address for each port because all of the ports can be active at the same time. (The convention in transparent failover mode is to use the same AL_PA address for Port 1 on both controllers and the same AL_PA address for Port 2 on both controllers.)
The following example sets the ports on two HSG80 controllers off line, sets the PORT_x_AL_PA value for multiple-bus failover mode, and sets the ports on line.
```
HSG80> set this port_1_topology = offline  
HSG80> set this port_2_topology = offline
HSG80> set other port_1_topology = offline
HSG80> set other port_2_topology = offline  
HSG80> set this PORT_1_AL_PA  = 01   
HSG80> set this PORT_2_AL_PA  = 02   
HSG80> set other PORT_1_AL_PA  = 04   
HSG80> set other PORT_2_AL_PA  = 08   
 
```

Execute the SHOW CONNECTION command to determine which connections have a nonzero offset as follows:

HSG80> SHOW CONNECTION
Connection                                                                Unit
   Name      Operating system    Controller  Port    Address    Status   Offset
 
!NEWCON49      TRU64_UNIX           THIS       2      230813   OL this     100
           HOST_ID=1000-0000-C920-DA01         ADAPTER_ID=1000-0000-C920-DA01
 
!NEWCON50      TRU64_UNIX           THIS       1      230813   OL this       0
           HOST_ID=1000-0000-C920-DA01         ADAPTER_ID=1000-0000-C920-DA01
 
!NEWCON51      TRU64_UNIX           THIS       2      230913   OL this     100
           HOST_ID=1000-0000-C920-EDEB         ADAPTER_ID=1000-0000-C920-EDEB
 
!NEWCON52      TRU64_UNIX           THIS       1      230913   OL this       0
           HOST_ID=1000-0000-C920-EDEB         ADAPTER_ID=1000-0000-C920-EDEB
 
!NEWCON53      TRU64_UNIX           OTHER      1      230913   OL other      0
           HOST_ID=1000-0000-C920-EDEB         ADAPTER_ID=1000-0000-C920-EDEB
 
!NEWCON54      TRU64_UNIX           OTHER      1      230813   OL other      0
           HOST_ID=1000-0000-C920-DA01         ADAPTER_ID=1000-0000-C920-DA01
 
!NEWCON55      TRU64_UNIX           OTHER      2      230913   OL other    100
           HOST_ID=1000-0000-C920-EDEB         ADAPTER_ID=1000-0000-C920-EDEB
 
!NEWCON56      TRU64_UNIX           OTHER      2      230813   OL other    100
           HOST_ID=1000-0000-C920-DA01         ADAPTER_ID=1000-0000-C920-DA01
 
!NEWCON57      TRU64_UNIX           THIS       2               offline     100
           HOST_ID=1000-0000-C921-09F7         ADAPTER_ID=1000-0000-C921-09F7
 
!NEWCON58      TRU64_UNIX           OTHER      1               offline       0
           HOST_ID=1000-0000-C921-09F7         ADAPTER_ID=1000-0000-C921-09F7
 
!NEWCON59      TRU64_UNIX           THIS       1               offline       0
           HOST_ID=1000-0000-C921-09F7         ADAPTER_ID=1000-0000-C921-09F7
 
!NEWCON60      TRU64_UNIX           OTHER      2               offline     100
           HOST_ID=1000-0000-C921-09F7         ADAPTER_ID=1000-0000-C921-09F7
 
!NEWCON61      TRU64_UNIX           THIS       2      210513   OL this     100
           HOST_ID=1000-0000-C921-086C         ADAPTER_ID=1000-0000-C921-086C
 
!NEWCON62      TRU64_UNIX           OTHER      1      210513   OL other      0
           HOST_ID=1000-0000-C921-086C         ADAPTER_ID=1000-0000-C921-086C
 
!NEWCON63      TRU64_UNIX           OTHER      1               offline       0
           HOST_ID=1000-0000-C921-0943         ADAPTER_ID=1000-0000-C921-0943
 
!NEWCON64      TRU64_UNIX           OTHER      1      210413   OL other      0
           HOST_ID=1000-0000-C920-EDA0         ADAPTER_ID=1000-0000-C920-EDA0
 
!NEWCON65      TRU64_UNIX           OTHER      2      210513   OL other    100
           HOST_ID=1000-0000-C921-086C         ADAPTER_ID=1000-0000-C921-086C
      
.
.
.

The following connections are shown to have nonzero offsets: !NEWCON49, !NEWCON51, !NEWCON55, !NEWCON56, !NEWCON57, !NEWCON60, !NEWCON61, and !NEWCON65

Set the unit offset to 0 for each connection that has a nonzero unit offset:

HSG80> SET !NEWCON49 UNIT_OFFSET = 0
HSG80> SET !NEWCON51 UNIT_OFFSET = 0
HSG80> SET !NEWCON55 UNIT_OFFSET = 0
HSG80> SET !NEWCON56 UNIT_OFFSET = 0
HSG80> SET !NEWCON57 UNIT_OFFSET = 0
HSG80> SET !NEWCON60 UNIT_OFFSET = 0
HSG80> SET !NEWCON61 UNIT_OFFSET = 0
HSG80> SET !NEWCON65 UNIT_OFFSET = 0

At the console of each system accessing storage units on this HSG80, follow these steps:

Note

The remaining steps apply only to fabric configurations. In this release, you cannot boot from storage that is connected via a Fibre Channel arbitrated loop.
1. Use the wwid manager to show the Fibre Channel environment variables and determine which units are reachable by the system. This is the information the console uses, when not in wwidmgr mode, to find Fibre Channel devices:
```
P00>>> wwidmgr -show ev
wwid0    133 1 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002e
wwid1    131 1 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002f
wwid2    132 1 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-0030
wwid3
N1       50001fe100000d64
N2
N3
N4
 
```
  Note
  
  You must set the console to diagnostic mode to use the wwidmgr command for the following AlphaServer systems: AS1200, AS4x00, AS8x00, GS60, GS60E, and GS140. Set the console to diagnostic mode as follows:
```
P00>>> set mode diag
Console is in diagnostic mode
P00>>>
 
```
2. For each wwidn line, record the unit number (131, 132, and 133) and worldwide name for the storage unit. The unit number is the first field in the display (after wwidn). The Nn value is the HSG80 port being used to access the storage units (controller B, port 2).
3. Clear the wwidn and Nn environment variables:
```
P00>>> wwidmgr -clear all
 
```
4. Initialize the console:
```
P00>>> init
 
```
5. Use the wwid manager with the -quickset option to set up the device and port path information for the storage units from where each system will need to boot. Each system may need to boot from the base operating system disk. Each system will need to boot from its member system boot disk. Using the storage units from the example, cluster member 1 will need access to the storage units with UDIDs 131 (member 1 boot disk) and 133 (Tru64 UNIX disk). Cluster member 2 will need access to the storage units with UDIDs 132 (member 2 boot disk) and 133 (Tru64 UNIX disk). Set up the device and port path for cluster member 1 as follows:
```
P00>>> wwidmgr -quickset -udid 131
      
.
.
.
P00>>> wwidmgr -quickset -udid 133
      
.
.
.
```
6. Initialize the console:
```
P00>>> init
 
```
7. Verify that the storage units and port path information is set up, and then reinitialize the console. The following example shows the information for cluster member 1:
```
P00>>> wwidmgr -show ev
wwid0    133 1 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002e
wwid1    131 1 WWID:01000010:6000-1fe1-0000-0d60-0009-8080-0434-002f
wwid2
wwid3
N1       50001fe100000d64
N2       50001fe100000d62
N3       50001fe100000d63
N4       50001fe100000d61
P00>>> init
 
```
8. Set the bootdef_dev console environment variable to the member system boot device. Use the paths shown in the reachability display of the wwidmgr -quickset command for the appropriate device (Section 6.9.6).
9. Repeat steps a through h on each system accessing devices on the HSG80.

6.11 Using the emx Manager to Display Fibre Channel Adapter Information

The emx manager (emxmgr) utility was written for the TruCluster Software Product Version 1.6 products to be used to modify and maintain emx driver worldwide name to target ID mappings. It is included with Tru64 UNIX Version 5.1A and, although it is not needed to maintain worldwide name to target ID mappings, you may use it with TruCluster Server Version 5.1A to:

Display the presence of KGPSA Fibre Channel adapters

Display the target ID mappings for a Fibre Channel adapter

Display the current Fibre Channel topology for a Fibre Channel adapter

See emxmgr(8) for more information on the emxmgr utility.

6.11.1 Using the emxmgr Utility to Display Fibre Channel Adapter Information

The primary use of the emxmgr utility for TruCluster Server is to display Fibre Channel information.

Use the emxmgr -d command to display the presence of KGPSA Fibre Channel adapters on the system. For example:

# /usr/sbin/emxmgr -d
emx0 emx1 emx2

Use the emxmgr -m command to display an adapter's target ID mapping. For example:

# /usr/sbin/emxmgr -m emx0
emx0 SCSI target id assignments:
    SCSI tgt id   0 : portname 5000-1FE1-0000-0CB2      
                      nodename 5000-1FE1-0000-0CB0
    SCSI tgt id   5 : portname 1000-0000-C920-A7AE      
                      nodename 1000-0000-C920-A7AE
    SCSI tgt id   6 : portname 1000-0000-C920-CD9C      
                      nodename 1000-0000-C920-CD9C
    SCSI tgt id   7 : portname 1000-0000-C921-0D00      (emx0)
                      nodename 1000-0000-C921-0D00

The previous example shows four Fibre Channel devices on this SCSI bus. The Fibre Channel adapter in question, emx0, at SCSI ID 7, is denoted by the presence of the emx0 designation.

Use the emxmgr -t command to display the Fibre Channel topology for the adapter. For example:

# emxmgr -t emx1
 
emx1 state information:  [1]
  Link :  connection is UP                          
          Point to Point
          Fabric attached
          FC DID 0x210413
  Link is SCSI bus 3 (e.g. scsi3)
          SCSI target id 7
          portname is 1000-0000-C921-07C4 
          nodename is 1000-0000-C921-07C4           
  N_Port at FC DID 0x210013 - SCSI tgt id 5 :  [2]
    portname 5000-1FE1-0001-8932
    nodename 5000-1FE1-0001-8930
    Present, Logged in, FCP Target, FCP Logged in,   
  N_Port at FC DID 0x210113 - SCSI tgt id 1 :  [2]
    portname 5000-1FE1-0001-8931
    nodename 5000-1FE1-0001-8930
    Present, Logged in, FCP Target, FCP Logged in, 
  N_Port at FC DID 0x210213 - SCSI tgt id 2 :  [2]
    portname 5000-1FE1-0001-8941
    nodename 5000-1FE1-0001-8940
    Present, Logged in, FCP Target, FCP Logged in, 
  N_Port at FC DID 0x210313 - SCSI tgt id 4 :  [2]
    portname 5000-1FE1-0001-8942
    nodename 5000-1FE1-0001-8940
    Present, Logged in, FCP Target, FCP Logged in, 
  N_Port at FC DID 0x210513 - SCSI tgt id 6 :  [2]
    portname 1000-0000-C921-07F4
    nodename 2000-0000-C921-07F4
    Present, Logged in, FCP Initiator, FCP Target, FCP Logged in, 
  N_Port at FC DID 0xfffffc - SCSI tgt id -1 :  [3]
    portname 20FC-0060-6900-5A1B
    nodename 1000-0060-6900-5A1B
    Present, Logged in, Directory Server, 
  N_Port at FC DID 0xfffffe - SCSI tgt id -1 :  [3]
    portname 2004-0060-6900-5A1B
    nodename 1000-0060-6900-5A1B
    Present, Logged in, F_PORT,

Status of the emx1 link. The connection is a point-to-point fabric (switch) connection, and the link is up. The adapter is on SCSI bus 3 at SCSI ID 7. Both the port name and node name of the adapter (the worldwide name) are provided. The Fibre Channel DID number is the physical Fibre Channel address being used by the N_Port. [Return to example]

A list of all other Fibre Channel devices on this SCSI bus, with their SCSI ID, port name, node name, physical Fibre Channel address and other items such as:
- Present -- The adapter indicates that this N_Port is present on the fabric.
- Logged in -- The adapter and remote N_Port have exchanged initialization parameters and have an open channel for communications (nonprotocol-specific communications).
- FCP Target -- This N_Port acts as a SCSI target device (it receives SCSI commands).
- FCP Logged in -- The adapter and remote N_Port have exchanged FCP-specific initialization parameters and have an open channel for communications (Fibre Channel protocol-specific communications).
- Logged Out -- The adapter and remote N_Port do not have an open channel for communication.
- FCP Initiator -- The remote N_Port acts as a SCSI initiator device (it sends SCSI commands).
- FCP Suspended -- The driver has invoked a temporary suspension on SCSI traffic to the N_Port while it resolves a change in connectivity.
- F_PORT -- The fabric connection (F_Port) allows the adapter to send Fibre Channel traffic into the fabric.
- Directory Server -- The N_Port is the FC entity queried to determine who is present on the Fibre Channel fabric.
[Return to example]

A target ID of -1 (or -2) that shows up for remote Fibre Channel devices that do not communicate using Fibre Channel protocol, the directory server, and F_Port. [Return to example]

Note

You can use the emxmgr utility interactively to perform any of the previous functions.

6.11.2 Using the emxmgr Utility in an Arbitrated Loop Topology

The following example shows the results of the emxmgr -t command in an arbitrated loop topology.

# emxmgr -t emx0
 
emx0 state information:
  Link :  connection is UP     
          FC-AL (Loop)  [1]
          FC DID 0x000001
  Link is SCSI bus 2 (e.g. scsi2)
          SCSI target id 7
          portname is 1000-0000-C920-5F0E 
          nodename is 1000-0000-C920-5F0E 
  N_Port at FC DID 0x000002 - SCSI tgt id 6 :
    portname 1000-0000-C920-043C
    nodename 1000-0000-C920-043C
    Present, Logged in, FCP Initiator, FCP Target, FCP Logged in, 
  N_Port at FC DID 0x00006b - SCSI tgt id 2 :
    portname 2200-0020-3704-846F
    nodename 2000-0020-3704-846F
    Present, Logged in, FCP Target, FCP Logged in,
  N_Port at FC DID 0x00006c - SCSI tgt id 3 :
    portname 2200-0020-3704-A822
    nodename 2000-0020-3704-A822
    Present, Logged in, FCP Target, FCP Logged in, 
  N_Port at FC DID 0x00002d - SCSI tgt id 1 :
    portname 2200-0020-3703-146B
    nodename 2000-0020-3703-146B
    Present, Logged in, FCP Target, FCP Logged in, 
  N_Port at FC DID 0x00002e - SCSI tgt id 0 :
    portname 2200-0020-3703-137D
    nodename 2000-0020-3703-137D
    Present, Logged in, FCP Target, FCP Logged in, 
  N_Port at FC DID 0x00006e - SCSI tgt id 4 :
    portname 2200-0020-3700-55CB
    nodename 2000-0020-3700-55CB
    Present, Logged in, FCP Target, FCP Logged in,

Status of the emx0 link. The connection is a Fibre Channel arbitrated loop (FC-AL) connection, and the link is up. The adapter is on SCSI bus 2 at SCSI ID 7. The port name and node name of the adapter are provided.
The Fibre Channel DID number is the physical Fibre Channel address being used by the N_Port. [Return to example]

6.11.3 Using the emxmgr Utility Interactively

Start the emxmgr utility without any command-line options to enter the interactive mode to:

Display the presence of KGPSA Fibre Channel adapters

Display the target ID mappings for a Fibre Channel adapter

Display the current Fibre Channel topology for a Fibre Channel adapter

You have already seen how you can perform these functions from the command line. The same output is available using the interactive mode by selecting the appropriate option (shown in the following example).

When you start the emxmgr utility with no command-line options, the default device used is the first Fibre Channel adapter it finds. If you want to perform functions for another adapter, you must change the targeted adapter to the correct adapter. For instance, if emx0 is present, when you start the emxmgr interactively, any commands executed to display information will provide the information for emx0.

Notes

The emxmgr has an extensive help facility in the interactive mode.
Option 3, "Change Target ID Mappings" is a hold-over from the Tru64 UNIX Version 4.0F product and has no use in the Tru64 UNIX Version 5.1A product. Do not use this option.

An example using the emxmgr in the interactive mode follows:

# emxmgr
 
Now issuing commands to : "emx0"
 
Select Option (against "emx0"):
 
  1.  View adapter's current Topology
  2.  View adapter's Target Id Mappings
  3.  Change Target ID Mappings
 
  d.  Display Attached Adapters
  a.  Change targeted adapter
  x.  Exit
 
  ----> 2
 
emx0 SCSI target id assignments:
    SCSI tgt id   0 : portname 5000-1FE1-0000-0CB2      
                      nodename 5000-1FE1-0000-0CB0
    SCSI tgt id   5 : portname 1000-0000-C920-A7AE      
                      nodename 1000-0000-C920-A7AE
    SCSI tgt id   6 : portname 1000-0000-C920-CD9C      
                      nodename 1000-0000-C920-CD9C
    SCSI tgt id   7 : portname 1000-0000-C921-0D00      (emx0)
                      nodename 1000-0000-C921-0D00
 
Select Option (against "emx0"):
 
  1.  View adapter's current Topology
  2.  View adapter's Target Id Mappings
  3.  Change Target ID Mappings
 
  d.  Display Attached Adapters
  a.  Change targeted adapter
  x.  Exit
 
  ----> x
#