ECO NUMBER:      VAXSHAD03_072
PRODUCT:         OpenVMS VAX OPERATING SYSTEM V7.2
UPDATE PRODUCT:  OpenVMS VAX OPERATING SYSTEM V7.2

COVER LETTER

1  KIT NAME:

     VAXSHAD03_072

2  KITS SUPERSEDED BY THIS KIT:

     VAXSHAD02_072

3  KIT DEPENDENCIES:

   3.1  The following remedial kit(s), or later, must be installed
        BEFORE installation of this, or any required kit:

          VAXUPDATE02_072

   3.2  In order to receive all the corrections listed in this kit,
        the following remedial kits, or later, should also be
        installed:

          None

4  KIT DESCRIPTION:

   4.1  Version(s) of OpenVMS to which this kit may be applied:

          OpenVMS VAX V7.2

   4.2  Files patched or replaced:

          o  [SYSEXE]SHADOW_SERVER.EXE (new image)
          o  [SYS$LDR]SHDRIVER.EXE (new image)
          o  [SYSEXE]SHADOW_SERVER.STB (new file)

5  PROBLEMS ADDRESSED IN VAXSHAD03_072 KIT

     o  A Host Based Raidset can hang when one member of a shadowset
        encounters an Operation Incomplete error.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

-- COVER LETTER --   Page 2   23 October 2001

     o  A system can crash with a SHADDETINCON bugcheck at
        SHDRIVER+93CD4.

        Crashdump Summary Information:
        ------------------------------
        Bugcheck Type:    SHADDETINCON, SHADOWING detects inconsistent state
        Current Process:  CTM$_00060006
        Current Image:    $1$DGA5014:[CTM$TMROOT.][CTM_HAMMER]CTM_HAMMER_ALPHA_32.EXE;1
        Failing PC:       FFFFFFFF.804A1CD4    SHDRIVER+93CD4
        Failing PS:       14000000.00000804
        Module:           SHDRIVER    (Link Date/Time: 15-DEC-2000 15:08:57.95)
        Offset:           00093CD4

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  A system crash occurs with SHADDETINCON in SHDRIVER + 000762A0.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  A system crash can occur with SHADDETINCON in the WATCHER node
        when a master member identifies that an IN_SET lock value
        block is not a member of the set on the WATCHER node.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  Multiple systems can hang on cluster shutdown.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  A system disk MVTIMEOUT is not managed correctly.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  A system crash occurs with SHADDETINCON in EXPEL_DEVICE when
        membership event status cannot be determined in
        end_mbr_change_vp.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  If SYSGEN system check is enabled, the first MOUNT of a system
        disk will crash the system.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  Mount verification messages can occur with no apparent cause.
        There is no way to identify what is causing these messages.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

6  PROBLEMS ADDRESSED IN VAXSHAD02_072 KIT

     o  The virtual unit hangs until MVTIMEOUT expires.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  If one of the existing members of the system disk shadow set
        cannot be found, a lock status race condition could occur.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  An INVEXCPTN crash occurs in SHIN$RESTORE_WLE_ENTRY when a
        write completes to a multi-member shadow set.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  An "Incompatible shadow set member" (SS$_INCSHAMEM) error
        message is displayed.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  A SHADDETINCON error displays after removing or adding a
        shadowset member.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  Data may be corrupted when all of the following occur: a full
        merge is pending or in progress on a two-member shadow set, a
        third member is then added to the shadow set, there is a
        difference between the two SRC members, and the master member
        is removed from the shadow set.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  When a path to a device is lost during a write operation, the
        SCB (system control block) can contain a stale master member
        index value.  This will cause the system to crash with a
        SHADDETINCON bugcheck.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  A system crash occurs in SHLK$MERGE_SIGNAL on a cluster node
        when another cluster node, holding the MRGVAL lock, either
        crashes or is shut down.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  Increase the merge factor for shadowing from 1,000 to 10,000.
        This change also displays the merge factor only during an
        actual merge operation.

        Images Affected:

          - [SYSEXE]SHADOW_SERVER.EXE

7  PROBLEMS ADDRESSED IN VAXSHAD01_072 KIT

     o  The shadowset goes into MOUNTVERIFYTIMEOUT and cannot be
        remounted.  The process attempting the mount hangs.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  After entering an incorrect command involving a shadowset, the
        system crashes with SHADDETINCON.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  Disabling a FibreChannel cascade connection results in an
        INVEXCPTN crash.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  Disabling a FibreChannel cascade connection corrupts a
        shadowset member.  When the cascade connection is broken, each
        of the two nodes can see only its local FC device, and both
        enter mount verification.  One node will throw out its remote
        member and continue using the last member.  The other node
        will then throw out its remote member, i.e., the disk that has
        just been used by the first node.  This leaves the first node
        with zero members, while the second node continues with a
        member that did not get the last set of writes.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  Bit 16 in SHADOW_SYS_DISK can be set by the user to eliminate
        using remote members of the shadowset for reads.
        Occasionally, setting bit 16 fails to prevent remote members
        from being used.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  A CPUSPINWAIT bugcheck can occur if the read of the SCB of a
        shadow set member cannot pass the checksum test.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  DCD (Disk Copy Data) will not always be initiated properly.
        During an assisted operation, if the source member was
        dismounted or otherwise removed from the shadow set, the
        connection to the controller would not clean up correctly.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  A full copy operation that is interrupted by a mini merge may
        not be completed correctly.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  When a copy operation that interrupts a merge operation is
        terminating, it finds that there are no members marked for
        the merge and the thread crashes the system with a
        SHADDETINCON bugcheck.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  SHOW DEVICES shows zero percent merged status even though the
        shadow set status does not indicate that a merge is required.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  INCSHAMEM tracing is not enabled.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  Use of bit 16 in SHADOW_SYS_DISK to bias reads toward the
        local source shadow set member does not always work.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  When all members of the system disk shadowset are configured,
        if the boot device has gone off line, SCB reads fail with
        device off line errors.  This causes the system to crash with
        SHADBOOTFAIL.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  When one or more shadowset members are in a copy state, the
        system may crash with an INVEXCPTN bugcheck.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  A system crash occurs in XQP when an I/O gets an
        SS$_DATACHECK during a shadowset copy operation.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

     o  System can crash with a SHADDETINCON bugcheck.

        Images Affected:

          - [SYS$LDR]SHDRIVER.EXE

8  KIT INSTALLATION RATING:

   The following kit installation rating, based upon current CLD
   information, is provided to serve as a guide to which customers
   should apply this remedial kit.  (Reference attached Disclaimer of
   Warranty and Limitation of Liability Statement)

     INSTALLATION RATING:

       INSTALL_2 : To be installed by all customers using the
                   following feature(s):

                     SHADOWING

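Two items in section 7 refer to bit 16 of the SHADOW_SYS_DISK
parameter.  As an aid to working out the value to enter, the short
Python sketch below shows the arithmetic only.  It assumes bits are
numbered from zero, so that bit 16 corresponds to 65536 (hex 10000);
the function and constant names are hypothetical and are not part of
OpenVMS or of this kit.

```python
# Illustrative arithmetic only; nothing here talks to an OpenVMS system.
# Assumption: bit positions are counted from zero, so bit 16 == 0x10000.

READ_BIAS_BIT = 16                      # bit number named in the cover letter
READ_BIAS_MASK = 1 << READ_BIAS_BIT     # 65536 decimal, 0x10000 hex


def with_read_bias(current_value: int) -> int:
    """Return current_value with bit 16 set, leaving other bits unchanged."""
    return current_value | READ_BIAS_MASK


def has_read_bias(current_value: int) -> bool:
    """Report whether bit 16 is set in current_value."""
    return bool(current_value & READ_BIAS_MASK)


if __name__ == "__main__":
    existing = 1                        # hypothetical existing parameter value
    updated = with_read_bias(existing)
    print(f"decimal {updated}, hex {updated:X}")
```

Setting the bit with a bitwise OR keeps any low-order bits that are
already in use in the existing parameter value.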
9  INSTALLATION INSTRUCTIONS:

   Install this kit with the VMSINSTAL utility by logging into the
   SYSTEM account, and typing the following at the DCL prompt:

     @SYS$UPDATE:VMSINSTAL VAXSHAD03_072 [location of the saveset]

   The saveset location may be a tape drive, CD, or a disk directory
   that contains the kit saveset.

   This kit requires a system reboot.  Compaq strongly recommends
   that a reboot be performed immediately after kit installation to
   avoid system instability.

   If you have other nodes in your OpenVMS cluster, they must also be
   rebooted in order to make use of the new image(s).  If it is not
   possible or convenient to reboot the entire cluster at this time,
   a rolling reboot may be performed.

Copyright (c) Compaq Computer Corporation, 2001 All Rights Reserved.
Unpublished rights reserved under the copyright laws of the United
States.

COMPAQ, the Compaq logo, VAX, Alpha, VMS, and OpenVMS are registered
in the U.S. Patent and Trademark Office.

All other product names mentioned herein may be trademarks of their
respective companies.

Confidential computer software.  Valid license from Compaq required
for possession, use, or copying.  Consistent with FAR 12.211 and
12.212, Commercial Computer Software, Computer Software
Documentation, and Technical Data for Commercial Items are licensed
to the U.S. Government under vendor's standard commercial license.

Compaq shall not be liable for technical or editorial errors or
omissions contained herein.  The information in this document is
provided as is without warranty of any kind and is subject to change
without notice.  The warranties for Compaq products are set forth in
the express limited warranty statements accompanying such products.
Nothing herein should be construed as constituting an additional
warranty.

DISCLAIMER OF WARRANTY AND LIMITATION OF LIABILITY

THIS PATCH IS PROVIDED AS IS, WITHOUT WARRANTY OF ANY KIND.
ALL EXPRESS OR IMPLIED CONDITIONS, REPRESENTATIONS AND WARRANTIES, INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY, FITNESS FOR PARTICULAR PURPOSE, OR NON-INFRINGEMENT, ARE HEREBY EXCLUDED TO THE EXTENT PERMITTED BY APPLICABLE LAW. IN NO EVENT WILL COMPAQ BE LIABLE FOR ANY LOST REVENUE OR PROFIT, OR FOR SPECIAL, INDIRECT, CONSEQUENTIAL, INCIDENTAL OR PUNITIVE DAMAGES, HOWEVER CAUSED AND REGARDLESS OF THE THEORY OF LIABILITY, WITH RESPECT TO ANY PATCH MADE AVAILABLE HERE OR TO THE USE OF SUCH PATCH.