This manual provides guidelines and management and configuration techniques
used for Online Addition and Removal (OLAR) of system components using the
Tru64TM UNIX
operating system software.
You will be able to accomplish these tasks on any
system that has OLAR capabilities, using the System Management applications: SysMan Station, SysMan Menu,
and the
hwmgr
command.
Many of the tasks discussed in this
book can be done on systems without OLAR capabilities as well.
Related topics
including component indictment and automatic deallocation as well as related
service tools also are discussed.
Audience
This manual is intended for system administrators, service technicians and system operators who are responsible for managing and configuring a Tru64 UNIX operating system. This manual is not platform specific because a subset of the features discussed are available on all systems. Specifically, you may remove, add, or replace CPUs only on platforms that support CPU OLAR. For Tru64 UNIX Version 5.1A, this is specifically the AlphaServerTM GS160 and GS320 systems.
System administrators, service technicians and system operators should have extensive knowledge of their applications and hardware configurations prior to initiating an OLAR operation. Users should have extensive knowledge of operating system concepts, commands, and utilities.
If you plan to use your system in a clustered computing environment,
see the
TruClusterTM
Server documentation.
Features
The operating system offers various System Management
applications that support Online Addition and Removal.
The tools that support
OLAR operations are SysMan Station, SysMan Menu, and the
hwmgr
command.
The following list summarizes the features offered in this release of the operating system:
Allows you to perform capacity expansion, CPU upgrades, and replace failed CPUs, without having to shut down the entire system or an operating instance. On systems with this capability, the CPUs are separately powerable. CPU status also can be changed to off line, on line.
Provides proactive error notification of potentially failing system components. Posts events through the EVM Event Manager to notify all interested applications.
Allows you to decide whether to deallocate components without user intervention when a component is indicted.
A set of service applications for performing system diagnostics, system management and configuration.
Systematically reads and writes all of the available physical memory in an effort to locate and correct single-bit memory errors.
This manual is organized as follows:
Chapter 1 | Provides an introduction to Online Addition and Removal and related features. |
Chapter 2 | Describes concepts and features important to increase system availability. References are made to other existing documentation where needed. |
Chapter 3 | Describes the Component Indictment facility that informs operators that a component has been identified as a potential point of failure and the Automatic Deallocation policy settings that decide what to do when this situation arises. |
Chapter 4 | Describes how to perform Online Addition and Removal (OLAR) operations with the various System Management applications and the command line utilities. |
Chapter 5 | Introduces the user to the Compaq service applications and describes how best to use the service applications to monitor your system and perform system diagnostics. |
Chapter 6 | Discusses the Memory Trolling capabilites of the operating system. |
The following documents are useful references when you are configuring hardware, and performing system management and configuration tasks:
System Administration Manual
This manual describes how to configure, use, and maintain the operating system. It includes information on general day-to-day activities and tasks, changing system configurations, and locating and eliminating sources of trouble. This manual is intended for the system administrators responsible for managing the operating system. It assumes a knowledge of operating system concepts, commands, and configurations.
Network Administration: Connections Manual
This manual describes how to configure and manage the network interfaces and network transports, and solve problems that might arise on systems running the Tru64 UNIX operating system software.
Technical Overview Manual
This manual describes the major components of the operating system. This manual also describes enhancements made to the operating system and provides detailed information on various aspects of the operating system.
System Configuration and Tuning Manual
This manual describes how to plan, set up, and tune high-performance and high-availability systems running the operating system.
TruCluster Server Cluster Technical Overview
This manual describes the major components and features of the TruCluster Server product.
TruCluster Server Cluster Hardware Configuration Manual
This manual describes how to set up and maintain the hardware configuration for a cluster server.
Compaq AlphaServer GS320 technical resources web site: http://www.compaq.com/alphaserver/gs320/gs320_tech.html
Compaq Documentation for the Compaq Continuous Profiling Infrastructure (DCPI) applications: http://www.tru64unix.compaq.com/dcpi/documentation.htm
Compaq Documentation for the WEBES (Web-Based Enterprise Services) suite of tools:http://www.support.compaq.com/svctools/webes/index.html
The Tru64 UNIX documentation is available on the World Wide Web at
the following URL:
http://www.tru64unix.compaq.com/docs
Icons on Tru64 UNIX Printed Manuals
The printed version of the Tru64 UNIX documentation uses letter icons on the spines of the manuals to help specific audiences quickly find the manuals that meet their needs. (You can order the printed documentation from Compaq.) The following list describes this convention:
G | Manuals for general users |
S | Manuals for system and network administrators |
P | Manuals for programmers |
R | Manuals for reference page users |
Some manuals in the documentation help meet the needs of several audiences. For example, the information in some system manuals is also used by programmers. Keep this in mind when searching for information on specific topics.
The
Documentation Overview
provides
information on all of the manuals in the Tru64 UNIX documentation set.
Reader's Comments
Compaq welcomes any comments and suggestions you have on this and other Tru64 UNIX manuals.
You can send your comments in the following ways:
Fax: 603-884-0120 Attn: UBPG Publications, ZKO3-3/Y32
Internet electronic mail:
readers_comment@zk3.dec.com
A Reader's Comment form is located on your system in the following location:
/usr/doc/readers_comment.txt
Please include the following information along with your comments:
The full title of the manual and the order number. (The order number appears on the title page of printed and PDF versions of a manual.)
The section numbers and page numbers of the information on which you are commenting.
The version of Tru64 UNIX that you are using.
If known, the type of processor that is running the Tru64 UNIX software.
The Tru64 UNIX Publications group cannot respond to system problems or
technical support inquiries.
Please address technical questions to your
local system vendor or to the appropriate Compaq technical support office.
Information provided with the software media explains how to send problem
reports to Compaq.
Conventions
The following conventions are used in this manual:
%
cat
Boldface type in interactive examples indicates typed user input.
Colored ink indicates information that you enter from the keyboard or a screen object that you must choose or click on.
cat
(1)A cross-reference to a reference page includes the appropriate section
number in parentheses.
For example,
cat
(1)
indicates that you can find information on the
cat
command in Section 1 of the reference pages.
In an example, a key name enclosed in a box indicates that you press that key.
#
A number sign represents the superuser prompt.
In syntax definitions, brackets indicate items that are optional and braces indicate items that are required. Vertical bars separating items inside brackets or braces indicate that you choose one item from among those listed.
colored
text
In syntax definitions, literal elements are colored green. Variable values, placeholders, and function argument names are colored red. No special colored text is used outside of syntax descriptions.
%
$
A percent sign represents the C shell system prompt. A dollar sign represents the system prompt for the Bourne, Korn, and POSIX shells.
Italic (slanted) type indicates variable values, placeholders, and function argument names.
A vertical ellipsis indicates that a portion of an example that would normally be present is not shown.