The Tru64 UNIX (formerly DIGITAL UNIX) internationalization tools and routines allow you to write programs for use in a number of nations. These features let you write programs with:
An interface that appears to a nation's users as if it was designed for them
Source code that is independent of specific native languages and customs
This guide is intended for experienced applications developers who are
writing programs intended for multinational or non-English language use.
Translators
who translate the messages displayed by international programs might also
find this guide useful.
New and Changed Features
This manual was produced for Tru64 UNIX Version 5.0.
The following
section discusses changes made for this revision of the manual.
For the benefit
of customers who are upgrading from a Version 3 rather than a Version 4 release,
changes made to the manual for Versions 4.0F and 4.0 are included in subsequent
sections.
New and Changed Features for Tru64 UNIX Version 5.0
This version of the manual was revised to describe the following new features:
Enhancements to the
file
command
In addition, this manual was revised to include a replacement Latin-1
locale source example in
Chapter 7
and
Appendix E
and to correct various omissions, inaccuracies, and typographical errors.
New and Changed Features for Tru64 UNIX Version 4.0F
The version of this manual produced for Tru64 UNIX Version 4.0F was revised to discuss the following features:
Codeset converters, locales, or both for the UCS-2, UTF-8, and PC code page encoding formats
Support for the European monetary sign (euro character)
Generic internationalized PostScript print filters
Support for running a CDE application in a language different from the session language
New and changed functions for conformance to the XSH5 CAE specification. This specification includes functions added or changed by the 1994 amendment to the ISO C standard.
Minor changes to Curses library functions for conformance to Issue 4 Version 2 of the X/Open Curses CAE specification
Addition of PostScript fonts for Hebrew
New and Changed Features for DIGITAL UNIX Version 4.0
The version of this manual produced for DIGITAL UNIX Version 4.0 was revised to discuss the following features:
Locales and other software to support Catalan, Lithuanian, and Slovene
Support for character processing in UCS-4 format
Curses library routines that handle multibyte characters and also conform to the X/Open Curses CAE specification
Support for X11R6 libraries
Support for the Common Desktop Environment (CDE)
Internationalization enhancements to the printing and mail subsystems
Font renderers for use by X applications
Multilingual Emacs editor (mule
)
Codeset conversion improvements to support better the multiple codesets available for Chinese and Japanese
Functions added to the Standard C Library by the 1994 amendment to the ISO C standard
In addition, a glossary was added to the manual.
Organization
This guide includes seven chapters, five appendixes, and a glossary.
Chapter 1 | Introduction Introduces the basic concepts and procedures for writing programs that meet the needs of international users. |
Chapter 2 | Developing Internationalized Software Discusses techniques for handling character sets, cultural data, and language in an application. |
Chapter 3 | Creating and Using Message Catalogs Explains how to extract and translate text for messages, and how to generate and access message catalogs. |
Chapter 4 | Handling Wide-Character Data with curses Routines Describes the
|
Chapter 5 | Creating Internationalized X, Xt, and Motif Applications Discusses how to use GUI programming libraries (X, OSF/Motif, and DECwindows Extensions to OSF/Motif) when writing internationalized programs. |
Chapter 6 | Using Internationalized Software From a programmer's perspective, discusses setup requirements for using applications in different language environments. This chapter also explains how to use Tru64 UNIX commands and other applications in a multilanguage working environment. |
Chapter 7 | Creating Locales Discusses
the source files for a locale and how to process them with the
|
Appendix A | Summary Tables of Worldwide Portability Interfaces Lists and summarizes internationalized functions for locale initialization, character classification, case conversion, character collation, language information, date and time interpretation, printing and scanning text strings, number conversion, handling multibyte characters, input/output, and string manipulation. |
Appendix B | Setting Up and Using User-Defined Character Databases Describes support for user-defined characters (UDCs) in Chinese, Japanese, and Korean. |
Appendix C | Setting Up and Using the Chinese Phrase Input Method Describes support for phrase input that is used with Chinese. |
Appendix D | Using DECterm Localization Features in Programs Describes programming features specific to the
|
Appendix E | Sample Locale Source Files Contains complete source files for the sample locale discussed in Chapter 7. |
Glossary | Defines terms and acronyms used in this book. |
The following manuals in the Tru64 UNIX documentation set provide information about using the C compiler and other program development tools on a Tru64 UNIX system. If you are developing internationalized applications, refer to these manuals for general programming information.
OSF/Motif Programmer's Guide
The following book, published by O'Reilly and Associates, Inc., is also a good reference:
Programmer's Supplement for Release 5 of the X Window System, Version 11
The following standards or draft standards apply to software components discussed in this guide. This guide refers to some of these documents.
ANS X3.159 Programming Language C
ISO/IEC 646: 1983
Information processing - ISO 7-bit coded character set for information interchange.
ISO 6937: 1983
Information processing - Coded character sets for text communication.
ISO 8859-1: 1987
Information processing - ISO 8-bit single-byte coded graphic character sets - Latin alphabet No. 1.
ISO/IEC 9899: 1990
Information technology - Programming Languages - C.
ISO/IEC 9945-1: 1990
Information technology - Portable operating system interface (POSIX) - Part 1: System application programming interface (API) [C Language].
ISO/IEC 9945-2: 1993
Information technology - Portable operating system interface (POSIX) - Part 2: Shells and Utilities.
ISO/IEC 10646-1:1993
Information Technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 1: Architecture and Basic Multilingual Plane, 1993
Code for Information Interchange, JIS X0201-1976; Japanese national standard.
Code of the Japanese Graphic Character Set for Information Interchange, JIS X0208-1990; Japanese national standard.
Code of the Supplementary Japanese Graphic Character Set, JIS X0212-1990; Japanese national standard.
Codes of Chinese Graphic Characters for Information Interchange, Primary Set (GB2312-80); National Standards Bureau of China, Beijing, 1980.
Standard Codes of Common Chinese Characters for Information Interchange, CNS 11643; Taiwan, 1986, 1992.
Standard Codes of Korean Characters for Information Interchange, KSC 5601; Korea, 1987.
Thai Industrial Standard, TIS 620-2533; Standard for a primary set of graphic characters used for Thai information interchange.
The Open Group UNIX CAE specifications, specifically:
Commands and Utilities, Issue 5
Systems Interfaces and Headers, Issue 5
System Interface Definitions, Issue 5
Networking Services, Issue 5
X/Open Curses, Issue 4 Version 2
The Unicode Standard, Version 2.0
The following book provides information about cultural and linguistic requirements around the world and the changes needed in computer systems to handle those requirements:
Programming for the World: A Guide to Internationalization, O'Donnell, Sandra Martin, Prentice-Hall, Inc., Englewood Cliffs, New Jersey, 1994
Articles in the following technical journal cover topics related to product internationalization:
Digital Technical Journal, Volume 5 Number 3 (published Summer 1993)
Icons on Tru64 UNIX Printed Books
The printed version of the Tru64 UNIX documentation uses letter icons on the spines of the books to help specific audiences quickly find the books that meet their needs. (You can order the printed documentation from Compaq.) The following list describes this convention:
G | Books for general users |
S | Books for system and network administrators |
P | Books for programmers |
D | Books for device driver writers |
R | Books for reference page users |
Some books in the documentation help meet the needs of several audiences. For example, the information in some system books is also used by programmers. Keep this in mind when searching for information on specific topics.
The
Documentation Overview
provides
information on all of the books in the Tru64 UNIX documentation set.
Reader's Comments
Compaq welcomes any comments and suggestions you have on this and other Tru64 UNIX manuals.
You can send your comments in the following ways:
Fax: 603-884-0120 Attn: UBPG Publications, ZKO3-3/Y32
Internet electronic mail:
readers_comment@zk3.dec.com
A Reader's Comment form is located on your system in the following location:
/usr/doc/readers_comment.txt
Mail:
Compaq Computer Corporation
UBPG Publications Manager
ZKO3-3/Y32
110 Spit Brook Road
Nashua, NH 03062-2698
A Reader's Comment form is located in the back of each printed manual. The form is postage paid if you mail it in the United States.
Please include the following information along with your comments:
The full title of the book and the order number. (The order number is printed on the title page of this book and on its back cover.)
The section numbers and page numbers of the information on which you are commenting.
The version of Tru64 UNIX that you are using.
If known, the type of processor that is running the Tru64 UNIX software.
The Tru64 UNIX Publications group cannot respond to system problems or
technical support inquiries.
Please address technical questions to your
local system vendor or to the appropriate Compaq technical support office.
Information provided with the software media explains how to send problem
reports to Compaq.
Conventions
The following conventions are used in this manual:
%
$
A percent sign represents the C shell system prompt. A dollar sign represents the system prompt for the Bourne and Korn shells.
#
A number sign represents the superuser prompt.
%
cat
Boldface type in interactive examples indicates typed user input.
Italic (slanted) type indicates variable values, placeholders, and function argument names.
In syntax definitions, brackets indicate items that are optional and braces indicate items that are required. Vertical bars separating items inside brackets or braces indicate that you choose one item from among those listed.
In syntax definitions, a horizontal ellipsis indicates that the preceding item can be repeated one or more times.
A vertical ellipsis indicates that a portion of an example that would normally be present is not shown.
cat
(1)A cross-reference to a reference page includes the appropriate
section number in parentheses.
For example,
cat
(1)
indicates that
you can find information on the
cat
command in Section
1 of the reference pages.
This symbol indicates that you hold down the first named key while pressing the key or mouse button that follows the hyphen, for example, Ctrl-c.
Multiple key or mouse button names separated by spaces indicate that you press and release each in sequence, for example, Alt Space.