United States    
COMPAQ STORE | PRODUCTS |
SERVICES | SUPPORT | CONTACT US | SEARCH
Compaq C

Compaq C
User's Guide for OpenVMS Systems


Previous Contents Index

A.1.19 Multibyte and Wide Character Support

To meet the needs of non-European languages with large character sets, ANSI C includes a framework to support characters encoded in multiple bytes. This framework is general enough to support character-processing extensions and character-set encodings already used in Asia, and allows for support for the draft proposed ISO Standard 10646, a multiple octet-coded character set that supports dozens of natural languages.

ANSI C supports natural languages with large character sets by recognizing that normal character constants and string literals can be used to represent multibyte characters. A multibyte character is an encoding of variable-length characters where one, two, or more bytes in the string represents a single character in the natural language. The encoding is allowed to support locking shift states that change the encoding of characters for as long as the shift state holds.

Multibyte characters can occur in comments, character constants, and string literals.

Because string manipulation is very difficult when the character size varies from character to character, ANSI C supports a fixed-size representation where each character is stored in the same number of bytes. This representation is called wide character support. Compaq C supports a new form of wide character constant and wide string literal.

A.1.19.1 The Wide Character Type

ANSI C requires that wide characters be represented by an integral type, and that there be a typedef named wchar_t for that type in the header <stddef.h> .

Compaq C defines wchar_t to be unsigned int . This allows all character sets supported by ISO 10646 to be supported simultaneously.

A.1.19.2 Multibyte Characters in Comments, Character Constants, and String Literals

Full multibyte support requires that the compiler be able to determine whether an individual byte in a multibyte string is a single byte character or part of a multiple byte character. For example, the compiler must be able to distinguish between the single byte quote ending a string literal and a quote that is embedded in a multiple byte character and does not end the string literal.

A.1.19.3 Wide Character Constants

As required by ANSI C, Compaq C supports wide character constants. The form of such a constant is the uppercase letter l , followed by a single quote, followed by a multibyte character, followed by a single quote.

The compiler collects the bytes making up the multibyte character into a string, and then calls the Compaq C RTL mbtowc function to convert the multibyte character into a wide character. The resulting value has type wchar_t .

A.1.19.4 Wide String Literals

As required by the ANSI C Standard, Compaq C supports wide string literals. The form of such a literal is the same as a normal string literal prefixed by the uppercase letter l .

The compiler collects the bytes making up the wide string literal into a string, and then calls the Compaq C RTL mbstowcs function to convert the multibyte characters into wide characters. The resulting wide character string literal has type array of wchar_t .

A.1.20 Usual Arithmetic Conversions

In Compaq C, the usual arithmetic conversions now support the long double type: if either operand of a binary operator that uses these conversions is long double , then the other operand is converted to long double .

A.1.21 Indexing as a Commutative Operator

As required by the ANSI C Standard, Compaq C now defines the array indexing operator, [], as commutative. Thus, if a is an array and i is an integer, both a[i] and i[a] are valid.

A.1.22 Cast Operators

ANSI C specifies that result of the cast operator is not an lvalue. However, VAX C does allow the cast operator to produce an lvalue.

The Compaq C compiler in VAX C mode allows the cast operator to produce an lvalue.

A.1.23 Function Calls

The following sections describe changes to function calls.

A.1.23.1 Assignment Compatibility Argument Checking

ANSI C defines a function call made with a prototype in scope as assigning the arguments to the parameters of the function. This means that all of the normal type checking and implied conversions that occur during an assignment take place when calling a function.

VAX C currently follows this model with two exceptions. First, it only performs the required type checking if /STANDARD=PORTABLE is given. Second, the assignment compatibility rules used by VAX C are not as stringent as the rules required by ANSI C. For example, two structs are assignment-compatible in VAX C only if they are the same size.

The Compaq C compiler in VAX C mode and common mode is compatible with VAX C in assignment compatibility rules. Other modes follow the stricter ANSI C rules, documented in Section A.1.27 of this guide, and issue the required messages even when /STANDARD=PORTABLE is not specified.

A.1.23.2 Passing Narrow Types to Old Syntax Functions

Traditionally, a function written in C was always called with widened argument types. (Arguments of narrow types like char , short , or float were passed as the widened types int , int , and double , respectively.) The ANSI C Standard preserves this calling mechanism for functions declared using the old syntax. Functions declared using the new prototype syntax may be called with narrow argument types.

Tradition, however, did not specify how the compiler was to interpret a function definition that declared formal arguments of narrow type. One interpretation was that the widened types actually passed should be converted to the narrow type of the formal declaration by the function in its prologue. Another interpretation was that the compiler should rewrite the formal declarations to match the type of the argument actually passed. For example, under this second interpretation, the compiler would change a declaration of a formal argument of type float to a declaration of type double .

ANSI C has standardized the first interpretation of a function with formal arguments of narrow types. Compaq C for OpenVMS Systems uses the ANSI C interpretation in all modes.

A.1.24 "Address of" Operator

In Compaq C, if the argument of the unary & operator is an array, the result now has the type "pointer to array". Previously, in VAX C, the result would have the type "pointer to the element type of the array".

A.1.25 Unary Plus

Compaq C supports the new ANSI C operator, unary plus (+). This operator returns the value of its operand (possibly widened by the integral promotions).

A.1.26 Relational Operators

As required by ANSI C, Compaq C issues a warning (in all modes except VAX C mode) to diagnose a constraint violation if one of the operands of a relational operator is a pointer to a function. For example, the following code would issue a warning:


int (*f)(); 
if (f > NULL) 

Note that it is valid to use the equality operators to compare function pointers.

A.1.27 Assignment Compatibility

ANSI C has tighter assignment compatibility rules than those previously enforced by VAX C. (Note that assignment compatibility rules also control function argument passing.) Compaq C assignment compatibility differs from that of VAX C in the following ways:

A.1.28 Declarations

Function prototype support, the new const and volatile type qualifiers, and the void type, were already implemented in VAX C. The following sections describe the additional Compaq C support that affects declarations. References are to the relevant sections in the ANSI C Standard.

A.1.28.1 Implementation Limits

The ANSI C Standard requires that an implementation support certain minimum requirements; these are listed in the referenced section. In those cases where VAX C imposes a fixed limit, that limit has always met or exceeded the Standard's requirements, and programs that exceed any of these limits elicit the appropriate errors. In strict ANSI C mode, Compaq C now issues diagnostics against any source program constructs that exceed any of the Standard limits as well.

A.1.28.2 Identifier Name Length

In strict ANSI C mode, Compaq C now issues diagnostic messages against declarations of external names in excess of six characters, or external names that are intended to denote different objects but that have the same spelling, and ignores alphabetical case.

A.1.28.3 Diagnosing Empty Declarations

The ANSI C Standard invalidates empty declarations, except for two special cases: one involving structure/union tags and the other involving the enumeration type. In strict ANSI C mode, Compaq C issues an error message against any declaration that does not declare at least one of the following: a declarator, a tag, or the members of an enumeration.

A.1.28.4 Restriction on Placement of Storage-Class Specifiers

The ANSI C Standard specifies that allowing the placement of any storage-class specifier other than at the beginning of a declaration is an obsolete feature. In strict ANSI C mode, Compaq C now issues an informational diagnostic to that effect when appropriate.

A.1.28.5 Diagnosing Old-Style Function Declarations

The ANSI C Standard specifies that old-style function declarations and definitions (that is, those not using the function prototype format) are obsolete. Old-style function declarations and definitions cause an informational message to be issued in all modes except VAX C.

A.1.28.6 Function Definitions Using typedef-names

The ANSI C Standard restricts the form of the declarator in a function definition: the function type itself may not be inherited from a typedef -name; that is, the declarator must explicitly contain a (possibly empty) parenthesized parameter list. If not, Compaq C in strict ANSI C mode issues an error message.

A.1.28.7 Initialization

Compaq C for OpenVMS Systems supports the initialization of unions.

In VAX C, an aggregate initializer consisting of a single item does not have to have the outer braces. The outer braces are required by the ANSI C Standard.

Compaq C allows this case in VAX C mode.

A.1.29 Bit-Field Initialization

The Compaq C compiler initializes bit-field structure members differently than VAX C does. See Section 4.7.2.

A.1.30 The Preprocessor

The following sections describe the differences between the VAX C and the Compaq C preprocessors. Most of these differences reflect the Compaq C preprocessor's conformance to the ANSI C Standard. References are to the relevant sections in the ANSI C Standard.

Note that most VAX C-specific preprocessor extensions are unaffected by these changes. These extensions continue to be supported quietly in VAX C mode, but elicit appropriate diagnostics in strict ANSI C mode.

A.1.30.1 White Space Appearing Before the #

The ANSI C Standard removes the VAX C restriction that requires the # character introducing a preprocessor directive to always appear in column 1 of the source line. In Compaq C, white space and comments can now precede the # on the same line.

A.1.30.2 The #define Directive and Macro Substitution

Before the ANSI C Standard, the lack of a precise definition of the behavior of macro expansion led to a number of inconsistencies among different C implementations. Compaq C, in adhering to the ANSI C Standard, removes these and many other discrepancies by specifying precisely how macro substitution is to be performed:

As required by the ANSI C Standard, Compaq C supports two new operators that can appear only within macro definitions:

The ANSI C Standard also makes specific the sequence in which rescanning and further substitution is to take place, and under what conditions substitution does not take place. The ANSI C Standard also specifies under what circumstances a macro may be redefined: only benign redefinition is allowed, permitting a macro to be redefined only if the new definition is token-wise identical to the old definition.

A.1.30.3 The #line Directive

The ANSI C Standard specifies that macro substitution can occur on the operands of the #line directive, that the line number operand is restricted to the range 1 to 32,767, and that the file name operand must be treated as any character string literal. VAX C did not support macro substitution on this directive, performed no range checking on the line number, and restricted the length of the character string to 255.

Compaq C supports macro substitution on the #line directive, diagnoses an out-of-range line number (in strict ANSI C mode only), and allows the file name character string to be as long as the maximum length supported by the compiler for ordinary strings. (Note that the ANSI C Standard requires support for a minimum of 509 characters in a string, and that Compaq C supports strings up to 65,535 characters.)

A.1.30.4 The #error Directive

Compaq C in both strict ANSI C mode and VAX C mode supports the new #error directive required by the ANSI C Standard.

A.1.30.5 The #pragma builtins Directive

The #pragma builtins directive is provided for VAX C compatibility.

Compaq C implements #pragma builtins by including the <builtins.h> header file, and is equivalent to #include <builtins.h> on OpenVMS systems.

This header file contains prototype declarations for the built-in functions that allow them to be used properly. By contrast, VAX C implemented this pragma with special-case code within the compiler, which also supported a #pragma nobuiltins preprocessor directive to turn off the special processing. Because declarations cannot be "undeclared," Compaq C does not support #pragma nobuiltins . Furthermore, the names of all the built-in functions use a naming convention defined by ANSI C to be in a namespace reserved to the C language implementation.

A.1.30.6 The #pragma dictionary Directive

The #pragma dictionary preprocessor directive replaces the #dictionary directive, but the latter is still supported in VAX C mode for compatibility.

The #pragma dictionary and #dictionary preprocessor directives now allow you to specify whether all string data type variables should be null-terminated.

A.1.30.7 The #pragma extern_model Directive

The #pragma extern_model directive is added to control the compiler's interpretation of objects that have external linkage. This pragma lets you choose the global symbol model to be used for external variables.

A.1.30.8 The #pragma linkage Directive (ALPHA ONLY)

The #pragma linkage preprocessor directive allows you to specify special linkage types for function calls.

A.1.30.9 The #pragma use_linkage Directive (ALPHA ONLY)

The #pragma use_linkage directive associates a previously defined special linkage with a function.

A.1.30.10 The #pragma message Directive

The #pragma message directive controls the issuance of individual diagnostic messages or groups of messages. Use of this pragma overrides any command-line options that may affect the issuance of messages.

A.1.30.11 The #pragma module Directive

The #pragma module preprocessor directive replaces the #module directive, but the latter is still supported in VAX C mode for compatibility.

A.2 Features Affecting the Compaq C Run-Time Library and Include Files

This section describes new features pertaining to the standard header files in the Compaq C Run-Time Library (RTL).

A.2.1 <stddef.h>

The wchar_t type is now added to this header file. The declaration of errno is also removed.

A.2.2 <ctype.h>

Because the ANSI C Standard refers to the macros in <ctype.h> as functions, the <ctype.h> header file now includes function prototypes for functions in the Compaq C RTL that perform the same operations as the macros currently defined in this header file. These functions have been added to the Compaq C RTL.

The nonstandard toascii macro remains because, according to the ANSI C Standard, Section 4.14.2, names beginning with "to" are reserved by the ANSI C Standard when <ctype.h> is included.

A.2.3 <fp_class.h>

This header file containing IEEE floating-point class constants has been added to support the new Compaq C RTL functions fp_class , fp_classf , and fp_classl available on OpenVMS Alpha systems.

A.2.4 <locale.h>

The new standard header file <locale.h> is now supported and includes prototypes for the functions setlocale and localeconv , which have been added to the Compaq C RTL.

A.2.5 <math.h>

The functions cabs and hypot are no longer defined in the <math.h> header file when the compiler is run in strict ANSI C mode.

A.2.6 <signal.h>

The sigabrt signal is implemented and defined in the <signal.h> header file. sig_atomic_t is now defined as char . In strict ANSI C mode, the following are not declared: ssignal , gsignal , kill , pause , sleep , sigvec , sigblock , sigsetmask , sigstack , and sigpause .

In strict ANSI C mode, the names of the ill_* and fpe_* macros are changed to begin with " sig " (for example, sigill_resad_fault , sigfpe_intovf_trap , and so on) or be removed.

The badsig macro is renamed to sig_err .


Previous Next Contents Index
  

1.800.AT.COMPAQ

privacy and legal statement