cpp.texi: Update.
[gcc.git] / gcc / cpp.texi
1 \input texinfo
2 @setfilename cpp.info
3 @settitle The C Preprocessor
4
5 @ifinfo
6 @dircategory Programming
7 @direntry
8 * Cpp: (cpp). The GNU C preprocessor.
9 @end direntry
10 @end ifinfo
11
12 @c @smallbook
13 @c @cropmarks
14 @c @finalout
15 @setchapternewpage odd
16 @ifinfo
17 This file documents the GNU C Preprocessor.
18
19 Copyright 1987, 1989, 1991, 1992, 1993, 1994, 1995, 1996, 1997, 1998,
20 1999, 2000, 2001 Free Software Foundation, Inc.
21
22 Permission is granted to make and distribute verbatim copies of
23 this manual provided the copyright notice and this permission notice
24 are preserved on all copies.
25
26 @ignore
27 Permission is granted to process this file through Tex and print the
28 results, provided the printed document carries copying permission
29 notice identical to this one except for the removal of this paragraph
30 (this paragraph not being relevant to the printed manual).
31
32 @end ignore
33 Permission is granted to copy and distribute modified versions of this
34 manual under the conditions for verbatim copying, provided also that
35 the entire resulting derived work is distributed under the terms of a
36 permission notice identical to this one.
37
38 Permission is granted to copy and distribute translations of this manual
39 into another language, under the above conditions for modified versions.
40 @end ifinfo
41
42 @titlepage
43 @c @finalout
44 @title The C Preprocessor
45 @subtitle Last revised January 2001
46 @subtitle for GCC version 3
47 @author Richard M. Stallman
48 @page
49 @vskip 2pc
50 This booklet is eventually intended to form the first chapter of a GNU
51 C Language manual.
52
53 @vskip 0pt plus 1filll
54 @c man begin COPYRIGHT
55 Copyright @copyright{} 1987, 1989, 1991, 1992, 1993, 1994, 1995, 1996,
56 1997, 1998, 1999, 2000, 2001
57 Free Software Foundation, Inc.
58
59 Permission is granted to make and distribute verbatim copies of
60 this manual provided the copyright notice and this permission notice
61 are preserved on all copies.
62
63 Permission is granted to copy and distribute modified versions of this
64 manual under the conditions for verbatim copying, provided also that
65 the entire resulting derived work is distributed under the terms of a
66 permission notice identical to this one.
67
68 Permission is granted to copy and distribute translations of this manual
69 into another language, under the above conditions for modified versions.
70 @c man end
71 @end titlepage
72 @page
73
74 @node Top, Global Actions,, (DIR)
75 @chapter The C Preprocessor
76 @c man begin DESCRIPTION
77
78 The C preprocessor is a @dfn{macro processor} that is used automatically
79 by the C compiler to transform your program before actual compilation.
80 It is called a macro processor because it allows you to define
81 @dfn{macros}, which are brief abbreviations for longer constructs.
82
83 The C preprocessor is intended only for macro processing of C, C++ and
84 Objective C source files. For macro processing of other files, you are
85 strongly encouraged to use alternatives like M4, which will likely give
86 you better results and avoid many problems. For example, normally the C
87 preprocessor does not preserve arbitrary whitespace verbatim, but
88 instead replaces each sequence with a single space.
89
90 For use on C-like source files, the C preprocessor provides four
91 separate facilities that you can use as you see fit:
92
93 @itemize @bullet
94 @item
95 Inclusion of header files. These are files of declarations that can be
96 substituted into your program.
97
98 @item
99 Macro expansion. You can define @dfn{macros}, which are abbreviations
100 for arbitrary fragments of C code, and then the C preprocessor will
101 replace the macros with their definitions throughout the program.
102
103 @item
104 Conditional compilation. Using special preprocessing directives, you
105 can include or exclude parts of the program according to various
106 conditions.
107
108 @item
109 Line control. If you use a program to combine or rearrange source files
110 into an intermediate file which is then compiled, you can use line
111 control to inform the compiler of where each source line originally came
112 from.
113 @end itemize
114
115 C preprocessors vary in some details. This manual discusses the GNU C
116 preprocessor, which provides a small superset of the features of ISO
117 Standard C@.
118
119 In its default mode, the GNU C preprocessor does not do a few things
120 required by the standard. These are features which are rarely, if ever,
121 used, and may cause surprising changes to the meaning of a program which
122 does not expect them. To get strict ISO Standard C, you should use the
123 @samp{-std=c89} or @samp{-std=c99} options, depending on which version
124 of the standard you want. To get all the mandatory diagnostics, you
125 must also use @samp{-pedantic}. @xref{Invocation}.
126
127 @c man end
128
129 @menu
130 * Global Actions:: Actions made uniformly on all input files.
131 * Directives:: General syntax of preprocessing directives.
132 * Header Files:: How and why to use header files.
133 * Macros:: How and why to use macros.
134 * Conditionals:: How and why to use conditionals.
135 * Assertions:: How and why to use assertions.
136 * Line Control:: Use of line control when you combine source files.
137 * Other Directives:: Miscellaneous preprocessing directives.
138 * Output:: Format of output from the C preprocessor.
139 * Implementation:: Implementation limits and behavior.
140 * Unreliable Features:: Undefined behavior and deprecated features.
141 * Invocation:: How to invoke the preprocessor; command options.
142 * Concept Index:: Index of concepts and terms.
143 * Index:: Index of directives, predefined macros and options.
144 @end menu
145
146 @node Global Actions, Directives, Top, Top
147 @section Transformations Made Globally
148 @cindex ASCII NUL handling
149
150 Most C preprocessor features are inactive unless you give specific
151 directives to request their use. (Preprocessing directives are lines
152 starting with a @samp{#} token, possibly preceded by whitespace;
153 @pxref{Directives}). However, there are four transformations that the
154 preprocessor always makes on all the input it receives, even in the
155 absence of directives. These are, in order:
156
157 @enumerate
158 @item
159 Trigraphs, if enabled, are replaced with the character they represent.
160
161 @item
162 Backslash-newline sequences are deleted, no matter where. This
163 feature allows you to break long lines for cosmetic purposes without
164 changing their meaning.
165
166 Recently, the non-traditional preprocessor has relaxed its treatment of
167 escaped newlines. Previously, the newline had to immediately follow a
168 backslash. The current implementation allows whitespace in the form of
169 spaces, horizontal and vertical tabs, and form feeds between the
170 backslash and the subsequent newline. The preprocessor issues a
171 warning, but treats it as a valid escaped newline and combines the two
172 lines to form a single logical line. This works within comments and
173 tokens, as well as between tokens. Comments are @emph{not} treated as
174 whitespace for the purposes of this relaxation, since they have not yet
175 been replaced with spaces.
176
177 @item
178 All comments are replaced with single spaces.
179
180 @item
181 Predefined macro names are replaced with their expansions
182 (@pxref{Predefined}).
183 @end enumerate
184
185 For end-of-line indicators, any of \n, \r\n, \n\r and \r are recognised,
186 and treated as ending a single line. As a result, if you mix these in a
187 single file you might get incorrect line numbering, because the
188 preprocessor would interpret the two-character versions as ending just
189 one line. Previous implementations would only handle UNIX-style \n
190 correctly, so DOS-style \r\n would need to be passed through a filter
191 first.
192
193 The first three transformations are done @emph{before} all other parsing
194 and before preprocessing directives are recognized. Thus, for example,
195 you can split a line mechanically with backslash-newline anywhere
196 (except within trigraphs since they are replaced first; see below).
197
198 @example
199 /*
200 */ # /*
201 */ defi\
202 ne FO\
203 O 10\
204 20
205 @end example
206
207 @noindent
208 is equivalent into @samp{#define FOO 1020}.
209
210 There is no way to prevent a backslash at the end of a line from being
211 interpreted as a backslash-newline. For example,
212
213 @example
214 "foo\\
215 bar"
216 @end example
217
218 is equivalent to @code{"foo\bar"}, not to @code{"foo\\bar"}. To avoid
219 having to worry about this, do not use the deprecated GNU extension
220 which permits multi-line strings. Instead, use string literal
221 concatenation:
222
223 @example
224 "foo\\"
225 "bar"
226 @end example
227
228 Your program will be more portable this way, too.
229
230 There are a few things to note about the above four transformations.
231
232 @itemize @bullet
233 @item
234 Comments and predefined macro names (or any macro names, for that
235 matter) are not recognized inside the argument of an @samp{#include}
236 directive, when it is delimited with quotes or with @samp{<} and
237 @samp{>}.
238
239 @item
240 Comments and predefined macro names are never recognized within a
241 character or string constant.
242
243 @item
244 ISO ``trigraphs'' are converted before backslash-newlines are deleted.
245 If you write what looks like a trigraph with a backslash-newline inside,
246 the backslash-newline is deleted as usual, but it is too late to
247 recognize the trigraph.
248
249 This is relevant only if you use the @samp{-trigraphs} option to enable
250 trigraph processing. @xref{Invocation}.
251 @end itemize
252
253 The preprocessor handles null characters embedded in the input file
254 depending upon the context in which the null appears. Note that here we
255 are referring not to the two-character escape sequence "\0", but to the
256 single character ASCII NUL.
257
258 There are three different contexts in which a null character may
259 appear:
260
261 @itemize @bullet
262 @item
263 Within comments. Here, null characters are silently ignored.
264
265 @item
266 Within a string or character constant. Here the preprocessor emits a
267 warning, but preserves the null character and passes it through to the
268 output file or compiler front-end.
269
270 @item
271 In any other context, the preprocessor issues a warning, and discards
272 the null character. The preprocessor treats it like whitespace,
273 combining it with any surrounding whitespace to become a single
274 whitespace block. Representing the null character by "^@@", this means
275 that code like
276
277 @example
278 #define X^@@1
279 @end example
280
281 is equivalent to
282
283 @example
284 #define X 1
285 @end example
286
287 and X is defined with replacement text "1".
288 @end itemize
289
290 @node Directives, Header Files, Global Actions, Top
291 @section Preprocessing Directives
292
293 @cindex preprocessing directives
294 @cindex directives
295 Most preprocessor features are active only if you use preprocessing
296 directives to request their use.
297
298 Preprocessing directives are lines in your program that start with
299 @samp{#}. Whitespace is allowed before and after the @samp{#}. The
300 @samp{#} is followed by an identifier that is the @dfn{directive name}.
301 For example, @samp{#define} is the directive that defines a macro.
302
303 Since the @samp{#} must be the first token on the line, it cannot come
304 from a macro expansion if you wish it to begin a directive. Also, the
305 directive name is not macro expanded. Thus, if @samp{foo} is defined as
306 a macro expanding to @samp{define}, that does not make @samp{#foo} a
307 valid preprocessing directive.
308
309 The set of valid directive names is fixed. Programs cannot define new
310 preprocessing directives.
311
312 Some directive names require arguments; these make up the rest of the
313 directive line and must be separated from the directive name by
314 whitespace. For example, @samp{#define} must be followed by a macro
315 name and the intended expansion of the macro. @xref{Object-like
316 Macros}.
317
318 A preprocessing directive cannot cover more than one line. It may be
319 logically extended with backslash-newline, but that has no effect on its
320 meaning. Comments containing newlines can also divide the directive
321 into multiple lines, but a comment is replaced by a single space before
322 the directive is interpreted.
323
324 @node Header Files, Macros, Directives, Top
325 @section Header Files
326
327 @cindex header file
328 A header file is a file containing C declarations and macro definitions
329 (@pxref{Macros}) to be shared between several source files. You request
330 the use of a header file in your program with the C preprocessing
331 directive @samp{#include}.
332
333 @menu
334 * Header Uses:: What header files are used for.
335 * Include Syntax:: How to write @samp{#include} directives.
336 * Include Operation:: What @samp{#include} does.
337 * Once-Only:: Preventing multiple inclusion of one header file.
338 * Inheritance:: Including one header file in another header file.
339 * System Headers:: Special treatment for some header files.
340 @end menu
341
342 @node Header Uses, Include Syntax, Header Files, Header Files
343 @subsection Uses of Header Files
344
345 Header files serve two kinds of purposes.
346
347 @itemize @bullet
348 @item
349 @cindex system header files
350 System header files declare the interfaces to parts of the operating
351 system. You include them in your program to supply the definitions and
352 declarations you need to invoke system calls and libraries.
353
354 @item
355 Your own header files contain declarations for interfaces between the
356 source files of your program. Each time you have a group of related
357 declarations and macro definitions all or most of which are needed in
358 several different source files, it is a good idea to create a header
359 file for them.
360 @end itemize
361
362 Including a header file produces the same results in C compilation as
363 copying the header file into each source file that needs it. Such
364 copying would be time-consuming and error-prone. With a header file,
365 the related declarations appear in only one place. If they need to be
366 changed, they can be changed in one place, and programs that include the
367 header file will automatically use the new version when next recompiled.
368 The header file eliminates the labor of finding and changing all the
369 copies as well as the risk that a failure to find one copy will result
370 in inconsistencies within a program.
371
372 The usual convention is to give header files names that end with
373 @file{.h}. Avoid unusual characters in header file names, as they
374 reduce portability.
375
376 @node Include Syntax, Include Operation, Header Uses, Header Files
377 @subsection The @samp{#include} Directive
378
379 @findex #include
380 Both user and system header files are included using the preprocessing
381 directive @samp{#include}. It has three variants:
382
383 @table @code
384 @item #include <@var{file}>
385 This variant is used for system header files. It searches for a file
386 named @var{file} in a list of directories specified by you, then in a
387 standard list of system directories. You specify directories to search
388 for header files with the command option @samp{-I} (@pxref{Invocation}).
389 The option @samp{-nostdinc} inhibits searching the standard system
390 directories; in this case only the directories you specify are searched.
391
392 The first @samp{>} character terminates the file name. The file name
393 may contain a @samp{<} character.
394
395 @item #include "@var{file}"
396 This variant is used for header files of your own program. It searches
397 for a file named @var{file} first in the current directory, then in the
398 same directories used for system header files. The current directory is
399 the directory of the current input file. It is tried first because it
400 is presumed to be the location of the files that the current input file
401 refers to. (If the @samp{-I-} option is used, the special treatment of
402 the current directory is inhibited. @xref{Invocation}.)
403
404 The first @samp{"} character terminates the file name.
405
406 In both these variants, the argument behaves like a string constant in
407 that comments are not recognized, and macro names are not expanded.
408 Thus, in @samp{#include <x/*y>} the @samp{/*} does not start a comment
409 and the directive specifies inclusion of a system header file named
410 @file{x/*y}.
411
412 However, in either variant, if backslashes occur within @var{file}, they
413 are considered ordinary text characters, not escape characters. None of
414 the character escape sequences appropriate to string constants in C are
415 processed. Thus, @samp{#include "x\n\\y"} specifies a filename
416 containing three backslashes.
417
418 @item #include @var{anything else}
419 @cindex computed @samp{#include}
420 This variant is called a @dfn{computed #include}. Any @samp{#include}
421 directive whose argument does not fit the above two forms is a computed
422 include. The text @var{anything else} is checked for macro calls, which
423 are expanded (@pxref{Macros}). When this is done, the result must match
424 one of the above two variants --- in particular, the expansion must form
425 a string literal token, or a sequence of tokens surrounded by angle
426 braces. @xref{Implementation}.
427
428 This feature allows you to define a macro which controls the file name
429 to be used at a later point in the program. One application of this is
430 to allow a site-specific configuration file for your program to specify
431 the names of the system include files to be used. This can help in
432 porting the program to various operating systems in which the necessary
433 system header files are found in different places.
434 @end table
435
436 @node Include Operation, Once-Only, Include Syntax, Header Files
437 @subsection How @samp{#include} Works
438
439 The @samp{#include} directive works by directing the C preprocessor to
440 scan the specified file as input before continuing with the rest of the
441 current file. The output from the preprocessor contains the output
442 already generated, followed by the output resulting from the included
443 file, followed by the output that comes from the text after the
444 @samp{#include} directive. For example, given a header file
445 @file{header.h} as follows,
446
447 @example
448 char *test ();
449 @end example
450
451 @noindent
452 and a main program called @file{program.c} that uses the header file,
453 like this,
454
455 @example
456 int x;
457 #include "header.h"
458
459 main ()
460 @{
461 printf (test ());
462 @}
463 @end example
464
465 @noindent
466 the output generated by the C preprocessor for @file{program.c} as input
467 would be
468
469 @example
470 int x;
471 char *test ();
472
473 main ()
474 @{
475 printf (test ());
476 @}
477 @end example
478
479 Included files are not limited to declarations and macro definitions;
480 those are merely the typical uses. Any fragment of a C program can be
481 included from another file. The include file could even contain the
482 beginning of a statement that is concluded in the containing file, or
483 the end of a statement that was started in the including file. However,
484 a comment or a string or character constant may not start in the
485 included file and finish in the including file. An unterminated
486 comment, string constant or character constant in an included file is
487 considered to end (with an error message) at the end of the file.
488
489 It is possible for a header file to begin or end a syntactic unit such
490 as a function definition, but that would be very confusing, so don't do
491 it.
492
493 The line following the @samp{#include} directive is always treated as a
494 separate line by the C preprocessor, even if the included file lacks a
495 final newline.
496
497 @node Once-Only, Inheritance, Include Operation, Header Files
498 @subsection Once-Only Include Files
499 @cindex repeated inclusion
500 @cindex including just once
501
502 Very often, one header file includes another. It can easily result that
503 a certain header file is included more than once. This may lead to
504 errors, if the header file defines structure types or typedefs, and is
505 certainly wasteful. Therefore, we often wish to prevent multiple
506 inclusion of a header file.
507
508 The standard way to do this is to enclose the entire real contents of the
509 file in a conditional, like this:
510
511 @example
512 #ifndef FILE_FOO_SEEN
513 #define FILE_FOO_SEEN
514
515 @var{the entire file}
516
517 #endif /* FILE_FOO_SEEN */
518 @end example
519
520 The macro @code{FILE_FOO_SEEN} indicates that the file has been included
521 once already. In a user header file, the macro name should not begin
522 with @samp{_}. In a system header file, this name should begin with
523 @samp{__} to avoid conflicts with user programs. In any kind of header
524 file, the macro name should contain the name of the file and some
525 additional text, to avoid conflicts with other header files.
526
527 The GNU C preprocessor is programmed to notice when a header file uses
528 this particular construct and handle it efficiently. If a header file
529 is contained entirely in a @samp{#ifndef} conditional, modulo whitespace
530 and comments, then it remembers that fact. If a subsequent
531 @samp{#include} specifies the same file, and the macro in the
532 @samp{#ifndef} is already defined, then the directive is skipped without
533 processing the specified file at all.
534
535 @findex #import
536 In the Objective C language, there is a variant of @samp{#include}
537 called @samp{#import} which includes a file, but does so at most once.
538 If you use @samp{#import} @emph{instead of} @samp{#include}, then you
539 don't need the conditionals inside the header file to prevent multiple
540 execution of the contents.
541
542 @samp{#import} is obsolete because it is not a well designed feature.
543 It requires the users of a header file --- the applications programmers
544 --- to know that a certain header file should only be included once. It
545 is much better for the header file's implementor to write the file so
546 that users don't need to know this. Using @samp{#ifndef} accomplishes
547 this goal.
548
549 @node Inheritance, System Headers, Once-Only, Header Files
550 @subsection Inheritance and Header Files
551 @cindex inheritance
552 @cindex overriding a header file
553
554 @dfn{Inheritance} is what happens when one object or file derives some
555 of its contents by virtual copying from another object or file. In
556 the case of C header files, inheritance means that one header file
557 includes another header file and then replaces or adds something.
558
559 If the inheriting header file and the base header file have different
560 names, then inheritance is straightforward: simply write @samp{#include
561 "@var{base}"} in the inheriting file.
562
563 Sometimes it is necessary to give the inheriting file the same name as
564 the base file. This is less straightforward.
565
566 For example, suppose an application program uses the system header
567 @file{sys/signal.h}, but the version of @file{/usr/include/sys/signal.h}
568 on a particular system doesn't do what the application program expects.
569 It might be convenient to define a ``local'' version, perhaps under the
570 name @file{/usr/local/include/sys/signal.h}, to override or add to the
571 one supplied by the system.
572
573 You can do this by compiling with the option @samp{-I.}, and writing a
574 file @file{sys/signal.h} that does what the application program expects.
575 Making this file include the standard @file{sys/signal.h} is not so easy
576 --- writing @samp{#include <sys/signal.h>} in that file doesn't work,
577 because it includes your own version of the file, not the standard
578 system version. Used in that file itself, this leads to an infinite
579 recursion and a fatal error in compilation.
580
581 @samp{#include </usr/include/sys/signal.h>} would find the proper file,
582 but that is not clean, since it makes an assumption about where the
583 system header file is found. This is bad for maintenance, since it
584 means that any change in where the system's header files are kept
585 requires a change somewhere else.
586
587 @findex #include_next
588 The clean way to solve this problem is to use
589 @samp{#include_next}, which means, ``Include the @emph{next} file with
590 this name.'' This directive works like @samp{#include} except in
591 searching for the specified file: it starts searching the list of header
592 file directories @emph{after} the directory in which the current file
593 was found.
594
595 Suppose you specify @samp{-I /usr/local/include}, and the list of
596 directories to search also includes @file{/usr/include}; and suppose
597 both directories contain @file{sys/signal.h}. Ordinary @samp{#include
598 <sys/signal.h>} finds the file under @file{/usr/local/include}. If that
599 file contains @samp{#include_next <sys/signal.h>}, it starts searching
600 after that directory, and finds the file in @file{/usr/include}.
601
602 @samp{#include_next} is a GCC extension and should not be used in
603 programs intended to be portable to other compilers.
604
605 @node System Headers,, Inheritance, Header Files
606 @subsection System Headers
607 @cindex system header files
608
609 The header files declaring interfaces to the operating system and
610 runtime libraries often cannot be written in strictly conforming C.
611 Therefore, GNU C gives code found in @dfn{system headers} special
612 treatment. Certain categories of warnings are suppressed, notably those
613 enabled by @samp{-pedantic}.
614
615 Normally, only the headers found in specific directories are considered
616 system headers. The set of these directories is determined when GCC is
617 compiled. There are, however, two ways to add to the set.
618
619 @findex -isystem
620 The @samp{-isystem} command line option adds its argument to the list of
621 directories to search for headers, just like @samp{-I}. In addition,
622 any headers found in that directory will be considered system headers.
623 Note that unlike @samp{-I}, you must put a space between @samp{-isystem}
624 and its argument.
625
626 All directories named by @samp{-isystem} are searched @strong{after} all
627 directories named by @samp{-I}, no matter what their order was on the
628 command line. If the same directory is named by both @samp{-I} and
629 @samp{-isystem}, @samp{-I} wins; it is as if the @samp{-isystem} option
630 had never been specified at all.
631
632 @findex #pragma GCC system_header
633 There is also a directive, @samp{#pragma GCC system_header}, which tells
634 GCC to consider the rest of the current include file a system header, no
635 matter where it was found. Code that comes before the @samp{#pragma} in
636 the file will not be affected.
637
638 @samp{#pragma GCC system_header} has no effect in the primary source file.
639
640 @node Macros, Conditionals, Header Files, Top
641 @section Macros
642
643 A macro is a sort of abbreviation which you can define once and then
644 use later. There are many complicated features associated with macros
645 in the C preprocessor.
646
647 @menu
648 * Object-like Macros:: Macros that always expand the same way.
649 * Function-like Macros:: Macros that accept arguments that are substituted
650 into the macro expansion.
651 * Macro Varargs:: Macros with variable number of arguments.
652 * Predefined:: Predefined macros that are always available.
653 * Stringification:: Macro arguments converted into string constants.
654 * Concatenation:: Building tokens from parts taken from macro arguments.
655 * Undefining:: Cancelling a macro's definition.
656 * Redefining:: Changing a macro's definition.
657 * Poisoning:: Ensuring a macro is never defined or used.
658 * Macro Pitfalls:: Macros can confuse the unwary. Here we explain
659 several common problems and strange features.
660 @end menu
661
662 @node Object-like Macros, Function-like Macros, Macros, Macros
663 @subsection Object-like Macros
664 @cindex object-like macro
665 @cindex manifest constant
666
667 An @dfn{object-like macro} is a kind of abbreviation. It is a name
668 which stands for a fragment of code. Some people refer to these as
669 @dfn{manifest constants}.
670
671 Before you can use a macro, you must @dfn{define} it explicitly with the
672 @samp{#define} directive. @samp{#define} is followed by the name of the
673 macro and then the token sequence it should be an abbreviation for,
674 which is variously referred to as the macro's @dfn{body},
675 @dfn{expansion} or @dfn{replacement list}. For example,
676
677 @example
678 #define BUFFER_SIZE 1020
679 @end example
680
681 @noindent
682 defines a macro named @samp{BUFFER_SIZE} as an abbreviation for the
683 token @samp{1020}. If somewhere after this @samp{#define} directive
684 there comes a C statement of the form
685
686 @example
687 foo = (char *) xmalloc (BUFFER_SIZE);
688 @end example
689
690 @noindent
691 then the C preprocessor will recognize and @dfn{expand} the macro
692 @samp{BUFFER_SIZE}, resulting in
693
694 @example
695 foo = (char *) xmalloc (1020);
696 @end example
697
698 The use of all upper case for macro names is a standard convention.
699 Programs are easier to read when it is possible to tell at a glance
700 which names are macros.
701
702 Normally, a macro definition can only span a single logical line, like
703 all C preprocessing directives. Comments within a macro definition may
704 contain newlines, which make no difference since each comment is
705 replaced by a space regardless of its contents.
706
707 Apart from this, there is no restriction on what can go in a macro body
708 provided it decomposes into valid preprocessing tokens. In particular,
709 parentheses need not balance, and the body need not resemble valid C
710 code. (If it does not, you may get error messages from the C
711 compiler when you use the macro.)
712
713 The C preprocessor scans your program sequentially, so macro definitions
714 take effect at the place you write them. Therefore, the following input
715 to the C preprocessor
716
717 @example
718 foo = X;
719 #define X 4
720 bar = X;
721 @end example
722
723 @noindent
724 produces as output
725
726 @example
727 foo = X;
728
729 bar = 4;
730 @end example
731
732 When the preprocessor expands a macro name, the macro's expansion
733 replaces the macro invocation, and the result is re-scanned for more
734 macros to expand. For example, after
735
736 @example
737 #define BUFSIZE 1020
738 #define TABLESIZE BUFSIZE
739 @end example
740
741 @noindent
742 the name @samp{TABLESIZE} when used in the program would go through two
743 stages of expansion, resulting ultimately in @samp{1020}.
744
745 This is not the same as defining @samp{TABLESIZE} to be @samp{1020}.
746 The @samp{#define} for @samp{TABLESIZE} uses exactly the expansion you
747 specify --- in this case, @samp{BUFSIZE} --- and does not check to see
748 whether it too contains macro names. Only when you @emph{use}
749 @samp{TABLESIZE} is the result of its expansion scanned for more macro
750 names. @xref{Cascaded Macros}.
751
752 @node Function-like Macros, Macro Varargs, Object-like Macros, Macros
753 @subsection Macros with Arguments
754 @cindex macros with argument
755 @cindex arguments in macro definitions
756 @cindex function-like macro
757
758 An object-like macro is always replaced by exactly the same tokens each
759 time it is used. Macros can be made more flexible by taking
760 @dfn{arguments}. Arguments are fragments of code that you supply each
761 time the macro is used. These fragments are included in the expansion
762 of the macro according to the directions in the macro definition. A
763 macro that accepts arguments is called a @dfn{function-like macro}
764 because the syntax for using it looks like a function call.
765
766 @findex #define
767 To define a macro that uses arguments, you write a @samp{#define}
768 directive with a list of @dfn{parameters} in parentheses after the name
769 of the macro. The parameters must be valid C identifiers, separated by
770 commas and optionally whitespace. The @samp{(} must follow the macro
771 name immediately, with no space in between. If you leave a space, you
772 instead define an object-like macro whose expansion begins with a
773 @samp{(}, and often leads to confusing errors at compile time.
774
775 As an example, here is a macro that computes the minimum of two numeric
776 values, as it is defined in many C programs:
777
778 @example
779 #define min(X, Y) ((X) < (Y) ? (X) : (Y))
780 @end example
781
782 @noindent
783 (This is not the best way to define a ``minimum'' macro in GNU C@.
784 @xref{Side Effects}, for more information.)
785
786 To invoke a function-like macro, you write the name of the macro
787 followed by a list of @dfn{arguments} in parentheses, separated by
788 commas. The invocation of the macro need not be restricted to a single
789 logical line - it can cross as many lines in the source file as you
790 wish. The number of arguments you give must match the number of
791 parameters in the macro definition; empty arguments are fine. Examples
792 of use of the macro @samp{min} include @samp{min (1, 2)} and @samp{min
793 (x + 28, *p)}.
794
795 The expansion text of the macro depends on the arguments you use. Each
796 macro parameter is replaced throughout the macro expansion with the
797 tokens of the corresponding argument. Leading and trailing argument
798 whitespace is dropped, and all whitespace between the tokens of an
799 argument is reduced to a single space. Using the same macro @samp{min}
800 defined above, @samp{min (1, 2)} expands into
801
802 @example
803 ((1) < (2) ? (1) : (2))
804 @end example
805
806 @noindent
807 where @samp{1} has been substituted for @samp{X} and @samp{2} for @samp{Y}.
808
809 Likewise, @samp{min (x + 28, *p)} expands into
810
811 @example
812 ((x + 28) < (*p) ? (x + 28) : (*p))
813 @end example
814
815 Parentheses within each argument must balance; a comma within such
816 parentheses does not end the argument. However, there is no requirement
817 for square brackets or braces to balance, and they do not prevent a
818 comma from separating arguments. Thus,
819
820 @example
821 macro (array[x = y, x + 1])
822 @end example
823
824 @noindent
825 passes two arguments to @code{macro}: @samp{array[x = y} and @samp{x +
826 1]}. If you want to supply @samp{array[x = y, x + 1]} as an argument,
827 you must write it as @samp{array[(x = y, x + 1)]}, which is equivalent C
828 code.
829
830 After the arguments have been substituted into the macro body, the
831 resulting expansion replaces the macro invocation, and re-scanned for
832 more macro calls. Therefore even arguments can contain calls to other
833 macros, either with or without arguments, and even to the same macro.
834 For example, @samp{min (min (a, b), c)} expands into this text:
835
836 @example
837 ((((a) < (b) ? (a) : (b))) < (c)
838 ? (((a) < (b) ? (a) : (b)))
839 : (c))
840 @end example
841
842 @noindent
843 (Line breaks shown here for clarity would not actually be generated.)
844
845 @cindex empty macro arguments
846 If a macro @code{foo} takes one argument, and you want to supply an
847 empty argument, simply supply no preprocessing tokens. Since whitespace
848 does not form a preprocessing token, it is optional. For example,
849 @samp{foo ()}, @samp{foo ( )} and @samp{bar (, arg2)}.
850
851 Previous GNU preprocessor implementations and documentation were
852 incorrect on this point, insisting that a function-like macro that takes
853 a single argument be passed a space if an empty argument was required.
854
855 If you use a macro name followed by something other than a @samp{(}
856 (after ignoring any whitespace that might follow), it does not form an
857 invocation of the macro, and the preprocessor does not change what you
858 have written. Therefore, it is possible for the same identifier to be a
859 variable or function in your program as well as a macro, and you can
860 choose in each instance whether to refer to the macro (if an actual
861 argument list follows) or the variable or function (if an argument list
862 does not follow). For example,
863
864 @example
865 #define foo(X) X
866 foo bar foo(baz)
867 @end example
868
869 expands to @samp{foo bar baz}. Such dual use of one name could be
870 confusing and should be avoided except when the two meanings are
871 effectively synonymous: that is, when the name is both a macro and a
872 function and the two have similar effects. You can think of the name
873 simply as a function; use of the name for purposes other than calling it
874 (such as, to take the address) will refer to the function, while calls
875 will expand the macro and generate better but equivalent code.
876
877 For example, you can use a function named @samp{min} in the same source
878 file that defines the macro. If you write @samp{&min} with no argument
879 list, you refer to the function. If you write @samp{min (x, bb)}, with
880 an argument list, the macro is expanded. If you write @samp{(min) (a,
881 bb)}, where the name @samp{min} is not followed by an open-parenthesis,
882 the macro is not expanded, so you wind up with a call to the function
883 @samp{min}.
884
885 In the definition of a macro with arguments, the list of argument names
886 must follow the macro name immediately with no space in between. If
887 there is a space after the macro name, the macro is defined as taking no
888 arguments, and all the rest of the line is taken to be the expansion.
889 The reason for this is that it is often useful to define a macro that
890 takes no arguments and whose definition begins with an identifier in
891 parentheses. This rule makes it possible for you to do either this:
892
893 @example
894 #define FOO(x) - 1 / (x)
895 @end example
896
897 @noindent
898 (which defines @samp{FOO} to take an argument and expand into minus the
899 reciprocal of that argument) or this:
900
901 @example
902 #define BAR (x) - 1 / (x)
903 @end example
904
905 @noindent
906 (which defines @samp{BAR} to take no argument and always expand into
907 @samp{(x) - 1 / (x)}).
908
909 Note that the @emph{uses} of a macro with arguments can have spaces
910 before the left parenthesis; it's the @emph{definition} where it matters
911 whether there is a space.
912
913 @node Macro Varargs, Predefined, Function-like Macros, Macros
914 @subsection Macros with Variable Numbers of Arguments
915 @cindex variable number of arguments
916 @cindex macro with variable arguments
917 @cindex rest argument (in macro)
918
919 In the ISO C standard of 1999, a macro can be declared to accept a
920 variable number of arguments much as a function can. The syntax for
921 defining the macro is similar to that of a function. Here is an
922 example:
923
924 @example
925 #define eprintf(...) fprintf (stderr, __VA_ARGS__)
926 @end example
927
928 Here @samp{@dots{}} is a @dfn{variable argument}. In the invocation of
929 such a macro, it represents the zero or more tokens until the closing
930 parenthesis that ends the invocation, including any commas. This set of
931 tokens replaces the identifier @code{__VA_ARGS__} in the macro body
932 wherever it appears. Thus, we have this expansion:
933
934 @example
935 eprintf ("%s:%d: ", input_file_name, line_number)
936 @expansion{}
937 fprintf (stderr, "%s:%d: " , input_file_name, line_number)
938 @end example
939
940 Within a @samp{#define} directive, ISO C mandates that the only place
941 the identifier @code{__VA_ARGS__} can appear is in the replacement list
942 of a variable-argument macro. It may not be used as a macro name, macro
943 argument name, or within a different type of macro. It may also be
944 forbidden in open text; the standard is ambiguous. We recommend you
945 avoid using it except for its defined purpose.
946
947 If your macro is complicated, you may want a more descriptive name for
948 the variable argument than @code{__VA_ARGS__}. GNU cpp permits this, as
949 an extension. You may write an argument name immediately before the
950 @samp{@dots{}}; that name is used for the variable argument. The
951 @code{eprintf} macro above could be written
952
953 @example
954 #define eprintf(args...) fprintf (stderr, args)
955 @end example
956
957 @noindent
958 using this extension. You cannot use @code{__VA_ARGS__} and this
959 extension in the same macro.
960
961 We might instead have defined eprintf as follows:
962
963 @example
964 #define eprintf(format, ...) fprintf (stderr, format, __VA_ARGS__)
965 @end example
966
967 This formulation looks more descriptive, but cannot be used as flexibly.
968 There is no way to produce expanded output of
969
970 @example
971 fprintf (stderr, "success!\n")
972 @end example
973
974 @noindent
975 because, in standard C, you are not allowed to leave the variable
976 argument out entirely, and passing an empty argument for the variable
977 arguments will not do what you want. Writing
978
979 @example
980 eprintf ("success!\n", )
981 @end example
982
983 @noindent
984 produces
985
986 @example
987 fprintf (stderr, "success!\n",)
988 @end example
989
990 @noindent
991 where the extra comma originates from the replacement list and not from
992 the arguments to eprintf.
993
994 There is another extension in the GNU C preprocessor which deals with
995 this difficulty. First, you are allowed to leave the variable argument
996 out entirely:
997
998 @example
999 eprintf ("success!\n")
1000 @end example
1001
1002 Second, the @samp{##} token paste operator has a special meaning when
1003 placed between a comma and a variable argument. If you write
1004
1005 @example
1006 #define eprintf(format, ...) fprintf (stderr, format, ##__VA_ARGS__)
1007 @end example
1008
1009 and the variable argument is left out when the @samp{eprintf} macro is
1010 used, then the comma before the @samp{##} will be deleted. This does
1011 @emph{not} happen if you pass an empty argument, nor does it happen if
1012 the token preceding @samp{##} is anything other than a comma.
1013
1014 Previous versions of the preprocessor implemented this extension much
1015 more generally. We have restricted it in order to minimize the
1016 difference from the C standard. @xref{Unreliable Features}.
1017
1018 @node Predefined, Stringification, Macro Varargs, Macros
1019 @subsection Predefined Macros
1020
1021 @cindex predefined macros
1022 Several object-like macros are predefined; you use them without
1023 supplying their definitions. They fall into two classes: standard
1024 macros and system-specific macros.
1025
1026 @menu
1027 * Standard Predefined:: Standard predefined macros.
1028 * Nonstandard Predefined:: Nonstandard predefined macros.
1029 @end menu
1030
1031 @node Standard Predefined, Nonstandard Predefined, Predefined, Predefined
1032 @subsubsection Standard Predefined Macros
1033 @cindex standard predefined macros
1034
1035 The standard predefined macros are available with the same meanings
1036 regardless of the machine or operating system on which you are using GNU
1037 C@. Their names all start and end with double underscores. Those
1038 preceding @code{__GNUC__} in this table are standardized by ISO C; the
1039 rest are GNU C extensions.
1040
1041 @table @code
1042 @item __FILE__
1043 @findex __FILE__
1044 This macro expands to the name of the current input file, in the form of
1045 a C string constant. The precise name returned is the one that was
1046 specified in @samp{#include} or as the input file name argument. For
1047 example, @samp{"/usr/local/include/myheader.h"} is a possible expansion
1048 of this macro.
1049
1050 @item __LINE__
1051 @findex __LINE__
1052 This macro expands to the current input line number, in the form of a
1053 decimal integer constant. While we call it a predefined macro, it's
1054 a pretty strange macro, since its ``definition'' changes with each
1055 new line of source code.
1056
1057 This and @samp{__FILE__} are useful in generating an error message to
1058 report an inconsistency detected by the program; the message can state
1059 the source line at which the inconsistency was detected. For example,
1060
1061 @smallexample
1062 fprintf (stderr, "Internal error: "
1063 "negative string length "
1064 "%d at %s, line %d.",
1065 length, __FILE__, __LINE__);
1066 @end smallexample
1067
1068 A @samp{#include} directive changes the expansions of @samp{__FILE__}
1069 and @samp{__LINE__} to correspond to the included file. At the end of
1070 that file, when processing resumes on the input file that contained
1071 the @samp{#include} directive, the expansions of @samp{__FILE__} and
1072 @samp{__LINE__} revert to the values they had before the
1073 @samp{#include} (but @samp{__LINE__} is then incremented by one as
1074 processing moves to the line after the @samp{#include}).
1075
1076 The expansions of both @samp{__FILE__} and @samp{__LINE__} are altered
1077 if a @samp{#line} directive is used. @xref{Line Control}.
1078
1079 @item __DATE__
1080 @findex __DATE__
1081 This macro expands to a string constant that describes the date on
1082 which the preprocessor is being run. The string constant contains
1083 eleven characters and looks like @w{@samp{"Feb 1 1996"}}.
1084 @c After reformatting the above, check that the date remains `Feb 1 1996',
1085 @c all on one line, with two spaces between the `Feb' and the `1'.
1086
1087 @item __TIME__
1088 @findex __TIME__
1089 This macro expands to a string constant that describes the time at
1090 which the preprocessor is being run. The string constant contains
1091 eight characters and looks like @samp{"23:59:01"}.
1092
1093 @item __STDC__
1094 @findex __STDC__
1095 This macro expands to the constant 1, to signify that this is ISO
1096 Standard C@. (Whether that is actually true depends on what C compiler
1097 will operate on the output from the preprocessor.)
1098
1099 On some hosts, system include files use a different convention, where
1100 @samp{__STDC__} is normally 0, but is 1 if the user specifies strict
1101 conformance to the C Standard. The preprocessor follows the host
1102 convention when processing system include files, but when processing
1103 user files it follows the usual GNU C convention.
1104
1105 This macro is not defined if the @samp{-traditional} option is used.
1106
1107 @item __STDC_VERSION__
1108 @findex __STDC_VERSION__
1109 This macro expands to the C Standard's version number, a long integer
1110 constant of the form @samp{@var{yyyy}@var{mm}L} where @var{yyyy} and
1111 @var{mm} are the year and month of the Standard version. This signifies
1112 which version of the C Standard the preprocessor conforms to. Like
1113 @samp{__STDC__}, whether this version number is accurate for the entire
1114 implementation depends on what C compiler will operate on the output
1115 from the preprocessor.
1116
1117 This macro is not defined if the @samp{-traditional} option is used.
1118
1119 @item __GNUC__
1120 @findex __GNUC__
1121 This macro is defined if and only if this is GNU C@. This macro is
1122 defined only when the entire GNU C compiler is in use; if you invoke the
1123 preprocessor directly, @samp{__GNUC__} is undefined. The value
1124 identifies the major version number of GNU CC (@samp{1} for GNU CC
1125 version 1, which is now obsolete, and @samp{2} for version 2).
1126
1127 @item __GNUC_MINOR__
1128 @findex __GNUC_MINOR__
1129 The macro contains the minor version number of the compiler. This can
1130 be used to work around differences between different releases of the
1131 compiler (for example, if GCC 2.6.x is known to support a feature, you
1132 can test for @code{__GNUC__ > 2 || (__GNUC__ == 2 && __GNUC_MINOR__ >= 6)}).
1133
1134 @item __GNUC_PATCHLEVEL__
1135 @findex __GNUC_PATCHLEVEL__
1136 This macro contains the patch level of the compiler. This can be
1137 used to work around differences between different patch level releases
1138 of the compiler (for example, if GCC 2.6.2 is known to contain a bug,
1139 whereas GCC 2.6.3 contains a fix, and you have code which can workaround
1140 the problem depending on whether the bug is fixed or not, you can test for
1141 @code{__GNUC__ > 2 || (__GNUC__ == 2 && __GNUC_MINOR__ > 6) ||
1142 (__GNUC__ == 2 && __GNUC_MINOR__ == 6 && __GNUC_PATCHLEVEL__ >= 3)}).
1143
1144 @item __GNUG__
1145 @findex __GNUG__
1146 The GNU C compiler defines this when the compilation language is
1147 C++; use @samp{__GNUG__} to distinguish between GNU C and GNU
1148 C++.
1149
1150 @item __cplusplus
1151 @findex __cplusplus
1152 The ISO standard for C++ requires predefining this variable. You can
1153 use @samp{__cplusplus} to test whether a header is compiled by a C
1154 compiler or a C++ compiler. The compiler currently uses a value of
1155 @samp{1}, instead of the value @samp{199711L}, which would indicate full
1156 conformance with the standard.
1157
1158 @item __STRICT_ANSI__
1159 @findex __STRICT_ANSI__
1160 GNU C defines this macro if and only if the @option{-ansi} switch, or a
1161 @option{-std} switch specifying strict conformance to some version of ISO C,
1162 was specified when GNU C was invoked. Its definition is the null string.
1163 This macro exists primarily to direct certain GNU header files not to
1164 define certain traditional Unix constructs which are incompatible with
1165 ISO C@.
1166
1167 @item __BASE_FILE__
1168 @findex __BASE_FILE__
1169 This macro expands to the name of the main input file, in the form
1170 of a C string constant. This is the source file that was specified
1171 on the command line of the preprocessor or C compiler.
1172
1173 @item __INCLUDE_LEVEL__
1174 @findex __INCLUDE_LEVEL_
1175 This macro expands to a decimal integer constant that represents the
1176 depth of nesting in include files. The value of this macro is
1177 incremented on every @samp{#include} directive and decremented at the
1178 end of every included file. It starts out at 0, it's value within the
1179 base file specified on the command line.
1180
1181 @item __VERSION__
1182 @findex __VERSION__
1183 This macro expands to a string constant which describes the version
1184 number of GNU C@. The string is normally a sequence of decimal numbers
1185 separated by periods, such as @samp{"2.6.0"}.
1186
1187 @item __OPTIMIZE__
1188 @findex __OPTIMIZE__
1189 GNU CC defines this macro in optimizing compilations. It causes certain
1190 GNU header files to define alternative macro definitions for some system
1191 library functions. You should not refer to or test the definition of
1192 this macro unless you make very sure that programs will execute with the
1193 same effect regardless.
1194
1195 @item __OPTIMIZE_SIZE__
1196 @findex __OPTIMIZE_SIZE__
1197 GNU CC defines this macro when optimizing for size with @samp{-Os}. It
1198 causes certain GNU header files to define alternative macro definitions
1199 for some system library functions. You should not refer to or test the
1200 definition of this macro unless you make very sure that programs will
1201 execute with the same effect regardless.
1202
1203 @item __CHAR_UNSIGNED__
1204 @findex __CHAR_UNSIGNED__
1205 GNU C defines this macro if and only if the data type @code{char} is
1206 unsigned on the target machine. It exists to cause the standard header
1207 file @file{limits.h} to work correctly. You should not refer to this
1208 macro yourself; instead, refer to the standard macros defined in
1209 @file{limits.h}. The preprocessor uses this macro to determine whether
1210 or not to sign-extend large character constants written in octal; see
1211 @ref{#if Directive,,The @samp{#if} Directive}.
1212
1213 @item __REGISTER_PREFIX__
1214 @findex __REGISTER_PREFIX__
1215 This macro expands to a string (not a string constant) describing the
1216 prefix applied to CPU registers in assembler code. You can use it to
1217 write assembler code that is usable in multiple environments. For
1218 example, in the @samp{m68k-aout} environment it expands to the null
1219 string, but in the @samp{m68k-coff} environment it expands to the string
1220 @samp{%}.
1221
1222 @item __USER_LABEL_PREFIX__
1223 @findex __USER_LABEL_PREFIX__
1224 Similar to @code{__REGISTER_PREFIX__}, but describes the prefix applied
1225 to user generated labels in assembler code. For example, in the
1226 @samp{m68k-aout} environment it expands to the string @samp{_}, but in
1227 the @samp{m68k-coff} environment it expands to the null string. This
1228 does not work with the @samp{-mno-underscores} option that the i386
1229 OSF/rose and m88k targets provide nor with the @samp{-mcall*} options of
1230 the rs6000 System V Release 4 target.
1231 @end table
1232
1233 @node Nonstandard Predefined,, Standard Predefined, Predefined
1234 @subsubsection Nonstandard Predefined Macros
1235
1236 The C preprocessor normally has several predefined macros that vary
1237 between machines because their purpose is to indicate what type of
1238 system and machine is in use. This manual, being for all systems and
1239 machines, cannot tell you exactly what their names are; instead, we
1240 offer a list of some typical ones. You can use @samp{cpp -dM} to see
1241 the values of predefined macros; see @ref{Invocation}.
1242
1243 Some nonstandard predefined macros describe the operating system in use,
1244 with more or less specificity. For example,
1245
1246 @table @code
1247 @item unix
1248 @findex unix
1249 @samp{unix} is normally predefined on all Unix systems.
1250
1251 @item BSD
1252 @findex BSD
1253 @samp{BSD} is predefined on recent versions of Berkeley Unix
1254 (perhaps only in version 4.3).
1255 @end table
1256
1257 Other nonstandard predefined macros describe the kind of CPU, with more or
1258 less specificity. For example,
1259
1260 @table @code
1261 @item vax
1262 @findex vax
1263 @samp{vax} is predefined on Vax computers.
1264
1265 @item mc68000
1266 @findex mc68000
1267 @samp{mc68000} is predefined on most computers whose CPU is a Motorola
1268 68000, 68010 or 68020.
1269
1270 @item m68k
1271 @findex m68k
1272 @samp{m68k} is also predefined on most computers whose CPU is a 68000,
1273 68010 or 68020; however, some makers use @samp{mc68000} and some use
1274 @samp{m68k}. Some predefine both names. What happens in GNU C
1275 depends on the system you are using it on.
1276
1277 @item M68020
1278 @findex M68020
1279 @samp{M68020} has been observed to be predefined on some systems that
1280 use 68020 CPUs --- in addition to @samp{mc68000} and @samp{m68k}, which
1281 are less specific.
1282
1283 @item _AM29K
1284 @findex _AM29K
1285 @itemx _AM29000
1286 @findex _AM29000
1287 Both @samp{_AM29K} and @samp{_AM29000} are predefined for the AMD 29000
1288 CPU family.
1289
1290 @item ns32000
1291 @findex ns32000
1292 @samp{ns32000} is predefined on computers which use the National
1293 Semiconductor 32000 series CPU.
1294 @end table
1295
1296 Yet other nonstandard predefined macros describe the manufacturer of
1297 the system. For example,
1298
1299 @table @code
1300 @item sun
1301 @findex sun
1302 @samp{sun} is predefined on all models of Sun computers.
1303
1304 @item pyr
1305 @findex pyr
1306 @samp{pyr} is predefined on all models of Pyramid computers.
1307
1308 @item sequent
1309 @findex sequent
1310 @samp{sequent} is predefined on all models of Sequent computers.
1311 @end table
1312
1313 These predefined symbols are not only nonstandard, they are contrary to the
1314 ISO standard because their names do not start with underscores.
1315 Therefore, the option @samp{-ansi} inhibits the definition of these
1316 symbols.
1317
1318 This tends to make @samp{-ansi} useless, since many programs depend on
1319 the customary nonstandard predefined symbols. Even system header files
1320 check them and will generate incorrect declarations if they do not find
1321 the names that are expected. You might think that the header files
1322 supplied for the Uglix computer would not need to test what machine they
1323 are running on, because they can simply assume it is the Uglix; but
1324 often they do, and they do so using the customary names. As a result,
1325 very few C programs will compile with @samp{-ansi}. We intend to avoid
1326 such problems on the GNU system.
1327
1328 What, then, should you do in an ISO C program to test the type of machine
1329 it will run on?
1330
1331 GNU C offers a parallel series of symbols for this purpose, whose names
1332 are made from the customary ones by adding @samp{__} at the beginning
1333 and end. Thus, the symbol @code{__vax__} would be available on a Vax,
1334 and so on.
1335
1336 The set of nonstandard predefined names in the GNU C preprocessor is
1337 controlled (when @code{cpp} is itself compiled) by the macro
1338 @samp{CPP_PREDEFINES}, which should be a string containing @samp{-D}
1339 options, separated by spaces. For example, on the Sun 3, we use the
1340 following definition:
1341
1342 @example
1343 #define CPP_PREDEFINES "-Dmc68000 -Dsun -Dunix -Dm68k"
1344 @end example
1345
1346 @noindent
1347 This macro is usually specified in @file{tm.h}.
1348
1349 @node Stringification, Concatenation, Predefined, Macros
1350 @subsection Stringification
1351
1352 @cindex stringification
1353 @dfn{Stringification} means turning a sequence of preprocessing tokens
1354 into a string literal. For example, stringifying @samp{foo (z)} results
1355 in @samp{"foo (z)"}.
1356
1357 In the C preprocessor, stringification is possible when macro arguments
1358 are substituted during macro expansion. When a parameter appears
1359 preceded by a @samp{#} token in the replacement list of a function-like
1360 macro, it indicates that both tokens should be replaced with the
1361 stringification of the corresponding argument during expansion. The
1362 same argument may be substituted in other places in the definition
1363 without stringification if the argument name appears in those places
1364 with no preceding @samp{#}.
1365
1366 Here is an example of a macro definition that uses stringification:
1367
1368 @smallexample
1369 @group
1370 #define WARN_IF(EXP) \
1371 do @{ if (EXP) \
1372 fprintf (stderr, "Warning: " #EXP "\n"); @} \
1373 while (0)
1374 @end group
1375 @end smallexample
1376
1377 @noindent
1378 Here the argument for @samp{EXP} is substituted once, as-is, into the
1379 @samp{if} statement, and once, stringified, into the argument to
1380 @samp{fprintf}. The @samp{do} and @samp{while (0)} are a kludge to make
1381 it possible to write @samp{WARN_IF (@var{arg});}, which the resemblance
1382 of @samp{WARN_IF} to a function would make C programmers want to do; see
1383 @ref{Swallow Semicolon}.
1384
1385 The stringification feature is limited to transforming the tokens of a
1386 macro argument into a string constant: there is no way to combine the
1387 argument with surrounding text and stringify it all together. The
1388 example above shows how an equivalent result can be obtained in ISO
1389 Standard C, using the fact that adjacent string constants are
1390 concatenated by the C compiler to form a single string constant. The
1391 preprocessor stringifies the actual value of @samp{EXP} into a separate
1392 string constant, resulting in text like
1393
1394 @smallexample
1395 @group
1396 do @{ if (x == 0) \
1397 fprintf (stderr, "Warning: " "x == 0" "\n"); @} \
1398 while (0)
1399 @end group
1400 @end smallexample
1401
1402 @noindent
1403 but the compiler then sees three consecutive string constants and
1404 concatenates them into one, producing effectively
1405
1406 @smallexample
1407 do @{ if (x == 0) \
1408 fprintf (stderr, "Warning: x == 0\n"); @} \
1409 while (0)
1410 @end smallexample
1411
1412 Stringification in C involves more than putting double-quote characters
1413 around the fragment. The preprocessor backslash-escapes the surrounding
1414 quotes of string literals, and all backslashes within string and
1415 character constants, in order to get a valid C string constant with the
1416 proper contents. Thus, stringifying @samp{p = "foo\n";} results in
1417 @samp{"p = \"foo\\n\";"}. However, backslashes that are not inside
1418 string or character constants are not duplicated: @samp{\n} by itself
1419 stringifies to @samp{"\n"}.
1420
1421 Whitespace (including comments) in the text being stringified is handled
1422 according to precise rules. All leading and trailing whitespace is
1423 ignored. Any sequence of whitespace in the middle of the text is
1424 converted to a single space in the stringified result.
1425
1426 @node Concatenation, Undefining, Stringification, Macros
1427 @subsection Concatenation
1428 @cindex concatenation
1429 @cindex @samp{##}
1430 @dfn{Concatenation} means joining two strings into one. In the context
1431 of macro expansion, concatenation refers to joining two preprocessing
1432 tokens to form one. In particular, a token of a macro argument can be
1433 concatenated with another argument's token or with fixed text to produce
1434 a longer name. The longer name might be the name of a function,
1435 variable, type, or a C keyword; it might even be the name of another
1436 macro, in which case it will be expanded.
1437
1438 When you define a function-like or object-like macro, you request
1439 concatenation with the special operator @samp{##} in the macro's
1440 replacement list. When the macro is called, any arguments are
1441 substituted without performing macro expansion, every @samp{##} operator
1442 is deleted, and the two tokens on either side of it are concatenated to
1443 form a single token.
1444
1445 Consider a C program that interprets named commands. There probably needs
1446 to be a table of commands, perhaps an array of structures declared as
1447 follows:
1448
1449 @example
1450 struct command
1451 @{
1452 char *name;
1453 void (*function) ();
1454 @};
1455
1456 struct command commands[] =
1457 @{
1458 @{ "quit", quit_command@},
1459 @{ "help", help_command@},
1460 @dots{}
1461 @};
1462 @end example
1463
1464 It would be cleaner not to have to give each command name twice, once in
1465 the string constant and once in the function name. A macro which takes the
1466 name of a command as an argument can make this unnecessary. The string
1467 constant can be created with stringification, and the function name by
1468 concatenating the argument with @samp{_command}. Here is how it is done:
1469
1470 @example
1471 #define COMMAND(NAME) @{ #NAME, NAME ## _command @}
1472
1473 struct command commands[] =
1474 @{
1475 COMMAND (quit),
1476 COMMAND (help),
1477 @dots{}
1478 @};
1479 @end example
1480
1481 The usual case of concatenation is concatenating two names (or a name
1482 and a number) into a longer name. This isn't the only valid case.
1483 It is also possible to concatenate two numbers (or a number and a name,
1484 such as @samp{1.5} and @samp{e3}) into a number. Also, multi-character
1485 operators such as @samp{+=} can be formed by concatenation. However,
1486 two tokens that don't together form a valid token cannot be
1487 concatenated. For example, concatenation of @samp{x} on one side and
1488 @samp{+} on the other is not meaningful because those two tokens do not
1489 form a valid preprocessing token when concatenated. UNDEFINED
1490
1491 Keep in mind that the C preprocessor converts comments to whitespace
1492 before macros are even considered. Therefore, you cannot create a
1493 comment by concatenating @samp{/} and @samp{*}: the @samp{/*} sequence
1494 that starts a comment is not a token, but rather the beginning of a
1495 comment. You can freely use comments next to @samp{##} in a macro
1496 definition, or in arguments that will be concatenated, because the
1497 comments will be converted to spaces at first sight, and concatenation
1498 operates on tokens and so ignores whitespace.
1499
1500 @node Undefining, Redefining, Concatenation, Macros
1501 @subsection Undefining Macros
1502
1503 @cindex undefining macros
1504 To @dfn{undefine} a macro means to cancel its definition. This is done
1505 with the @samp{#undef} directive. @samp{#undef} is followed by the macro
1506 name to be undefined.
1507
1508 Like definition, undefinition occurs at a specific point in the source
1509 file, and it applies starting from that point. The name ceases to be a
1510 macro name, and from that point on it is treated by the preprocessor as
1511 if it had never been a macro name.
1512
1513 For example,
1514
1515 @example
1516 #define FOO 4
1517 x = FOO;
1518 #undef FOO
1519 x = FOO;
1520 @end example
1521
1522 @noindent
1523 expands into
1524
1525 @example
1526 x = 4;
1527
1528 x = FOO;
1529 @end example
1530
1531 @noindent
1532 In this example, @samp{FOO} had better be a variable or function as well
1533 as (temporarily) a macro, in order for the result of the expansion to be
1534 valid C code.
1535
1536 The same form of @samp{#undef} directive will cancel definitions with
1537 arguments or definitions that don't expect arguments. The @samp{#undef}
1538 directive has no effect when used on a name not currently defined as a
1539 macro.
1540
1541 @node Redefining, Poisoning, Undefining, Macros
1542 @subsection Redefining Macros
1543
1544 @cindex redefining macros
1545 @dfn{Redefining} a macro means defining (with @samp{#define}) a name that
1546 is already defined as a macro.
1547
1548 A redefinition is trivial if the new definition is transparently
1549 identical to the old one. You probably wouldn't deliberately write a
1550 trivial redefinition, but they can happen automatically when a header
1551 file is included more than once (@pxref{Header Files}), so they are
1552 accepted silently and without effect.
1553
1554 Nontrivial redefinition is considered likely to be an error, so it
1555 provokes a warning message from the preprocessor. However, sometimes it
1556 is useful to change the definition of a macro in mid-compilation. You
1557 can inhibit the warning by undefining the macro with @samp{#undef}
1558 before the second definition.
1559
1560 In order for a redefinition to be trivial, the parameter names must
1561 match and be in the same order, and the new replacement list must
1562 exactly match the one already in effect, with two possible exceptions:
1563
1564 @itemize @bullet
1565 @item
1566 Whitespace may be added or deleted at the beginning or the end of the
1567 replacement list. In a sense this is vacuous, since strictly such
1568 whitespace doesn't form part of the macro's expansion.
1569
1570 @item
1571 Between tokens in the expansion, any two forms of whitespace are
1572 considered equivalent. In particular, whitespace may not be eliminated
1573 entirely, nor may it be added where there previously wasn't any.
1574 @end itemize
1575
1576 Recall that a comment counts as whitespace.
1577
1578 As a particular case of the above, you may not redefine an object-like
1579 macro as a function-like macro, and vice-versa.
1580
1581 @node Poisoning, Macro Pitfalls, Redefining, Macros
1582 @subsection Poisoning Macros
1583 @cindex poisoning macros
1584 @findex #pragma GCC poison
1585
1586 Sometimes, there is an identifier that you want to remove completely
1587 from your program, and make sure that it never creeps back in. To
1588 enforce this, the @samp{#pragma GCC poison} directive can be used.
1589 @samp{#pragma GCC poison} is followed by a list of identifiers to
1590 poison, and takes effect for the rest of the source. You cannot
1591 @samp{#undef} a poisoned identifier or test to see if it's defined with
1592 @samp{#ifdef}.
1593
1594 For example,
1595
1596 @example
1597 #pragma GCC poison printf sprintf fprintf
1598 sprintf(some_string, "hello");
1599 @end example
1600
1601 @noindent
1602 will produce an error.
1603
1604 @node Macro Pitfalls,, Poisoning, Macros
1605 @subsection Pitfalls and Subtleties of Macros
1606 @cindex problems with macros
1607 @cindex pitfalls of macros
1608
1609 In this section we describe some special rules that apply to macros and
1610 macro expansion, and point out certain cases in which the rules have
1611 counterintuitive consequences that you must watch out for.
1612
1613 @menu
1614 * Misnesting:: Macros can contain unmatched parentheses.
1615 * Macro Parentheses:: Why apparently superfluous parentheses
1616 may be necessary to avoid incorrect grouping.
1617 * Swallow Semicolon:: Macros that look like functions
1618 but expand into compound statements.
1619 * Side Effects:: Unsafe macros that cause trouble when
1620 arguments contain side effects.
1621 * Self-Reference:: Macros whose definitions use the macros' own names.
1622 * Argument Prescan:: Arguments are checked for macro calls before they
1623 are substituted.
1624 * Cascaded Macros:: Macros whose definitions use other macros.
1625 * Newlines in Args:: Sometimes line numbers get confused.
1626 @end menu
1627
1628 @node Misnesting, Macro Parentheses, Macro Pitfalls, Macro Pitfalls
1629 @subsubsection Improperly Nested Constructs
1630
1631 Recall that when a macro is called with arguments, the arguments are
1632 substituted into the macro body and the result is checked, together with
1633 the rest of the input file, for more macro calls.
1634
1635 It is possible to piece together a macro call coming partially from the
1636 macro body and partially from the arguments. For example,
1637
1638 @example
1639 #define double(x) (2*(x))
1640 #define call_with_1(x) x(1)
1641 @end example
1642
1643 @noindent
1644 would expand @samp{call_with_1 (double)} into @samp{(2*(1))}.
1645
1646 Macro definitions do not have to have balanced parentheses. By writing
1647 an unbalanced open parenthesis in a macro body, it is possible to create
1648 a macro call that begins inside the macro body but ends outside of it.
1649 For example,
1650
1651 @example
1652 #define strange(file) fprintf (file, "%s %d",
1653 @dots{}
1654 strange(stderr) p, 35)
1655 @end example
1656
1657 @noindent
1658 This bizarre example expands to @samp{fprintf (stderr, "%s %d", p, 35)}!
1659
1660 @node Macro Parentheses, Swallow Semicolon, Misnesting, Macro Pitfalls
1661 @subsubsection Unintended Grouping of Arithmetic
1662 @cindex parentheses in macro bodies
1663
1664 You may have noticed that in most of the macro definition examples shown
1665 above, each occurrence of a macro argument name had parentheses around
1666 it. In addition, another pair of parentheses usually surround the
1667 entire macro definition. Here is why it is best to write macros that
1668 way.
1669
1670 Suppose you define a macro as follows,
1671
1672 @example
1673 #define ceil_div(x, y) (x + y - 1) / y
1674 @end example
1675
1676 @noindent
1677 whose purpose is to divide, rounding up. (One use for this operation is
1678 to compute how many @samp{int} objects are needed to hold a certain
1679 number of @samp{char} objects.) Then suppose it is used as follows:
1680
1681 @example
1682 a = ceil_div (b & c, sizeof (int));
1683 @end example
1684
1685 @noindent
1686 This expands into
1687
1688 @example
1689 a = (b & c + sizeof (int) - 1) / sizeof (int);
1690 @end example
1691
1692 @noindent
1693 which does not do what is intended. The operator-precedence rules of
1694 C make it equivalent to this:
1695
1696 @example
1697 a = (b & (c + sizeof (int) - 1)) / sizeof (int);
1698 @end example
1699
1700 @noindent
1701 What we want is this:
1702
1703 @example
1704 a = ((b & c) + sizeof (int) - 1)) / sizeof (int);
1705 @end example
1706
1707 @noindent
1708 Defining the macro as
1709
1710 @example
1711 #define ceil_div(x, y) ((x) + (y) - 1) / (y)
1712 @end example
1713
1714 @noindent
1715 provides the desired result.
1716
1717 Unintended grouping can result in another way. Consider @samp{sizeof
1718 ceil_div(1, 2)}. That has the appearance of a C expression that would
1719 compute the size of the type of @samp{ceil_div (1, 2)}, but in fact it
1720 means something very different. Here is what it expands to:
1721
1722 @example
1723 sizeof ((1) + (2) - 1) / (2)
1724 @end example
1725
1726 @noindent
1727 This would take the size of an integer and divide it by two. The
1728 precedence rules have put the division outside the @samp{sizeof} when it
1729 was intended to be inside.
1730
1731 Parentheses around the entire macro definition can prevent such
1732 problems. Here, then, is the recommended way to define @samp{ceil_div}:
1733
1734 @example
1735 #define ceil_div(x, y) (((x) + (y) - 1) / (y))
1736 @end example
1737
1738 @node Swallow Semicolon, Side Effects, Macro Parentheses, Macro Pitfalls
1739 @subsubsection Swallowing the Semicolon
1740
1741 @cindex semicolons (after macro calls)
1742 Often it is desirable to define a macro that expands into a compound
1743 statement. Consider, for example, the following macro, that advances a
1744 pointer (the argument @samp{p} says where to find it) across whitespace
1745 characters:
1746
1747 @example
1748 #define SKIP_SPACES(p, limit) \
1749 @{ register char *lim = (limit); \
1750 while (p != lim) @{ \
1751 if (*p++ != ' ') @{ \
1752 p--; break; @}@}@}
1753 @end example
1754
1755 @noindent
1756 Here backslash-newline is used to split the macro definition, which must
1757 be a single logical line, so that it resembles the way such C code would
1758 be laid out if not part of a macro definition.
1759
1760 A call to this macro might be @samp{SKIP_SPACES (p, lim)}. Strictly
1761 speaking, the call expands to a compound statement, which is a complete
1762 statement with no need for a semicolon to end it. However, since it
1763 looks like a function call, it minimizes confusion if you can use it
1764 like a function call, writing a semicolon afterward, as in
1765 @samp{SKIP_SPACES (p, lim);}
1766
1767 This can cause trouble before @samp{else} statements, because the
1768 semicolon is actually a null statement. Suppose you write
1769
1770 @example
1771 if (*p != 0)
1772 SKIP_SPACES (p, lim);
1773 else @dots{}
1774 @end example
1775
1776 @noindent
1777 The presence of two statements --- the compound statement and a null
1778 statement --- in between the @samp{if} condition and the @samp{else}
1779 makes invalid C code.
1780
1781 The definition of the macro @samp{SKIP_SPACES} can be altered to solve
1782 this problem, using a @samp{do @dots{} while} statement. Here is how:
1783
1784 @example
1785 #define SKIP_SPACES(p, limit) \
1786 do @{ register char *lim = (limit); \
1787 while (p != lim) @{ \
1788 if (*p++ != ' ') @{ \
1789 p--; break; @}@}@} \
1790 while (0)
1791 @end example
1792
1793 Now @samp{SKIP_SPACES (p, lim);} expands into
1794
1795 @example
1796 do @{@dots{}@} while (0);
1797 @end example
1798
1799 @noindent
1800 which is one statement.
1801
1802 @node Side Effects, Self-Reference, Swallow Semicolon, Macro Pitfalls
1803 @subsubsection Duplication of Side Effects
1804
1805 @cindex side effects (in macro arguments)
1806 @cindex unsafe macros
1807 Many C programs define a macro @samp{min}, for ``minimum'', like this:
1808
1809 @example
1810 #define min(X, Y) ((X) < (Y) ? (X) : (Y))
1811 @end example
1812
1813 When you use this macro with an argument containing a side effect,
1814 as shown here,
1815
1816 @example
1817 next = min (x + y, foo (z));
1818 @end example
1819
1820 @noindent
1821 it expands as follows:
1822
1823 @example
1824 next = ((x + y) < (foo (z)) ? (x + y) : (foo (z)));
1825 @end example
1826
1827 @noindent
1828 where @samp{x + y} has been substituted for @samp{X} and @samp{foo (z)}
1829 for @samp{Y}.
1830
1831 The function @samp{foo} is used only once in the statement as it appears
1832 in the program, but the expression @samp{foo (z)} has been substituted
1833 twice into the macro expansion. As a result, @samp{foo} might be called
1834 two times when the statement is executed. If it has side effects or if
1835 it takes a long time to compute, the results might not be what you
1836 intended. We say that @samp{min} is an @dfn{unsafe} macro.
1837
1838 The best solution to this problem is to define @samp{min} in a way that
1839 computes the value of @samp{foo (z)} only once. The C language offers
1840 no standard way to do this, but it can be done with GNU C extensions as
1841 follows:
1842
1843 @example
1844 #define min(X, Y) \
1845 (@{ typeof (X) __x = (X), __y = (Y); \
1846 (__x < __y) ? __x : __y; @})
1847 @end example
1848
1849 If you do not wish to use GNU C extensions, the only solution is to be
1850 careful when @emph{using} the macro @samp{min}. For example, you can
1851 calculate the value of @samp{foo (z)}, save it in a variable, and use
1852 that variable in @samp{min}:
1853
1854 @example
1855 #define min(X, Y) ((X) < (Y) ? (X) : (Y))
1856 @dots{}
1857 @{
1858 int tem = foo (z);
1859 next = min (x + y, tem);
1860 @}
1861 @end example
1862
1863 @noindent
1864 (where we assume that @samp{foo} returns type @samp{int}).
1865
1866 @node Self-Reference, Argument Prescan, Side Effects, Macro Pitfalls
1867 @subsubsection Self-Referential Macros
1868
1869 @cindex self-reference
1870 A @dfn{self-referential} macro is one whose name appears in its
1871 definition. A special feature of ISO Standard C is that the
1872 self-reference is not considered a macro call. It is passed into the
1873 preprocessor output unchanged.
1874
1875 Let's consider an example:
1876
1877 @example
1878 #define foo (4 + foo)
1879 @end example
1880
1881 @noindent
1882 where @samp{foo} is also a variable in your program.
1883
1884 Following the ordinary rules, each reference to @samp{foo} will expand
1885 into @samp{(4 + foo)}; then this will be rescanned and will expand into
1886 @samp{(4 + (4 + foo))}; and so on until it causes a fatal error (memory
1887 full) in the preprocessor.
1888
1889 However, the special rule about self-reference cuts this process short
1890 after one step, at @samp{(4 + foo)}. Therefore, this macro definition
1891 has the possibly useful effect of causing the program to add 4 to the
1892 value of @samp{foo} wherever @samp{foo} is referred to.
1893
1894 In most cases, it is a bad idea to take advantage of this feature. A
1895 person reading the program who sees that @samp{foo} is a variable will
1896 not expect that it is a macro as well. The reader will come across the
1897 identifier @samp{foo} in the program and think its value should be that
1898 of the variable @samp{foo}, whereas in fact the value is four greater.
1899
1900 The special rule for self-reference applies also to @dfn{indirect}
1901 self-reference. This is the case where a macro @var{x} expands to use a
1902 macro @samp{y}, and the expansion of @samp{y} refers to the macro
1903 @samp{x}. The resulting reference to @samp{x} comes indirectly from the
1904 expansion of @samp{x}, so it is a self-reference and is not further
1905 expanded. Thus, after
1906
1907 @example
1908 #define x (4 + y)
1909 #define y (2 * x)
1910 @end example
1911
1912 @noindent
1913 @samp{x} would expand into @samp{(4 + (2 * x))}. Clear?
1914
1915 Suppose @samp{y} is used elsewhere, not from the definition of @samp{x}.
1916 Then the use of @samp{x} in the expansion of @samp{y} is not a
1917 self-reference because @samp{x} is not ``in progress''. So it does
1918 expand. However, the expansion of @samp{x} contains a reference to
1919 @samp{y}, and that is an indirect self-reference now because @samp{y} is
1920 ``in progress''. The result is that @samp{y} expands to @samp{(2 * (4 +
1921 y))}.
1922
1923 This behavior is specified by the ISO C standard, so you may need to
1924 understand it.
1925
1926 @node Argument Prescan, Cascaded Macros, Self-Reference, Macro Pitfalls
1927 @subsubsection Separate Expansion of Macro Arguments
1928 @cindex expansion of arguments
1929 @cindex macro argument expansion
1930 @cindex prescan of macro arguments
1931
1932 We have explained that the expansion of a macro, including the substituted
1933 arguments, is re-scanned for macro calls to be expanded.
1934
1935 What really happens is more subtle: first each argument is scanned
1936 separately for macro calls. Then the resulting tokens are substituted
1937 into the macro body to produce the macro expansion, and the macro
1938 expansion is scanned again for macros to expand.
1939
1940 The result is that the arguments are scanned @emph{twice} to expand
1941 macro calls in them.
1942
1943 Most of the time, this has no effect. If the argument contained any
1944 macro calls, they are expanded during the first scan. The result
1945 therefore contains no macro calls, so the second scan does not change
1946 it. If the argument were substituted as given, with no prescan, the
1947 single remaining scan would find the same macro calls and produce the
1948 same results.
1949
1950 You might expect the double scan to change the results when a
1951 self-referential macro is used in an argument of another macro
1952 (@pxref{Self-Reference}): the self-referential macro would be expanded
1953 once in the first scan, and a second time in the second scan. However,
1954 this is not what happens. The self-references that do not expand in the
1955 first scan are marked so that they will not expand in the second scan
1956 either.
1957
1958 The prescan is not done when an argument is stringified or concatenated.
1959 Thus,
1960
1961 @example
1962 #define str(s) #s
1963 #define foo 4
1964 str (foo)
1965 @end example
1966
1967 @noindent
1968 expands to @samp{"foo"}. Once more, prescan has been prevented from
1969 having any noticeable effect.
1970
1971 More precisely, stringification and concatenation use the argument
1972 tokens as given without initially scanning for macros. The same
1973 argument would be used in expanded form if it is substituted elsewhere
1974 without stringification or concatenation.
1975
1976 @example
1977 #define str(s) #s lose(s)
1978 #define foo 4
1979 str (foo)
1980 @end example
1981
1982 expands to @samp{"foo" lose(4)}.
1983
1984 You might now ask, ``Why mention the prescan, if it makes no difference?
1985 And why not skip it and make the preprocessor faster?'' The answer is
1986 that the prescan does make a difference in three special cases:
1987
1988 @itemize @bullet
1989 @item
1990 Nested calls to a macro.
1991
1992 @item
1993 Macros that call other macros that stringify or concatenate.
1994
1995 @item
1996 Macros whose expansions contain unshielded commas.
1997 @end itemize
1998
1999 We say that @dfn{nested} calls to a macro occur when a macro's argument
2000 contains a call to that very macro. For example, if @samp{f} is a macro
2001 that expects one argument, @samp{f (f (1))} is a nested pair of calls to
2002 @samp{f}. The desired expansion is made by expanding @samp{f (1)} and
2003 substituting that into the definition of @samp{f}. The prescan causes
2004 the expected result to happen. Without the prescan, @samp{f (1)} itself
2005 would be substituted as an argument, and the inner use of @samp{f} would
2006 appear during the main scan as an indirect self-reference and would not
2007 be expanded. Here, the prescan cancels an undesirable side effect (in
2008 the medical, not computational, sense of the term) of the special rule
2009 for self-referential macros.
2010
2011 Prescan causes trouble in certain other cases of nested macro calls.
2012 Here is an example:
2013
2014 @example
2015 #define foo a,b
2016 #define bar(x) lose(x)
2017 #define lose(x) (1 + (x))
2018
2019 bar(foo)
2020 @end example
2021
2022 @noindent
2023 We would like @samp{bar(foo)} to turn into @samp{(1 + (foo))}, which
2024 would then turn into @samp{(1 + (a,b))}. Instead, @samp{bar(foo)}
2025 expands into @samp{lose(a,b)}, and you get an error because @code{lose}
2026 requires a single argument. In this case, the problem is easily solved
2027 by the same parentheses that ought to be used to prevent misnesting of
2028 arithmetic operations:
2029
2030 @example
2031 #define foo (a,b)
2032 #define bar(x) lose((x))
2033 @end example
2034
2035 The problem is more serious when the operands of the macro are not
2036 expressions; for example, when they are statements. Then parentheses
2037 are unacceptable because they would make for invalid C code:
2038
2039 @example
2040 #define foo @{ int a, b; @dots{} @}
2041 @end example
2042
2043 @noindent
2044 In GNU C you can shield the commas using the @samp{(@{@dots{}@})}
2045 construct which turns a compound statement into an expression:
2046
2047 @example
2048 #define foo (@{ int a, b; @dots{} @})
2049 @end example
2050
2051 Or you can rewrite the macro definition to avoid such commas:
2052
2053 @example
2054 #define foo @{ int a; int b; @dots{} @}
2055 @end example
2056
2057 There is also one case where prescan is useful. It is possible to use
2058 prescan to expand an argument and then stringify it --- if you use two
2059 levels of macros. Let's add a new macro @samp{xstr} to the example
2060 shown above:
2061
2062 @example
2063 #define xstr(s) str(s)
2064 #define str(s) #s
2065 #define foo 4
2066 xstr (foo)
2067 @end example
2068
2069 This expands into @samp{"4"}, not @samp{"foo"}. The reason for the
2070 difference is that the argument of @samp{xstr} is expanded at prescan
2071 (because @samp{xstr} does not specify stringification or concatenation
2072 of the argument). The result of prescan then forms the argument for
2073 @samp{str}. @samp{str} uses its argument without prescan because it
2074 performs stringification; but it cannot prevent or undo the prescanning
2075 already done by @samp{xstr}.
2076
2077 @node Cascaded Macros, Newlines in Args, Argument Prescan, Macro Pitfalls
2078 @subsubsection Cascaded Use of Macros
2079
2080 @cindex cascaded macros
2081 @cindex macro body uses macro
2082 A @dfn{cascade} of macros is when one macro's body contains a reference
2083 to another macro. This is very common practice. For example,
2084
2085 @example
2086 #define BUFSIZE 1020
2087 #define TABLESIZE BUFSIZE
2088 @end example
2089
2090 This is not at all the same as defining @samp{TABLESIZE} to be
2091 @samp{1020}. The @samp{#define} for @samp{TABLESIZE} uses exactly the
2092 body you specify --- in this case, @samp{BUFSIZE} --- and does not check
2093 to see whether it too is the name of a macro.
2094
2095 It's only when you @emph{use} @samp{TABLESIZE} that the result of its
2096 expansion is checked for more macro names.
2097
2098 This makes a difference if you change the definition of @samp{BUFSIZE}
2099 at some point in the source file. @samp{TABLESIZE}, defined as shown,
2100 will always expand using the definition of @samp{BUFSIZE} that is
2101 currently in effect:
2102
2103 @example
2104 #define BUFSIZE 1020
2105 #define TABLESIZE BUFSIZE
2106 #undef BUFSIZE
2107 #define BUFSIZE 37
2108 @end example
2109
2110 @noindent
2111 Now @samp{TABLESIZE} expands (in two stages) to @samp{37}. (The
2112 @samp{#undef} is to prevent any warning about the nontrivial
2113 redefinition of @code{BUFSIZE}.)
2114
2115 @node Newlines in Args,, Cascaded Macros, Macro Pitfalls
2116 @subsection Newlines in Macro Arguments
2117 @cindex newlines in macro arguments
2118
2119 The invocation of a function-like macro can extend over many logical
2120 lines. The ISO C standard requires that newlines within a macro
2121 invocation be treated as ordinary whitespace. This means that when the
2122 expansion of a function-like macro replaces its invocation, it appears
2123 on the same line as the macro name did. Thus line numbers emitted by
2124 the compiler or debugger refer to the line the invocation started on,
2125 which might be different to the line containing the argument causing the
2126 problem.
2127
2128 Here is an example illustrating this:
2129
2130 @example
2131 #define ignore_second_arg(a,b,c) a; c
2132
2133 ignore_second_arg (foo (),
2134 ignored (),
2135 syntax error);
2136 @end example
2137
2138 @noindent
2139 The syntax error triggered by the tokens @samp{syntax error} results in
2140 an error message citing line three --- the line of ignore_second_arg ---
2141 even though the problematic code comes from line five.
2142
2143 @node Conditionals, Assertions, Macros, Top
2144 @section Conditionals
2145
2146 @cindex conditionals
2147 In a macro processor, a @dfn{conditional} is a directive that allows a
2148 part of the program to be ignored during compilation, on some
2149 conditions. In the C preprocessor, a conditional can test either an
2150 arithmetic expression or whether a name is defined as a macro.
2151
2152 A conditional in the C preprocessor resembles in some ways an @samp{if}
2153 statement in C, but it is important to understand the difference between
2154 them. The condition in an @samp{if} statement is tested during the
2155 execution of your program. Its purpose is to allow your program to
2156 behave differently from run to run, depending on the data it is
2157 operating on. The condition in a preprocessing conditional directive is
2158 tested when your program is compiled. Its purpose is to allow different
2159 code to be included in the program depending on the situation at the
2160 time of compilation.
2161
2162 @menu
2163 * Uses: Conditional Uses. What conditionals are for.
2164 * Syntax: Conditional Syntax. How conditionals are written.
2165 * Deletion: Deleted Code. Making code into a comment.
2166 * Macros: Conditionals-Macros. Why conditionals are used with macros.
2167 * Errors: #error Directive. Detecting inconsistent compilation parameters.
2168 @end menu
2169
2170 @node Conditional Uses
2171 @subsection Why Conditionals are Used
2172
2173 Generally there are three kinds of reason to use a conditional.
2174
2175 @itemize @bullet
2176 @item
2177 A program may need to use different code depending on the machine or
2178 operating system it is to run on. In some cases the code for one
2179 operating system may be erroneous on another operating system; for
2180 example, it might refer to library routines that do not exist on the
2181 other system. When this happens, it is not enough to avoid executing
2182 the invalid code: merely having it in the program makes it impossible to
2183 link the program and run it. With a preprocessing conditional, the
2184 offending code can be effectively excised from the program when it is
2185 not valid.
2186
2187 @item
2188 You may want to be able to compile the same source file into two
2189 different programs. Sometimes the difference between the programs is
2190 that one makes frequent time-consuming consistency checks on its
2191 intermediate data, or prints the values of those data for debugging,
2192 while the other does not.
2193
2194 @item
2195 A conditional whose condition is always false is a good way to exclude
2196 code from the program but keep it as a sort of comment for future
2197 reference.
2198 @end itemize
2199
2200 Most simple programs that are intended to run on only one machine will
2201 not need to use preprocessing conditionals.
2202
2203 @node Conditional Syntax
2204 @subsection Syntax of Conditionals
2205
2206 @findex #if
2207 A conditional in the C preprocessor begins with a @dfn{conditional
2208 directive}: @samp{#if}, @samp{#ifdef} or @samp{#ifndef}.
2209 @xref{Conditionals-Macros}, for information on @samp{#ifdef} and
2210 @samp{#ifndef}; only @samp{#if} is explained here.
2211
2212 @menu
2213 * If: #if Directive. Basic conditionals using @samp{#if} and @samp{#endif}.
2214 * Else: #else Directive. Including some text if the condition fails.
2215 * Elif: #elif Directive. Testing several alternative possibilities.
2216 @end menu
2217
2218 @node #if Directive
2219 @subsubsection The @samp{#if} Directive
2220
2221 The @samp{#if} directive in its simplest form consists of
2222
2223 @example
2224 #if @var{expression}
2225 @var{controlled text}
2226 #endif /* @var{expression} */
2227 @end example
2228
2229 The comment following the @samp{#endif} is not required, but it is a
2230 good practice because it helps people match the @samp{#endif} to the
2231 corresponding @samp{#if}. Such comments should always be used, except
2232 in short conditionals that are not nested. In fact, you can put
2233 anything at all after the @samp{#endif} and it will be ignored by the
2234 GNU C preprocessor, but only comments are acceptable in ISO Standard C@.
2235
2236 @var{expression} is a C expression of integer type, subject to stringent
2237 restrictions. It may contain
2238
2239 @itemize @bullet
2240 @item
2241 Integer constants, which are all regarded as @code{long} or
2242 @code{unsigned long}.
2243
2244 @item
2245 Character constants, which are interpreted according to the character
2246 set and conventions of the machine and operating system on which the
2247 preprocessor is running. The GNU C preprocessor uses the C data type
2248 @samp{char} for these character constants; therefore, whether some
2249 character codes are negative is determined by the C compiler used to
2250 compile the preprocessor. If it treats @samp{char} as signed, then
2251 character codes large enough to set the sign bit will be considered
2252 negative; otherwise, no character code is considered negative.
2253
2254 @item
2255 Arithmetic operators for addition, subtraction, multiplication,
2256 division, bitwise operations, shifts, comparisons, and logical
2257 operations (@samp{&&} and @samp{||}). The latter two obey the usual
2258 short-circuiting rules of standard C.
2259
2260 @item
2261 Identifiers that are not macros, which are all treated as zero(!).
2262
2263 @item
2264 Macro calls. All macro calls in the expression are expanded before
2265 actual computation of the expression's value begins.
2266 @end itemize
2267
2268 Note that @samp{sizeof} operators and @code{enum}-type values are not
2269 allowed. @code{enum}-type values, like all other identifiers that are
2270 not taken as macro calls and expanded, are treated as zero.
2271
2272 The @var{controlled text} inside of a conditional can include
2273 preprocessing directives. Then the directives inside the conditional
2274 are obeyed only if that branch of the conditional succeeds. The text
2275 can also contain other conditional groups. However, the @samp{#if} and
2276 @samp{#endif} directives must balance.
2277
2278 @node #else Directive
2279 @subsubsection The @samp{#else} Directive
2280
2281 @findex #else
2282 The @samp{#else} directive can be added to a conditional to provide
2283 alternative text to be used if the condition is false. This is what
2284 it looks like:
2285
2286 @example
2287 #if @var{expression}
2288 @var{text-if-true}
2289 #else /* Not @var{expression} */
2290 @var{text-if-false}
2291 #endif /* Not @var{expression} */
2292 @end example
2293
2294 If @var{expression} is nonzero, and thus the @var{text-if-true} is
2295 active, then @samp{#else} acts like a failing conditional and the
2296 @var{text-if-false} is ignored. Conversely, if the @samp{#if}
2297 conditional fails, the @var{text-if-false} is considered included.
2298
2299 @node #elif Directive
2300 @subsubsection The @samp{#elif} Directive
2301
2302 @findex #elif
2303 One common case of nested conditionals is used to check for more than two
2304 possible alternatives. For example, you might have
2305
2306 @example
2307 #if X == 1
2308 @dots{}
2309 #else /* X != 1 */
2310 #if X == 2
2311 @dots{}
2312 #else /* X != 2 */
2313 @dots{}
2314 #endif /* X != 2 */
2315 #endif /* X != 1 */
2316 @end example
2317
2318 Another conditional directive, @samp{#elif}, allows this to be
2319 abbreviated as follows:
2320
2321 @example
2322 #if X == 1
2323 @dots{}
2324 #elif X == 2
2325 @dots{}
2326 #else /* X != 2 and X != 1*/
2327 @dots{}
2328 #endif /* X != 2 and X != 1*/
2329 @end example
2330
2331 @samp{#elif} stands for ``else if''. Like @samp{#else}, it goes in the
2332 middle of a @samp{#if}-@samp{#endif} pair and subdivides it; it does not
2333 require a matching @samp{#endif} of its own. Like @samp{#if}, the
2334 @samp{#elif} directive includes an expression to be tested.
2335
2336 The text following the @samp{#elif} is processed only if the original
2337 @samp{#if}-condition failed and the @samp{#elif} condition succeeds.
2338 More than one @samp{#elif} can go in the same @samp{#if}-@samp{#endif}
2339 group. Then the text after each @samp{#elif} is processed only if the
2340 @samp{#elif} condition succeeds after the original @samp{#if} and any
2341 previous @samp{#elif} directives within it have failed. @samp{#else} is
2342 equivalent to @samp{#elif 1}, and @samp{#else} is allowed after any
2343 number of @samp{#elif} directives, but @samp{#elif} may not follow
2344 @samp{#else}.
2345
2346 @node Deleted Code
2347 @subsection Keeping Deleted Code for Future Reference
2348 @cindex commenting out code
2349
2350 If you replace or delete a part of the program but want to keep the old
2351 code around as a comment for future reference, the easy way to do this
2352 is to put @samp{#if 0} before it and @samp{#endif} after it. This is
2353 better than using comment delimiters @samp{/*} and @samp{*/} since those
2354 won't work if the code already contains comments (C comments do not
2355 nest).
2356
2357 This works even if the code being turned off contains conditionals, but
2358 they must be entire conditionals (balanced @samp{#if} and @samp{#endif}).
2359
2360 Conversely, do not use @samp{#if 0} for comments which are not C code.
2361 Use the comment delimiters @samp{/*} and @samp{*/} instead. The
2362 interior of @samp{#if 0} must consist of complete tokens; in particular,
2363 single-quote characters must balance. Comments often contain unbalanced
2364 single-quote characters (known in English as apostrophes). These
2365 confuse @samp{#if 0}. They do not confuse @samp{/*}.
2366
2367 @node Conditionals-Macros
2368 @subsection Conditionals and Macros
2369
2370 Conditionals are useful in connection with macros or assertions, because
2371 those are the only ways that an expression's value can vary from one
2372 compilation to another. A @samp{#if} directive whose expression uses no
2373 macros or assertions is equivalent to @samp{#if 1} or @samp{#if 0}; you
2374 might as well determine which one, by computing the value of the
2375 expression yourself, and then simplify the program.
2376
2377 For example, here is a conditional that tests the expression
2378 @samp{BUFSIZE == 1020}, where @samp{BUFSIZE} must be a macro.
2379
2380 @example
2381 #if BUFSIZE == 1020
2382 printf ("Large buffers!\n");
2383 #endif /* BUFSIZE is large */
2384 @end example
2385
2386 (Programmers often wish they could test the size of a variable or data
2387 type in @samp{#if}, but this does not work. The preprocessor does not
2388 understand @code{sizeof}, or typedef names, or even the type keywords
2389 such as @code{int}.)
2390
2391 @findex defined
2392 The special operator @samp{defined} is used in @samp{#if} and
2393 @samp{#elif} expressions to test whether a certain name is defined as a
2394 macro. Either @samp{defined @var{name}} or @samp{defined (@var{name})}
2395 is an expression whose value is 1 if @var{name} is defined as macro at
2396 the current point in the program, and 0 otherwise. To the
2397 @samp{defined} operator it makes no difference what the definition of
2398 the macro is; all that matters is whether there is a definition. Thus,
2399 for example,@refill
2400
2401 @example
2402 #if defined (vax) || defined (ns16000)
2403 @end example
2404
2405 @noindent
2406 would succeed if either of the names @samp{vax} and @samp{ns16000} is
2407 defined as a macro. You can test the same condition using assertions
2408 (@pxref{Assertions}), like this:
2409
2410 @example
2411 #if #cpu (vax) || #cpu (ns16000)
2412 @end example
2413
2414 If a macro is defined and later undefined with @samp{#undef}, subsequent
2415 use of the @samp{defined} operator returns 0, because the name is no
2416 longer defined. If the macro is defined again with another
2417 @samp{#define}, @samp{defined} will recommence returning 1.
2418
2419 If the @samp{defined} operator appears as a result of a macro expansion,
2420 the C standard says the behavior is undefined. GNU cpp treats it as a
2421 genuine @samp{defined} operator and evaluates it normally. It will warn
2422 wherever your code uses this feature if you use the command-line option
2423 @samp{-pedantic}, since other compilers may handle it differently.
2424
2425 @findex #ifdef
2426 @findex #ifndef
2427 Conditionals that test whether a single macro is defined are very common,
2428 so there are two special short conditional directives for this case.
2429
2430 @table @code
2431 @item #ifdef @var{name}
2432 is equivalent to @samp{#if defined (@var{name})}.
2433
2434 @item #ifndef @var{name}
2435 is equivalent to @samp{#if ! defined (@var{name})}.
2436 @end table
2437
2438 Macro definitions can vary between compilations for several reasons.
2439
2440 @itemize @bullet
2441 @item
2442 Some macros are predefined on each kind of machine. For example, on a
2443 Vax, the name @samp{vax} is a predefined macro. On other machines, it
2444 would not be defined.
2445
2446 @item
2447 Many more macros are defined by system header files. Different systems
2448 and machines define different macros, or give them different values. It
2449 is useful to test these macros with conditionals to avoid using a system
2450 feature on a machine where it is not implemented.
2451
2452 @item
2453 Macros are a common way of allowing users to customize a program for
2454 different machines or applications. For example, the macro
2455 @samp{BUFSIZE} might be defined in a configuration file for your program
2456 that is included as a header file in each source file. You would use
2457 @samp{BUFSIZE} in a preprocessing conditional in order to generate
2458 different code depending on the chosen configuration.
2459
2460 @item
2461 Macros can be defined or undefined with @samp{-D} and @samp{-U} command
2462 options when you compile the program. You can arrange to compile the
2463 same source file into two different programs by choosing a macro name to
2464 specify which program you want, writing conditionals to test whether or
2465 how this macro is defined, and then controlling the state of the macro
2466 with compiler command options. @xref{Invocation}.
2467 @end itemize
2468
2469 @ifinfo
2470 Assertions are usually predefined, but can be defined with preprocessor
2471 directives or command-line options.
2472 @end ifinfo
2473
2474 @node #error Directive
2475 @subsection The @samp{#error} and @samp{#warning} Directives
2476
2477 @findex #error
2478 The directive @samp{#error} causes the preprocessor to report a fatal
2479 error. The tokens forming the rest of the line following @samp{#error}
2480 are used as the error message, and not macro-expanded. Internal
2481 whitespace sequences are each replaced with a single space. The line
2482 must consist of complete tokens.
2483
2484 You would use @samp{#error} inside of a conditional that detects a
2485 combination of parameters which you know the program does not properly
2486 support. For example, if you know that the program will not run
2487 properly on a Vax, you might write
2488
2489 @smallexample
2490 @group
2491 #ifdef __vax__
2492 #error "Won't work on Vaxen. See comments at get_last_object."
2493 #endif
2494 @end group
2495 @end smallexample
2496
2497 @noindent
2498 @xref{Nonstandard Predefined}, for why this works.
2499
2500 If you have several configuration parameters that must be set up by
2501 the installation in a consistent way, you can use conditionals to detect
2502 an inconsistency and report it with @samp{#error}. For example,
2503
2504 @smallexample
2505 #if HASH_TABLE_SIZE % 2 == 0 || HASH_TABLE_SIZE % 3 == 0 \
2506 || HASH_TABLE_SIZE % 5 == 0
2507 #error HASH_TABLE_SIZE should not be divisible by a small prime
2508 #endif
2509 @end smallexample
2510
2511 @findex #warning
2512 The directive @samp{#warning} is like the directive @samp{#error}, but
2513 causes the preprocessor to issue a warning and continue preprocessing.
2514 The tokens following @samp{#warning} are used as the warning message,
2515 and not macro-expanded.
2516
2517 You might use @samp{#warning} in obsolete header files, with a message
2518 directing the user to the header file which should be used instead.
2519
2520 @node Assertions, Line Control, Conditionals, Top
2521 @section Assertions
2522 @cindex assertions
2523 @dfn{Assertions} are a more systematic alternative to macros in writing
2524 conditionals to test what sort of computer or system the compiled
2525 program will run on. Assertions are usually predefined, but you can
2526 define them with preprocessing directives or command-line options.
2527
2528 @cindex predicates
2529 The macros traditionally used to describe the type of target are not
2530 classified in any way according to which question they answer; they may
2531 indicate a hardware architecture, a particular hardware model, an
2532 operating system, a particular version of an operating system, or
2533 specific configuration options. These are jumbled together in a single
2534 namespace. In contrast, each assertion consists of a named question and
2535 an answer. The question is usually called the @dfn{predicate}. An
2536 assertion looks like this:
2537
2538 @example
2539 #@var{predicate} (@var{answer})
2540 @end example
2541
2542 @noindent
2543 You must use a properly formed identifier for @var{predicate}. The
2544 value of @var{answer} can be any sequence of words; all characters are
2545 significant except for leading and trailing whitespace, and differences
2546 in internal whitespace sequences are ignored. (This is similar to the
2547 rules governing macro redefinition.) Thus, @samp{x + y} is different
2548 from @samp{x+y} but equivalent to @samp{ x + y }. @samp{)} is not
2549 allowed in an answer.
2550
2551 @cindex testing predicates
2552 Here is a conditional to test whether the answer @var{answer} is asserted
2553 for the predicate @var{predicate}:
2554
2555 @example
2556 #if #@var{predicate} (@var{answer})
2557 @end example
2558
2559 @noindent
2560 There may be more than one answer asserted for a given predicate. If
2561 you omit the answer, you can test whether @emph{any} answer is asserted
2562 for @var{predicate}:
2563
2564 @example
2565 #if #@var{predicate}
2566 @end example
2567
2568 @findex #system
2569 @findex #machine
2570 @findex #cpu
2571 Most of the time, the assertions you test will be predefined assertions.
2572 GNU C provides three predefined predicates: @code{system}, @code{cpu},
2573 and @code{machine}. @code{system} is for assertions about the type of
2574 software, @code{cpu} describes the type of computer architecture, and
2575 @code{machine} gives more information about the computer. For example,
2576 on a GNU system, the following assertions would be true:
2577
2578 @example
2579 #system (gnu)
2580 #system (mach)
2581 #system (mach 3)
2582 #system (mach 3.@var{subversion})
2583 #system (hurd)
2584 #system (hurd @var{version})
2585 @end example
2586
2587 @noindent
2588 and perhaps others. The alternatives with
2589 more or less version information let you ask more or less detailed
2590 questions about the type of system software.
2591
2592 On a Unix system, you would find @code{#system (unix)} and perhaps one of:
2593 @code{#system (aix)}, @code{#system (bsd)}, @code{#system (hpux)},
2594 @code{#system (lynx)}, @code{#system (mach)}, @code{#system (posix)},
2595 @code{#system (svr3)}, @code{#system (svr4)}, or @code{#system (xpg4)}
2596 with possible version numbers following.
2597
2598 Other values for @code{system} are @code{#system (mvs)}
2599 and @code{#system (vms)}.
2600
2601 @strong{Portability note:} Many Unix C compilers provide only one answer
2602 for the @code{system} assertion: @code{#system (unix)}, if they support
2603 assertions at all. This is less than useful.
2604
2605 An assertion with a multi-word answer is completely different from several
2606 assertions with individual single-word answers. For example, the presence
2607 of @code{system (mach 3.0)} does not mean that @code{system (3.0)} is true.
2608 It also does not directly imply @code{system (mach)}, but in GNU C, that
2609 last will normally be asserted as well.
2610
2611 The current list of possible assertion values for @code{cpu} is:
2612 @code{#cpu (a29k)}, @code{#cpu (alpha)}, @code{#cpu (arm)}, @code{#cpu
2613 (clipper)}, @code{#cpu (convex)}, @code{#cpu (elxsi)}, @code{#cpu
2614 (tron)}, @code{#cpu (h8300)}, @code{#cpu (i370)}, @code{#cpu (i386)},
2615 @code{#cpu (i860)}, @code{#cpu (i960)}, @code{#cpu (m68k)}, @code{#cpu
2616 (m88k)}, @code{#cpu (mips)}, @code{#cpu (ns32k)}, @code{#cpu (hppa)},
2617 @code{#cpu (pyr)}, @code{#cpu (ibm032)}, @code{#cpu (rs6000)},
2618 @code{#cpu (sh)}, @code{#cpu (sparc)}, @code{#cpu (spur)}, @code{#cpu
2619 (tahoe)}, @code{#cpu (vax)}, @code{#cpu (we32000)}.
2620
2621 @findex #assert
2622 You can create assertions within a C program using @samp{#assert}, like
2623 this:
2624
2625 @example
2626 #assert @var{predicate} (@var{answer})
2627 @end example
2628
2629 @noindent
2630 (Note the absence of a @samp{#} before @var{predicate}.)
2631
2632 @cindex unassert
2633 @cindex assertions, undoing
2634 @cindex retracting assertions
2635 @findex #unassert
2636 Each time you do this, you assert a new true answer for @var{predicate}.
2637 Asserting one answer does not invalidate previously asserted answers;
2638 they all remain true. The only way to remove an answer is with
2639 @samp{#unassert}. @samp{#unassert} has the same syntax as
2640 @samp{#assert}. You can also remove all answers to a @var{predicate}
2641 like this:
2642
2643 @example
2644 #unassert @var{predicate}
2645 @end example
2646
2647 You can also add or cancel assertions using command options
2648 when you run @code{gcc} or @code{cpp}. @xref{Invocation}.
2649
2650 @node Line Control, Other Directives, Assertions, Top
2651 @section Combining Source Files
2652
2653 @cindex line control
2654 One of the jobs of the C preprocessor is to inform the C compiler of where
2655 each line of C code came from: which source file and which line number.
2656
2657 C code can come from multiple source files if you use @samp{#include};
2658 both @samp{#include} and the use of conditionals and macros can cause
2659 the line number of a line in the preprocessor output to be different
2660 from the line's number in the original source file. You will appreciate
2661 the value of making both the C compiler (in error messages) and symbolic
2662 debuggers such as GDB use the line numbers in your source file.
2663
2664 The C preprocessor builds on this feature by offering a directive by
2665 which you can control the feature explicitly. This is useful when a
2666 file for input to the C preprocessor is the output from another program
2667 such as the @code{bison} parser generator, which operates on another
2668 file that is the true source file. Parts of the output from
2669 @code{bison} are generated from scratch, other parts come from a
2670 standard parser file. The rest are copied nearly verbatim from the
2671 source file, but their line numbers in the @code{bison} output are not
2672 the same as their original line numbers. Naturally you would like
2673 compiler error messages and symbolic debuggers to know the original
2674 source file and line number of each line in the @code{bison} input.
2675
2676 @findex #line
2677 @code{bison} arranges this by writing @samp{#line} directives into the output
2678 file. @samp{#line} is a directive that specifies the original line number
2679 and source file name for subsequent input in the current preprocessor input
2680 file. @samp{#line} has three variants:
2681
2682 @table @code
2683 @item #line @var{linenum}
2684 Here @var{linenum} is a decimal integer constant. This specifies that
2685 the line number of the following line of input, in its original source file,
2686 was @var{linenum}.
2687
2688 @item #line @var{linenum} @var{filename}
2689 Here @var{linenum} is a decimal integer constant and @var{filename} is a
2690 string constant. This specifies that the following line of input came
2691 originally from source file @var{filename} and its line number there was
2692 @var{linenum}. Keep in mind that @var{filename} is not just a file
2693 name; it is surrounded by double-quote characters so that it looks like
2694 a string constant.
2695
2696 @item #line @var{anything else}
2697 @var{anything else} is checked for macro calls, which are expanded.
2698 The result should be a decimal integer constant followed optionally
2699 by a string constant, as described above.
2700 @end table
2701
2702 @samp{#line} directives alter the results of the @samp{__FILE__} and
2703 @samp{__LINE__} predefined macros from that point on. @xref{Standard
2704 Predefined}.
2705
2706 The output of the preprocessor (which is the input for the rest of the
2707 compiler) contains directives that look much like @samp{#line}
2708 directives. They start with just @samp{#} instead of @samp{#line}, but
2709 this is followed by a line number and file name as in @samp{#line}.
2710 @xref{Output}.
2711
2712 @node Other Directives, Output, Line Control, Top
2713 @section Miscellaneous Preprocessing Directives
2714
2715 This section describes some additional, rarely used, preprocessing
2716 directives.
2717
2718 @findex #pragma
2719 @findex #pragma GCC
2720
2721 The ISO standard specifies that the effect of the @samp{#pragma}
2722 directive is implementation-defined. The GNU C preprocessor recognizes
2723 some pragmas, and passes unrecognized ones through to the preprocessor
2724 output, so they are available to the compilation pass.
2725
2726 In line with the C99 standard, which introduces a STDC namespace for C99
2727 pragmas, the preprocessor introduces a GCC namespace for GCC pragmas.
2728 Supported GCC preprocessor pragmas are of the form @samp{#pragma GCC
2729 ...}. For backwards compatibility previously supported pragmas are also
2730 recognized without the @samp{GCC} prefix, however that use is
2731 deprecated. Pragmas that are already deprecated are not recognized with
2732 a @samp{GCC} prefix.
2733
2734 @findex #pragma GCC dependency
2735 The @samp{#pragma GCC dependency} allows you to check the relative dates
2736 of the current file and another file. If the other file is more recent
2737 than the current file, a warning is issued. This is useful if the
2738 include file is derived from the other file, and should be regenerated.
2739 The other file is searched for using the normal include search path.
2740 Optional trailing text can be used to give more information in the
2741 warning message.
2742
2743 @smallexample
2744 #pragma GCC dependency "parse.y"
2745 #pragma GCC dependency "/usr/include/time.h" rerun /path/to/fixincludes
2746 @end smallexample
2747
2748 @findex _Pragma
2749 The C99 standard also introduces the @samp{_Pragma} operator. The
2750 syntax is @code{_Pragma (string-literal)}, where @samp{string-literal}
2751 can be either a normal or wide-character string literal. It is
2752 destringized, by replacing all @samp{\\} with a single @samp{\} and all
2753 @samp{\"} with a @samp{"}. The result is then processed as if it had
2754 appeared as the right hand side of a @samp{#pragma} directive. For
2755 example,
2756
2757 @smallexample
2758 _Pragma ("GCC dependency \"parse.y\"")
2759 @end smallexample
2760
2761 @noindent has the same effect as @samp{#pragma GCC dependency
2762 "parse.y"}. The same effect could be achieved using macros, for example
2763
2764 @smallexample
2765 #define DO_PRAGMA(x) _Pragma (#x)
2766 DO_PRAGMA (GCC dependency "parse.y")
2767 @end smallexample
2768
2769 The standard is unclear on where a @samp{_Pragma} operator can appear.
2770 The preprocessor accepts it even within a preprocessing conditional
2771 directive like @samp{#if}. To be safe, you are probably best keeping it
2772 out of directives other than @samp{#define}, and putting it on a line of
2773 its own.
2774
2775 @findex #ident
2776 The @samp{#ident} directive is supported for compatibility with certain
2777 other systems. It is followed by a line of text. On some systems, the
2778 text is copied into a special place in the object file; on most systems,
2779 the text is ignored and this directive has no effect. Typically
2780 @samp{#ident} is only used in header files supplied with those systems
2781 where it is meaningful.
2782
2783 @cindex null directive
2784 The @dfn{null directive} consists of a @samp{#} followed by a newline,
2785 with only whitespace (including comments) in between. A null directive
2786 is understood as a preprocessing directive but has no effect on the
2787 preprocessor output. The primary significance of the existence of the
2788 null directive is that an input line consisting of just a @samp{#} will
2789 produce no output, rather than a line of output containing just a
2790 @samp{#}. Supposedly some old C programs contain such lines.
2791
2792 @node Output, Implementation, Other Directives, Top
2793 @section C Preprocessor Output
2794
2795 @cindex output format
2796 The output from the C preprocessor looks much like the input, except
2797 that all preprocessing directive lines have been replaced with blank
2798 lines and all comments with spaces.
2799
2800 The ISO standard specifies that it is implementation defined whether a
2801 preprocessor preserves whitespace between tokens, or replaces it with
2802 e.g. a single space. In the GNU C preprocessor, whitespace between
2803 tokens is collapsed to become a single space, with the exception that
2804 the first token on a non-directive line is preceded with sufficient
2805 spaces that it appears in the same column in the preprocessed output
2806 that it appeared in in the original source file. This is so the output
2807 is easy to read. @xref{Unreliable Features}.
2808
2809 Source file name and line number information is conveyed by lines
2810 of the form
2811
2812 @example
2813 # @var{linenum} @var{filename} @var{flags}
2814 @end example
2815
2816 @noindent
2817 which are inserted as needed into the output (but never within a string
2818 or character constant), and in place of long sequences of empty lines.
2819 Such a line means that the following line originated in file
2820 @var{filename} at line @var{linenum}.
2821
2822 After the file name comes zero or more flags, which are @samp{1},
2823 @samp{2}, @samp{3}, or @samp{4}. If there are multiple flags, spaces
2824 separate them. Here is what the flags mean:
2825
2826 @table @samp
2827 @item 1
2828 This indicates the start of a new file.
2829 @item 2
2830 This indicates returning to a file (after having included another file).
2831 @item 3
2832 This indicates that the following text comes from a system header file,
2833 so certain warnings should be suppressed.
2834 @item 4
2835 This indicates that the following text should be treated as C@.
2836 @c maybe cross reference NO_IMPLICIT_EXTERN_C
2837 @end table
2838
2839 @node Implementation, Unreliable Features, Output, Top
2840 @section Implementation-defined Behavior and Implementation Limits
2841 @cindex implementation limits
2842 @cindex implementation-defined behavior
2843
2844 The ISO C standard mandates that implementations document various
2845 aspects of preprocessor behavior. You should try to avoid undue
2846 reliance on behaviour described here, as it is possible that it will
2847 change subtly in future implementations.
2848
2849 @itemize @bullet
2850
2851 @item The mapping of physical source file multi-byte characters to the
2852 execution character set.
2853
2854 Currently, GNU cpp only supports character sets that are strict supersets
2855 of ASCII, and performs no translation of characters.
2856
2857 @item Non-empty sequences of whitespace characters.
2858
2859 Each whitespace sequence is not preserved, but collapsed to a single
2860 space. For aesthetic reasons, the first token on each non-directive
2861 line of output is preceded with sufficient spaces that it appears in the
2862 same column as it did in the original source file.
2863
2864 @item The numeric value of character constants in preprocessor expressions.
2865
2866 The preprocessor interprets character constants in preprocessing
2867 directives on the host machine. Expressions outside preprocessing
2868 directives are compiled to be interpreted on the target machine. In the
2869 normal case of a native compiler, these two environments are the same
2870 and so character constants will be evaluated identically in both cases.
2871 However, in the case of a cross compiler, the values may be different.
2872
2873 Multi-character character constants are interpreted a character at a
2874 time, shifting the previous result left by the number of bits per
2875 character on the host, and adding the new character. For example, 'ab'
2876 on an 8-bit host would be interpreted as 'a' * 256 + 'b'. If there are
2877 more characters in the constant than can fit in the widest native
2878 integer type on the host, usually a @samp{long}, the behavior is
2879 undefined.
2880
2881 Evaluation of wide character constants is not properly implemented yet.
2882
2883 @item Source file inclusion.
2884
2885 For a discussion on how the preprocessor locates header files,
2886 @pxref{Include Operation}.
2887
2888 @item Interpretation of the filename resulting from a macro-expanded
2889 @samp{#include} directive.
2890
2891 If the macro expands to a string literal, the @samp{#include} directive
2892 is processed as if the string had been specified directly. Otherwise,
2893 the macro must expand to a token stream beginning with a @samp{<} token
2894 and including a @samp{>} token. In this case, the tokens between the
2895 @samp{<} and the first @samp{>} are combined to form the filename to be
2896 included. Any whitespace between tokens is reduced to a single space;
2897 then any space after the initial @samp{<} is retained, but a trailing
2898 space before the closing @samp{>} is ignored.
2899
2900 In either case, if any excess tokens remain, an error occurs and the
2901 directive is not processed.
2902
2903 @item Treatment of a @samp{#pragma} directive that after macro-expansion
2904 results in a standard pragma.
2905
2906 The pragma is processed as if it were a normal standard pragma.
2907
2908 @end itemize
2909
2910 The following documents internal limits of GNU cpp.
2911
2912 @itemize @bullet
2913
2914 @item Nesting levels of @samp{#include} files.
2915
2916 We impose an arbitrary limit of 200 levels, to avoid runaway recursion.
2917 The standard requires at least 15 levels.
2918
2919 @item Nesting levels of conditional inclusion.
2920
2921 The C standard mandates this be at least 63. The GNU C preprocessor
2922 is limited only by available memory.
2923
2924 @item Levels of parenthesised expressions within a full expression.
2925
2926 The C standard requires this to be at least 63. In preprocessor
2927 conditional expressions it is limited only by available memory.
2928
2929 @item Significant initial characters in an identifier or macro name.
2930
2931 The preprocessor treats all characters as significant. The C standard
2932 requires only that the first 63 be significant.
2933
2934 @item Number of macros simultaneously defined in a single translation unit.
2935
2936 The standard requires at least 4095 be possible; GNU cpp is limited only
2937 by available memory.
2938
2939 @item Number of parameters in a macro definition and arguments in a macro call.
2940
2941 We allow USHRT_MAX, which is normally 65,535, and above the minimum of
2942 127 required by the standard.
2943
2944 @item Number of characters on a logical source line.
2945
2946 The C standard requires a minimum of 4096 be permitted. GNU cpp places
2947 no limits on this, but you may get incorrect column numbers reported in
2948 diagnostics for lines longer than 65,535 characters.
2949
2950 @end itemize
2951
2952 @node Unreliable Features, Invocation, Implementation, Top
2953 @section Undefined Behavior and Deprecated Features
2954 @cindex undefined behavior
2955 @cindex deprecated features
2956
2957 This section details GNU C preprocessor behavior that is subject to
2958 change or deprecated. You are @emph{strongly advised} to write your
2959 software so it does not rely on anything described here; future versions
2960 of the preprocessor may subtly change such behavior or even remove the
2961 feature altogether.
2962
2963 Preservation of the form of whitespace between tokens is unlikely to
2964 change from current behavior (@ref{Output}), but you are advised not
2965 to rely on it.
2966
2967 The following are undocumented and subject to change:-
2968
2969 @itemize @bullet
2970
2971 @item Precedence of ## operators with respect to each other
2972
2973 Whether a sequence of ## operators is evaluated left-to-right,
2974 right-to-left or indeed in a consistent direction at all is not
2975 specified. An example of where this might matter is pasting the
2976 arguments @samp{1}, @samp{e} and @samp{-2}. This would be fine for
2977 left-to-right pasting, but right-to-left pasting would produce an
2978 invalid token @samp{e-2}. It is possible to guarantee precedence by
2979 suitable use of nested macros.
2980
2981 @item Precedence of # operator with respect to the ## operator
2982
2983 Which of these two operators is evaluated first is not specified.
2984
2985 @end itemize
2986
2987 The following features are in flux and should not be used in portable
2988 code:
2989
2990 @itemize @bullet
2991
2992 @item Optional argument when invoking rest argument macros
2993
2994 As an extension, GCC permits you to omit the variable arguments entirely
2995 when you use a variable argument macro. This works whether or not you
2996 give the variable argument a name. For example, the two macro
2997 invocations in the example below expand to the same thing:
2998
2999 @smallexample
3000 #define debug(format, ...) printf (format, __VA_ARGS__)
3001 debug("string"); /* Not permitted by C standard. */
3002 debug("string",); /* OK. */
3003 @end smallexample
3004
3005 This extension will be preserved, but the special behavior of @samp{##}
3006 in this context has changed in the past and may change again in the
3007 future.
3008
3009 @item ## swallowing preceding text in rest argument macros
3010
3011 Formerly, in a macro expansion, if @samp{##} appeared before a variable
3012 arguments parameter, and the set of tokens specified for that argument in
3013 the macro invocation was empty, previous versions of the GNU C
3014 preprocessor would back up and remove the preceding sequence of
3015 non-whitespace characters (@strong{not} the preceding token). This
3016 extension is in direct conflict with the 1999 C standard and has been
3017 drastically pared back.
3018
3019 In the current version of the preprocessor, if @samp{##} appears between
3020 a comma and a variable arguments parameter, and the variable argument is
3021 omitted entirely, the comma will be removed from the expansion. If the
3022 variable argument is empty, or the token before @samp{##} is not a
3023 comma, then @samp{##} behaves as a normal token paste.
3024
3025 Portable code should avoid this extension at all costs.
3026
3027 @end itemize
3028
3029 The following features are deprecated and will likely be removed at some
3030 point in the future:-
3031
3032 @itemize @bullet
3033
3034 @item Attempting to paste two tokens which together do not form a valid
3035 preprocessing token
3036
3037 The preprocessor currently warns about this and outputs the two tokens
3038 adjacently, which is probably the behavior the programmer intends. It
3039 may not work in future, though.
3040
3041 Most of the time, when you get this warning, you will find that @samp{##}
3042 is being used superstitiously, to guard against whitespace appearing
3043 between two tokens. It is almost always safe to delete the @samp{##}.
3044
3045 @findex #pragma once
3046 @item #pragma once
3047
3048 This pragma was once used to tell the preprocessor that it need not
3049 include a file more than once. It is now obsolete and should not be
3050 used at all.
3051
3052 @item #pragma poison
3053
3054 This pragma has been superseded by @samp{#pragma GCC poison}.
3055 @xref{Poisoning}.
3056
3057 @item Multi-line string literals
3058
3059 The preprocessor currently allows raw newlines in string literals. This
3060 extension is deprecated and will be removed in a future version of GCC.
3061 The preprocessor already forbids such string literals in all directives
3062 apart from #define.
3063
3064 Instead, make use of ISO C concatenation of adjacent string literals, or
3065 use @samp{\n} followed by an escaped newline.
3066
3067 @item Preprocessing things which are not C
3068
3069 The C preprocessor is intended to be used only with C, C++, and
3070 Objective C source code. In the past, it has been abused as a general
3071 text processor. It will choke on input which is not lexically valid C;
3072 for example, apostrophes will be interpreted as the beginning of
3073 character constants, and cause errors. Also, you cannot rely on it
3074 preserving characteristics of the input which are not significant to
3075 C-family languages. For instance, if a Makefile is preprocessed, all
3076 the hard tabs will be lost, and the Makefile will not work.
3077
3078 Having said that, you can often get away with using cpp on things which
3079 are not C. Other Algol-ish programming languages are often safe
3080 (Pascal, Ada, ...) and so is assembly, with caution. @samp{-traditional}
3081 mode is much more permissive, and can safely be used with e.g. Fortran.
3082 Many of the problems go away if you write C or C++ style comments
3083 instead of native language comments, and if you avoid elaborate macros.
3084
3085 Wherever possible, you should use a preprocessor geared to the language
3086 you are writing in. Modern versions of the GNU assembler have macro
3087 facilities. Most high level programming languages have their own
3088 conditional compilation and inclusion mechanism. If all else fails,
3089 try a true general text processor, such as @xref{Top, M4, , m4, GNU `m4'}.
3090
3091 @end itemize
3092
3093 @node Invocation, Concept Index, Unreliable Features, Top
3094 @section Invoking the C Preprocessor
3095 @cindex invocation of the preprocessor
3096
3097 Most often when you use the C preprocessor you will not have to invoke it
3098 explicitly: the C compiler will do so automatically. However, the
3099 preprocessor is sometimes useful on its own.
3100
3101 @ignore
3102 @c man begin SYNOPSIS
3103 cpp [@samp{-P}] [@samp{-C}] [@samp{-gcc}] [@samp{-traditional}]
3104 [@samp{-undef}] [@samp{-trigraphs}] [@samp{-pedantic}]
3105 [@samp{-W}@var{warn}...] [@samp{-I}@var{dir}...]
3106 [@samp{-D}@var{macro}[=@var{defn}]...] [@samp{-U}@var{macro}]
3107 [@samp{-A}@var{predicate}(@var{answer})]
3108 [@samp{-M}|@samp{-MM}][@samp{-MG}][@samp{-MF}@var{filename}]
3109 [@samp{-MP}][@samp{-MQ}@var{target}...][@samp{-MT}@var{target}...]
3110 [@samp{-x} @var{language}] [@samp{-std=}@var{standard}]
3111 @var{infile} @var{outfile}
3112
3113 Only the most useful options are listed here; see below for the remainder.
3114 @c man end
3115 @c man begin SEEALSO
3116 gcc(1), as(1), ld(1), and the Info entries for @file{cpp}, @file{gcc}, and
3117 @file{binutils}.
3118 @c man end
3119 @end ignore
3120
3121 @c man begin OPTIONS
3122 The C preprocessor expects two file names as arguments, @var{infile} and
3123 @var{outfile}. The preprocessor reads @var{infile} together with any
3124 other files it specifies with @samp{#include}. All the output generated
3125 by the combined input files is written in @var{outfile}.
3126
3127 Either @var{infile} or @var{outfile} may be @samp{-}, which as
3128 @var{infile} means to read from standard input and as @var{outfile}
3129 means to write to standard output. Also, if either file is omitted, it
3130 means the same as if @samp{-} had been specified for that file.
3131
3132 @cindex options
3133 Here is a table of command options accepted by the C preprocessor.
3134 These options can also be given when compiling a C program; they are
3135 passed along automatically to the preprocessor when it is invoked by the
3136 compiler.
3137
3138 @table @samp
3139 @item -P
3140 @findex -P
3141 Inhibit generation of @samp{#}-lines with line-number information in the
3142 output from the preprocessor. This might be useful when running the
3143 preprocessor on something that is not C code and will be sent to a
3144 program which might be confused by the @samp{#}-lines. @xref{Output}.
3145
3146 @item -C
3147 @findex -C
3148 Do not discard comments. All comments are passed through to the output
3149 file, except for comments in processed directives, which are deleted
3150 along with the directive. Comments appearing in the expansion list of a
3151 macro will be preserved, and appear in place wherever the macro is
3152 invoked.
3153
3154 You should be prepared for side effects when using @samp{-C}; it causes
3155 the preprocessor to treat comments as tokens in their own right. For
3156 example, macro redefinitions that were trivial when comments were
3157 replaced by a single space might become significant when comments are
3158 retained. Also, comments appearing at the start of what would be a
3159 directive line have the effect of turning that line into an ordinary
3160 source line, since the first token on the line is no longer a @samp{#}.
3161
3162 @item -traditional
3163 @findex -traditional
3164 Try to imitate the behavior of old-fashioned C, as opposed to ISO C@.
3165
3166 @itemize @bullet
3167 @item
3168 Traditional macro expansion pays no attention to single-quote or
3169 double-quote characters; macro argument symbols are replaced by the
3170 argument values even when they appear within apparent string or
3171 character constants.
3172
3173 @item
3174 Traditionally, it is permissible for a macro expansion to end in the
3175 middle of a string or character constant. The constant continues into
3176 the text surrounding the macro call.
3177
3178 @item
3179 However, traditionally the end of the line terminates a string or
3180 character constant, with no error.
3181
3182 @item
3183 In traditional C, a comment is equivalent to no text at all. (In ISO
3184 C, a comment counts as whitespace.)
3185
3186 @item
3187 Traditional C does not have the concept of a ``preprocessing number''.
3188 It considers @samp{1.0e+4} to be three tokens: @samp{1.0e}, @samp{+},
3189 and @samp{4}.
3190
3191 @item
3192 A macro is not suppressed within its own definition, in traditional C@.
3193 Thus, any macro that is used recursively inevitably causes an error.
3194
3195 @item
3196 The character @samp{#} has no special meaning within a macro definition
3197 in traditional C@.
3198
3199 @item
3200 In traditional C, the text at the end of a macro expansion can run
3201 together with the text after the macro call, to produce a single token.
3202 (This is impossible in ISO C@.)
3203
3204 @item
3205 None of the GNU extensions to the preprocessor are available in
3206 @samp{-traditional} mode.
3207
3208 @end itemize
3209
3210 @cindex Fortran
3211 @cindex unterminated
3212 Use the @samp{-traditional} option when preprocessing Fortran code, so
3213 that single-quotes and double-quotes within Fortran comment lines (which
3214 are generally not recognized as such by the preprocessor) do not cause
3215 diagnostics about unterminated character or string constants.
3216
3217 However, this option does not prevent diagnostics about unterminated
3218 comments when a C-style comment appears to start, but not end, within
3219 Fortran-style commentary.
3220
3221 So, the following Fortran comment lines are accepted with
3222 @samp{-traditional}:
3223
3224 @smallexample
3225 C This isn't an unterminated character constant
3226 C Neither is "20000000000, an octal constant
3227 C in some dialects of Fortran
3228 @end smallexample
3229
3230 However, this type of comment line will likely produce a diagnostic, or
3231 at least unexpected output from the preprocessor, due to the
3232 unterminated comment:
3233
3234 @smallexample
3235 C Some Fortran compilers accept /* as starting
3236 C an inline comment.
3237 @end smallexample
3238
3239 @cindex g77
3240 Note that @code{g77} automatically supplies the @samp{-traditional}
3241 option when it invokes the preprocessor. However, a future version of
3242 @code{g77} might use a different, more-Fortran-aware preprocessor in
3243 place of @code{cpp}.
3244
3245 @item -trigraphs
3246 @findex -trigraphs
3247 Process ISO standard trigraph sequences. These are three-character
3248 sequences, all starting with @samp{??}, that are defined by ISO C to
3249 stand for single characters. For example, @samp{??/} stands for
3250 @samp{\}, so @samp{'??/n'} is a character constant for a newline. By
3251 default, GCC ignores trigraphs, but in standard-conforming modes it
3252 converts them. See the @samp{-std} option.
3253
3254 The nine trigraph sequences are
3255 @table @samp
3256 @item ??(
3257 -> @samp{[}
3258
3259 @item ??)
3260 -> @samp{]}
3261
3262 @item ??<
3263 -> @samp{@{}
3264
3265 @item ??>
3266 -> @samp{@}}
3267
3268 @item ??=
3269 -> @samp{#}
3270
3271 @item ??/
3272 -> @samp{\}
3273
3274 @item ??'
3275 -> @samp{^}
3276
3277 @item ??!
3278 -> @samp{|}
3279
3280 @item ??-
3281 -> @samp{~}
3282
3283 @end table
3284
3285 Trigraph support is not popular, so many compilers do not implement it
3286 properly. Portable code should not rely on trigraphs being either
3287 converted or ignored.
3288
3289 @item -pedantic
3290 @findex -pedantic
3291 Issue warnings required by the ISO C standard in certain cases such
3292 as when text other than a comment follows @samp{#else} or @samp{#endif}.
3293
3294 @item -pedantic-errors
3295 @findex -pedantic-errors
3296 Like @samp{-pedantic}, except that errors are produced rather than
3297 warnings.
3298
3299 @item -Wcomment
3300 @findex -Wcomment
3301 @itemx -Wcomments
3302 (Both forms have the same effect).
3303 Warn whenever a comment-start sequence @samp{/*} appears in a @samp{/*}
3304 comment, or whenever a backslash-newline appears in a @samp{//} comment.
3305
3306 @item -Wtrigraphs
3307 @findex -Wtrigraphs
3308 Warn if any trigraphs are encountered. This option used to take effect
3309 only if @samp{-trigraphs} was also specified, but now works
3310 independently. Warnings are not given for trigraphs within comments, as
3311 we feel this is obnoxious.
3312
3313 @item -Wwhite-space
3314 @findex -Wwhite-space
3315 Warn about possible white space confusion, e.g. white space between a
3316 backslash and a newline.
3317
3318 @item -Wall
3319 @findex -Wall
3320 Requests @samp{-Wcomment}, @samp{-Wtrigraphs}, and @samp{-Wwhite-space}
3321 (but not @samp{-Wtraditional} or @samp{-Wundef}).
3322
3323 @item -Wtraditional
3324 @findex -Wtraditional
3325 Warn about certain constructs that behave differently in traditional and
3326 ISO C@.
3327
3328 @itemize @bullet
3329 @item
3330 Macro parameters that appear within string literals in the macro body.
3331 In traditional C macro replacement takes place within string literals,
3332 but does not in ISO C.
3333
3334 @item
3335 In traditional C, some preprocessor directives did not exist.
3336 Traditional preprocessors would only consider a line to be a directive
3337 if the @samp{#} appeared in column 1 on the line. Therefore
3338 @samp{-Wtraditional} warns about directives that traditional C
3339 understands but would ignore because the @samp{#} does not appear as the
3340 first character on the line. It also suggests you hide directives like
3341 @samp{#pragma} not understood by traditional C by indenting them. Some
3342 traditional implementations would not recognise @samp{#elif}, so it
3343 suggests avoiding it altogether.
3344
3345 @item
3346 A function-like macro that appears without arguments.
3347
3348 @item
3349 The unary plus operator.
3350
3351 @item
3352 The `U' integer constant suffix. (Traditonal C does support the `L'
3353 suffix on integer constants.) Note, these suffixes appear in macros
3354 defined in the system headers of most modern systems, e.g. the _MIN/_MAX
3355 macros in limits.h. Use of these macros can lead to spurious warnings
3356 as they do not necessarily reflect whether the code in question is any
3357 less portable to traditional C given that suitable backup definitions
3358 are provided.
3359 @end itemize
3360
3361 @item -Wundef
3362 @findex -Wundef
3363 Warn if an undefined identifier is evaluated in an @samp{#if} directive.
3364
3365 @item -I @var{directory}
3366 @findex -I
3367 Add the directory @var{directory} to the head of the list of
3368 directories to be searched for header files (@pxref{Include Syntax}).
3369 This can be used to override a system header file, substituting your
3370 own version, since these directories are searched before the system
3371 header file directories. If you use more than one @samp{-I} option,
3372 the directories are scanned in left-to-right order; the standard
3373 system directories come after.
3374
3375 @item -I-
3376 Any directories specified with @samp{-I} options before the @samp{-I-}
3377 option are searched only for the case of @samp{#include "@var{file}"};
3378 they are not searched for @samp{#include <@var{file}>}.
3379
3380 If additional directories are specified with @samp{-I} options after
3381 the @samp{-I-}, these directories are searched for all @samp{#include}
3382 directives.
3383
3384 In addition, the @samp{-I-} option inhibits the use of the current
3385 directory as the first search directory for @samp{#include "@var{file}"}.
3386 Therefore, the current directory is searched only if it is requested
3387 explicitly with @samp{-I.}. Specifying both @samp{-I-} and @samp{-I.}
3388 allows you to control precisely which directories are searched before
3389 the current one and which are searched after.
3390
3391 @item -nostdinc
3392 @findex -nostdinc
3393 Do not search the standard system directories for header files.
3394 Only the directories you have specified with @samp{-I} options
3395 (and the current directory, if appropriate) are searched.
3396
3397 By using both @samp{-nostdinc} and @samp{-I-}, you can limit the include-file
3398 search path to only those directories you specify explicitly.
3399
3400 @item -nostdinc++
3401 @findex -nostdinc++
3402 Do not search for header files in the C++-specific standard directories,
3403 but do still search the other standard directories. (This option is
3404 used when building the C++ library.)
3405
3406 @item -remap
3407 @findex -remap
3408 When searching for a header file in a directory, remap file names if a
3409 file named @file{header.gcc} exists in that directory. This can be used
3410 to work around limitations of file systems with file name restrictions.
3411 The @file{header.gcc} file should contain a series of lines with two
3412 tokens on each line: the first token is the name to map, and the second
3413 token is the actual name to use.
3414
3415 @item -D @var{name}
3416 @findex -D
3417 Predefine @var{name} as a macro, with definition @samp{1}.
3418
3419 @item -D @var{name}=@var{definition}
3420 Predefine @var{name} as a macro, with definition @var{definition}.
3421 There are no restrictions on the contents of @var{definition}, but if
3422 you are invoking the preprocessor from a shell or shell-like program you
3423 may need to use the shell's quoting syntax to protect characters such as
3424 spaces that have a meaning in the shell syntax. If you use more than
3425 one @samp{-D} for the same @var{name}, the rightmost definition takes
3426 effect.
3427
3428 Any @samp{-D} and @samp{-U} options on the command line are processed in
3429 order, and always before @samp{-imacros @var{file}}, regardless of the
3430 order in which they are written.
3431
3432 @item -U @var{name}
3433 @findex -U
3434 Do not predefine @var{name}.
3435
3436 Any @samp{-D} and @samp{-U} options on the command line are processed in
3437 order, and always before @samp{-imacros @var{file}}, regardless of the
3438 order in which they are written.
3439
3440 @item -undef
3441 @findex -undef
3442 Do not predefine any nonstandard macros.
3443
3444 @item -gcc
3445 @findex -gcc
3446 Define the macros @var{__GNUC__}, @var{__GNUC_MINOR__} and
3447 @var{__GNUC_PATCHLEVEL__}. These are defined automatically when you use
3448 @samp{gcc -E}; you can turn them off in that case with @samp{-no-gcc}.
3449
3450 @item -A @var{predicate}=@var{answer}
3451 @findex -A
3452 Make an assertion with the predicate @var{predicate} and answer
3453 @var{answer}. This form is preferred to the older form @samp{-A
3454 @var{predicate}(@var{answer})}, which is still supported, because
3455 it does not use shell special characters. @xref{Assertions}.
3456
3457 @item -A -@var{predicate}=@var{answer}
3458 Disable an assertion with the predicate @var{predicate} and answer
3459 @var{answer}. Specifying no predicate, by @samp{-A-} or @samp{-A -},
3460 disables all predefined assertions and all assertions preceding it on
3461 the command line; and also undefines all predefined macros and all
3462 macros preceding it on the command line.
3463
3464 @item -dM
3465 @findex -dM
3466 Instead of outputting the result of preprocessing, output a list of
3467 @samp{#define} directives for all the macros defined during the
3468 execution of the preprocessor, including predefined macros. This gives
3469 you a way of finding out what is predefined in your version of the
3470 preprocessor; assuming you have no file @samp{foo.h}, the command
3471
3472 @example
3473 touch foo.h; cpp -dM foo.h
3474 @end example
3475
3476 @noindent
3477 will show the values of any predefined macros.
3478
3479 @item -dD
3480 @findex -dD
3481 Like @samp{-dM} except in two respects: it does @emph{not} include the
3482 predefined macros, and it outputs @emph{both} the @samp{#define}
3483 directives and the result of preprocessing. Both kinds of output go to
3484 the standard output file.
3485
3486 @item -dN
3487 @findex -dN
3488 Like @samp{-dD}, but emit only the macro names, not their expansions.
3489
3490 @item -dI
3491 @findex -dI
3492 Output @samp{#include} directives in addition to the result of
3493 preprocessing.
3494
3495 @item -M
3496 @findex -M
3497 Instead of outputting the result of preprocessing, output a rule
3498 suitable for @code{make} describing the dependencies of the main source
3499 file. The preprocessor outputs one @code{make} rule containing the
3500 object file name for that source file, a colon, and the names of all the
3501 included files, including those coming from @samp{-include} or
3502 @samp{-imacros} command line options. Unless specified explicitly (with
3503 @samp{-MT} or @samp{-MQ}), the object file name consists of the basename
3504 of the source file with any suffix replaced with object file suffix.
3505 If there are many included files
3506 then the rule is split into several lines using @samp{\}-newline.
3507
3508 @item -MM
3509 @findex -MM
3510 Like @samp{-M}, but mention only the files included with @samp{#include
3511 "@var{file}"} or with @samp{-include} or @samp{-imacros} command line
3512 options. System header files included with @samp{#include <@var{file}>}
3513 are omitted.
3514
3515 @item -MF @var{file}
3516 @findex -MF
3517 When used with @samp{-M} or @samp{-MM}, specifies a file to write the
3518 dependencies to. This allows the preprocessor to write the preprocessed
3519 file to stdout normally. If no @samp{-MF} switch is given, CPP sends
3520 the rules to stdout and suppresses normal preprocessed output.
3521
3522 @item -MG
3523 @findex -MG
3524 When used with @samp{-M} or @samp{-MM}, @samp{-MG} says to treat missing
3525 header files as generated files and assume they live in the same
3526 directory as the source file. It suppresses preprocessed output, as a
3527 missing header file is ordinarily an error.
3528
3529 This feature is used in automatic updating of makefiles.
3530
3531 @item -MP
3532 @findex -MP
3533 This option instructs CPP to add a phony target for each dependency
3534 other than the main file, causing each to depend on nothing. These
3535 dummy rules work around errors @code{make} gives if you remove header
3536 files without updating the @code{Makefile} to match.
3537
3538 This is typical output:-
3539
3540 @smallexample
3541 /tmp/test.o: /tmp/test.c /tmp/test.h
3542
3543 /tmp/test.h:
3544 @end smallexample
3545
3546 @item -MQ @var{target}
3547 @item -MT @var{target}
3548 @findex -MQ
3549 @findex -MT
3550 By default CPP uses the main file name, including any path, and appends
3551 the object suffix, normally ``.o'', to it to obtain the name of the
3552 target for dependency generation. With @samp{-MT} you can specify a
3553 target yourself, overriding the default one.
3554
3555 If you want multiple targets, you can specify them as a single argument
3556 to @samp{-MT}, or use multiple @samp{-MT} options.
3557
3558 The targets you specify are output in the order they appear on the
3559 command line. @samp{-MQ} is identical to @samp{-MT}, except that the
3560 target name is quoted for Make, but with @samp{-MT} it isn't. For
3561 example, -MT '$(objpfx)foo.o' gives
3562
3563 @smallexample
3564 $(objpfx)foo.o: /tmp/foo.c
3565 @end smallexample
3566
3567 but -MQ '$(objpfx)foo.o' gives
3568
3569 @smallexample
3570 $$(objpfx)foo.o: /tmp/foo.c
3571 @end smallexample
3572
3573 The default target is automatically quoted, as if it were given with
3574 @samp{-MQ}.
3575
3576 @item -H
3577 @findex -H
3578 Print the name of each header file used, in addition to other normal
3579 activities.
3580
3581 @item -imacros @var{file}
3582 @findex -imacros
3583 Process @var{file} as input, discarding the resulting output, before
3584 processing the regular input file. Because the output generated from
3585 @var{file} is discarded, the only effect of @samp{-imacros @var{file}}
3586 is to make the macros defined in @var{file} available for use in the
3587 main input.
3588
3589 @item -include @var{file}
3590 @findex -include
3591 Process @var{file} as input, and include all the resulting output,
3592 before processing the regular input file.
3593
3594 @item -idirafter @var{dir}
3595 @findex -idirafter
3596 @cindex second include path
3597 Add the directory @var{dir} to the second include path, marking it as a
3598 system directory. The directories on the second include path are searched
3599 when a header file is not found in any of the directories in the main
3600 include path (the one that @samp{-I} adds to).
3601
3602 @item -iprefix @var{prefix}
3603 @findex -iprefix
3604 Specify @var{prefix} as the prefix for subsequent @samp{-iwithprefix}
3605 options. If the prefix represents a directory, you should include the
3606 final @samp{/}.
3607
3608 @item -iwithprefix @var{dir}
3609 @findex -iwithprefix
3610 Add a directory to the second include path, marking it as a system
3611 directory. The directory's name is made by concatenating @var{prefix}
3612 and @var{dir}, where @var{prefix} was specified previously with
3613 @samp{-iprefix}.
3614
3615 @item -isystem @var{dir}
3616 @findex -isystem
3617 Add a directory to the beginning of the second include path, marking it
3618 as a system directory, so that it gets the same special treatment as
3619 is applied to the standard system directories. @xref{System Headers}.
3620
3621 @item -x c
3622 @itemx -x c++
3623 @itemx -x objective-c
3624 @itemx -x assembler-with-cpp
3625 @findex -x c
3626 @findex -x objective-c
3627 @findex -x assembler-with-cpp
3628 Specify the source language: C, C++, Objective-C, or assembly. This has
3629 nothing to do with standards conformance or extensions; it merely
3630 selects which base syntax to expect. If you give none of these options,
3631 cpp will deduce the language from the extension of the source file:
3632 @samp{.c}, @samp{.cc}, @samp{.m}, or @samp{.S}. Some other common
3633 extensions for C++ and assembly are also recognized. If cpp does not
3634 recognize the extension, it will treat the file as C; this is the most
3635 generic mode.
3636
3637 @strong{Note:} Previous versions of cpp accepted a @samp{-lang} option
3638 which selected both the language and the standards conformance level.
3639 This option has been removed, because it conflicts with the @samp{-l}
3640 option.
3641
3642 @item -std=@var{standard}
3643 @itemx -ansi
3644 @findex -std
3645 @findex -ansi
3646 Specify the standard to which the code should conform. Currently cpp
3647 only knows about the standards for C; other language standards will be
3648 added in the future.
3649
3650 @var{standard}
3651 may be one of:
3652 @table @code
3653 @item iso9899:1990
3654 @itemx c89
3655 The ISO C standard from 1990. @samp{c89} is the customary shorthand for
3656 this version of the standard.
3657
3658 The @samp{-ansi} option is equivalent to @samp{-std=c89}.
3659
3660 @item iso9899:199409
3661 The 1990 C standard, as amended in 1994.
3662
3663 @item iso9899:1999
3664 @itemx c99
3665 @itemx iso9899:199x
3666 @itemx c9x
3667 The revised ISO C standard, published in December 1999. Before
3668 publication, this was known as C9X.
3669
3670 @item gnu89
3671 The 1990 C standard plus GNU extensions. This is the default.
3672
3673 @item gnu99
3674 @itemx gnu9x
3675 The 1999 C standard plus GNU extensions.
3676 @end table
3677
3678 @item -ftabstop=NUMBER
3679 @findex -ftabstop
3680 Set the distance between tab stops. This helps the preprocessor
3681 report correct column numbers in warnings or errors, even if tabs appear
3682 on the line. Values less than 1 or greater than 100 are ignored. The
3683 default is 8.
3684
3685 @item -$
3686 @findex -$
3687 Forbid the use of @samp{$} in identifiers. The C standard allows
3688 implementations to define extra characters that can appear in
3689 identifiers. By default the GNU C preprocessor permits @samp{$}, a
3690 common extension.
3691 @end table
3692 @c man end
3693
3694 @node Concept Index, Index, Invocation, Top
3695 @unnumbered Concept Index
3696 @printindex cp
3697
3698 @node Index,, Concept Index, Top
3699 @unnumbered Index of Directives, Macros and Options
3700 @printindex fn
3701
3702 @contents
3703 @bye