Traditional-lexical-analysis.html 6.7 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155
  1. <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
  2. <html>
  3. <!-- Copyright (C) 1987-2017 Free Software Foundation, Inc.
  4. Permission is granted to copy, distribute and/or modify this document
  5. under the terms of the GNU Free Documentation License, Version 1.3 or
  6. any later version published by the Free Software Foundation. A copy of
  7. the license is included in the
  8. section entitled "GNU Free Documentation License".
  9. This manual contains no Invariant Sections. The Front-Cover Texts are
  10. (a) (see below), and the Back-Cover Texts are (b) (see below).
  11. (a) The FSF's Front-Cover Text is:
  12. A GNU Manual
  13. (b) The FSF's Back-Cover Text is:
  14. You have freedom to copy and modify this GNU Manual, like GNU
  15. software. Copies published by the Free Software Foundation raise
  16. funds for GNU development. -->
  17. <!-- Created by GNU Texinfo 5.2, http://www.gnu.org/software/texinfo/ -->
  18. <head>
  19. <title>The C Preprocessor: Traditional lexical analysis</title>
  20. <meta name="description" content="The C Preprocessor: Traditional lexical analysis">
  21. <meta name="keywords" content="The C Preprocessor: Traditional lexical analysis">
  22. <meta name="resource-type" content="document">
  23. <meta name="distribution" content="global">
  24. <meta name="Generator" content="makeinfo">
  25. <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
  26. <link href="index.html#Top" rel="start" title="Top">
  27. <link href="Index-of-Directives.html#Index-of-Directives" rel="index" title="Index of Directives">
  28. <link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
  29. <link href="Traditional-Mode.html#Traditional-Mode" rel="up" title="Traditional Mode">
  30. <link href="Traditional-macros.html#Traditional-macros" rel="next" title="Traditional macros">
  31. <link href="Traditional-Mode.html#Traditional-Mode" rel="prev" title="Traditional Mode">
  32. <style type="text/css">
  33. <!--
  34. a.summary-letter {text-decoration: none}
  35. blockquote.smallquotation {font-size: smaller}
  36. div.display {margin-left: 3.2em}
  37. div.example {margin-left: 3.2em}
  38. div.indentedblock {margin-left: 3.2em}
  39. div.lisp {margin-left: 3.2em}
  40. div.smalldisplay {margin-left: 3.2em}
  41. div.smallexample {margin-left: 3.2em}
  42. div.smallindentedblock {margin-left: 3.2em; font-size: smaller}
  43. div.smalllisp {margin-left: 3.2em}
  44. kbd {font-style:oblique}
  45. pre.display {font-family: inherit}
  46. pre.format {font-family: inherit}
  47. pre.menu-comment {font-family: serif}
  48. pre.menu-preformatted {font-family: serif}
  49. pre.smalldisplay {font-family: inherit; font-size: smaller}
  50. pre.smallexample {font-size: smaller}
  51. pre.smallformat {font-family: inherit; font-size: smaller}
  52. pre.smalllisp {font-size: smaller}
  53. span.nocodebreak {white-space:nowrap}
  54. span.nolinebreak {white-space:nowrap}
  55. span.roman {font-family:serif; font-weight:normal}
  56. span.sansserif {font-family:sans-serif; font-weight:normal}
  57. ul.no-bullet {list-style: none}
  58. -->
  59. </style>
  60. </head>
  61. <body lang="en" bgcolor="#FFFFFF" text="#000000" link="#0000FF" vlink="#800080" alink="#FF0000">
  62. <a name="Traditional-lexical-analysis"></a>
  63. <div class="header">
  64. <p>
  65. Next: <a href="Traditional-macros.html#Traditional-macros" accesskey="n" rel="next">Traditional macros</a>, Up: <a href="Traditional-Mode.html#Traditional-Mode" accesskey="u" rel="up">Traditional Mode</a> &nbsp; [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Index-of-Directives.html#Index-of-Directives" title="Index" rel="index">Index</a>]</p>
  66. </div>
  67. <hr>
  68. <a name="Traditional-lexical-analysis-1"></a>
  69. <h3 class="section">10.1 Traditional lexical analysis</h3>
  70. <p>The traditional preprocessor does not decompose its input into tokens
  71. the same way a standards-conforming preprocessor does. The input is
  72. simply treated as a stream of text with minimal internal form.
  73. </p>
  74. <p>This implementation does not treat trigraphs (see <a href="Initial-processing.html#trigraphs">trigraphs</a>)
  75. specially since they were an invention of the standards committee. It
  76. handles arbitrarily-positioned escaped newlines properly and splices
  77. the lines as you would expect; many traditional preprocessors did not
  78. do this.
  79. </p>
  80. <p>The form of horizontal whitespace in the input file is preserved in
  81. the output. In particular, hard tabs remain hard tabs. This can be
  82. useful if, for example, you are preprocessing a Makefile.
  83. </p>
  84. <p>Traditional CPP only recognizes C-style block comments, and treats the
  85. &lsquo;<samp>/*</samp>&rsquo; sequence as introducing a comment only if it lies outside
  86. quoted text. Quoted text is introduced by the usual single and double
  87. quotes, and also by an initial &lsquo;<samp>&lt;</samp>&rsquo; in a <code>#include</code>
  88. directive.
  89. </p>
  90. <p>Traditionally, comments are completely removed and are not replaced
  91. with a space. Since a traditional compiler does its own tokenization
  92. of the output of the preprocessor, this means that comments can
  93. effectively be used as token paste operators. However, comments
  94. behave like separators for text handled by the preprocessor itself,
  95. since it doesn&rsquo;t re-lex its input. For example, in
  96. </p>
  97. <div class="smallexample">
  98. <pre class="smallexample">#if foo/**/bar
  99. </pre></div>
  100. <p>&lsquo;<samp>foo</samp>&rsquo; and &lsquo;<samp>bar</samp>&rsquo; are distinct identifiers and expanded
  101. separately if they happen to be macros. In other words, this
  102. directive is equivalent to
  103. </p>
  104. <div class="smallexample">
  105. <pre class="smallexample">#if foo bar
  106. </pre></div>
  107. <p>rather than
  108. </p>
  109. <div class="smallexample">
  110. <pre class="smallexample">#if foobar
  111. </pre></div>
  112. <p>Generally speaking, in traditional mode an opening quote need not have
  113. a matching closing quote. In particular, a macro may be defined with
  114. replacement text that contains an unmatched quote. Of course, if you
  115. attempt to compile preprocessed output containing an unmatched quote
  116. you will get a syntax error.
  117. </p>
  118. <p>However, all preprocessing directives other than <code>#define</code>
  119. require matching quotes. For example:
  120. </p>
  121. <div class="smallexample">
  122. <pre class="smallexample">#define m This macro's fine and has an unmatched quote
  123. &quot;/* This is not a comment. */
  124. /* <span class="roman">This is a comment. The following #include directive
  125. is ill-formed.</span> */
  126. #include &lt;stdio.h
  127. </pre></div>
  128. <p>Just as for the ISO preprocessor, what would be a closing quote can be
  129. escaped with a backslash to prevent the quoted text from closing.
  130. </p>
  131. <hr>
  132. <div class="header">
  133. <p>
  134. Next: <a href="Traditional-macros.html#Traditional-macros" accesskey="n" rel="next">Traditional macros</a>, Up: <a href="Traditional-Mode.html#Traditional-Mode" accesskey="u" rel="up">Traditional Mode</a> &nbsp; [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Index-of-Directives.html#Index-of-Directives" title="Index" rel="index">Index</a>]</p>
  135. </div>
  136. </body>
  137. </html>