123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187 |
- <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
- <html>
- <!-- Copyright (C) 1988-2017 Free Software Foundation, Inc.
- Permission is granted to copy, distribute and/or modify this document
- under the terms of the GNU Free Documentation License, Version 1.3 or
- any later version published by the Free Software Foundation; with the
- Invariant Sections being "Funding Free Software", the Front-Cover
- Texts being (a) (see below), and with the Back-Cover Texts being (b)
- (see below). A copy of the license is included in the section entitled
- "GNU Free Documentation License".
- (a) The FSF's Front-Cover Text is:
- A GNU Manual
- (b) The FSF's Back-Cover Text is:
- You have freedom to copy and modify this GNU Manual, like GNU
- software. Copies published by the Free Software Foundation raise
- funds for GNU development. -->
- <!-- Created by GNU Texinfo 5.2, http://www.gnu.org/software/texinfo/ -->
- <head>
- <title>GNU Compiler Collection (GCC) Internals: Looping Patterns</title>
- <meta name="description" content="GNU Compiler Collection (GCC) Internals: Looping Patterns">
- <meta name="keywords" content="GNU Compiler Collection (GCC) Internals: Looping Patterns">
- <meta name="resource-type" content="document">
- <meta name="distribution" content="global">
- <meta name="Generator" content="makeinfo">
- <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
- <link href="index.html#Top" rel="start" title="Top">
- <link href="Option-Index.html#Option-Index" rel="index" title="Option Index">
- <link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
- <link href="Machine-Desc.html#Machine-Desc" rel="up" title="Machine Desc">
- <link href="Insn-Canonicalizations.html#Insn-Canonicalizations" rel="next" title="Insn Canonicalizations">
- <link href="Jump-Patterns.html#Jump-Patterns" rel="prev" title="Jump Patterns">
- <style type="text/css">
- <!--
- a.summary-letter {text-decoration: none}
- blockquote.smallquotation {font-size: smaller}
- div.display {margin-left: 3.2em}
- div.example {margin-left: 3.2em}
- div.indentedblock {margin-left: 3.2em}
- div.lisp {margin-left: 3.2em}
- div.smalldisplay {margin-left: 3.2em}
- div.smallexample {margin-left: 3.2em}
- div.smallindentedblock {margin-left: 3.2em; font-size: smaller}
- div.smalllisp {margin-left: 3.2em}
- kbd {font-style:oblique}
- pre.display {font-family: inherit}
- pre.format {font-family: inherit}
- pre.menu-comment {font-family: serif}
- pre.menu-preformatted {font-family: serif}
- pre.smalldisplay {font-family: inherit; font-size: smaller}
- pre.smallexample {font-size: smaller}
- pre.smallformat {font-family: inherit; font-size: smaller}
- pre.smalllisp {font-size: smaller}
- span.nocodebreak {white-space:nowrap}
- span.nolinebreak {white-space:nowrap}
- span.roman {font-family:serif; font-weight:normal}
- span.sansserif {font-family:sans-serif; font-weight:normal}
- ul.no-bullet {list-style: none}
- -->
- </style>
- </head>
- <body lang="en" bgcolor="#FFFFFF" text="#000000" link="#0000FF" vlink="#800080" alink="#FF0000">
- <a name="Looping-Patterns"></a>
- <div class="header">
- <p>
- Next: <a href="Insn-Canonicalizations.html#Insn-Canonicalizations" accesskey="n" rel="next">Insn Canonicalizations</a>, Previous: <a href="Jump-Patterns.html#Jump-Patterns" accesskey="p" rel="prev">Jump Patterns</a>, Up: <a href="Machine-Desc.html#Machine-Desc" accesskey="u" rel="up">Machine Desc</a> [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Option-Index.html#Option-Index" title="Index" rel="index">Index</a>]</p>
- </div>
- <hr>
- <a name="Defining-Looping-Instruction-Patterns"></a>
- <h3 class="section">16.13 Defining Looping Instruction Patterns</h3>
- <a name="index-looping-instruction-patterns"></a>
- <a name="index-defining-looping-instruction-patterns"></a>
- <p>Some machines have special jump instructions that can be utilized to
- make loops more efficient. A common example is the 68000 ‘<samp>dbra</samp>’
- instruction which performs a decrement of a register and a branch if the
- result was greater than zero. Other machines, in particular digital
- signal processors (DSPs), have special block repeat instructions to
- provide low-overhead loop support. For example, the TI TMS320C3x/C4x
- DSPs have a block repeat instruction that loads special registers to
- mark the top and end of a loop and to count the number of loop
- iterations. This avoids the need for fetching and executing a
- ‘<samp>dbra</samp>’-like instruction and avoids pipeline stalls associated with
- the jump.
- </p>
- <p>GCC has three special named patterns to support low overhead looping.
- They are ‘<samp>decrement_and_branch_until_zero</samp>’, ‘<samp>doloop_begin</samp>’,
- and ‘<samp>doloop_end</samp>’. The first pattern,
- ‘<samp>decrement_and_branch_until_zero</samp>’, is not emitted during RTL
- generation but may be emitted during the instruction combination phase.
- This requires the assistance of the loop optimizer, using information
- collected during strength reduction, to reverse a loop to count down to
- zero. Some targets also require the loop optimizer to add a
- <code>REG_NONNEG</code> note to indicate that the iteration count is always
- positive. This is needed if the target performs a signed loop
- termination test. For example, the 68000 uses a pattern similar to the
- following for its <code>dbra</code> instruction:
- </p>
- <div class="smallexample">
- <pre class="smallexample">(define_insn "decrement_and_branch_until_zero"
- [(set (pc)
- (if_then_else
- (ge (plus:SI (match_operand:SI 0 "general_operand" "+d*am")
- (const_int -1))
- (const_int 0))
- (label_ref (match_operand 1 "" ""))
- (pc)))
- (set (match_dup 0)
- (plus:SI (match_dup 0)
- (const_int -1)))]
- "find_reg_note (insn, REG_NONNEG, 0)"
- "…")
- </pre></div>
- <p>Note that since the insn is both a jump insn and has an output, it must
- deal with its own reloads, hence the ‘m’ constraints. Also note that
- since this insn is generated by the instruction combination phase
- combining two sequential insns together into an implicit parallel insn,
- the iteration counter needs to be biased by the same amount as the
- decrement operation, in this case -1. Note that the following similar
- pattern will not be matched by the combiner.
- </p>
- <div class="smallexample">
- <pre class="smallexample">(define_insn "decrement_and_branch_until_zero"
- [(set (pc)
- (if_then_else
- (ge (match_operand:SI 0 "general_operand" "+d*am")
- (const_int 1))
- (label_ref (match_operand 1 "" ""))
- (pc)))
- (set (match_dup 0)
- (plus:SI (match_dup 0)
- (const_int -1)))]
- "find_reg_note (insn, REG_NONNEG, 0)"
- "…")
- </pre></div>
- <p>The other two special looping patterns, ‘<samp>doloop_begin</samp>’ and
- ‘<samp>doloop_end</samp>’, are emitted by the loop optimizer for certain
- well-behaved loops with a finite number of loop iterations using
- information collected during strength reduction.
- </p>
- <p>The ‘<samp>doloop_end</samp>’ pattern describes the actual looping instruction
- (or the implicit looping operation) and the ‘<samp>doloop_begin</samp>’ pattern
- is an optional companion pattern that can be used for initialization
- needed for some low-overhead looping instructions.
- </p>
- <p>Note that some machines require the actual looping instruction to be
- emitted at the top of the loop (e.g., the TMS320C3x/C4x DSPs). Emitting
- the true RTL for a looping instruction at the top of the loop can cause
- problems with flow analysis. So instead, a dummy <code>doloop</code> insn is
- emitted at the end of the loop. The machine dependent reorg pass checks
- for the presence of this <code>doloop</code> insn and then searches back to
- the top of the loop, where it inserts the true looping insn (provided
- there are no instructions in the loop which would cause problems). Any
- additional labels can be emitted at this point. In addition, if the
- desired special iteration counter register was not allocated, this
- machine dependent reorg pass could emit a traditional compare and jump
- instruction pair.
- </p>
- <p>The essential difference between the
- ‘<samp>decrement_and_branch_until_zero</samp>’ and the ‘<samp>doloop_end</samp>’
- patterns is that the loop optimizer allocates an additional pseudo
- register for the latter as an iteration counter. This pseudo register
- cannot be used within the loop (i.e., general induction variables cannot
- be derived from it), however, in many cases the loop induction variable
- may become redundant and removed by the flow pass.
- </p>
- <hr>
- <div class="header">
- <p>
- Next: <a href="Insn-Canonicalizations.html#Insn-Canonicalizations" accesskey="n" rel="next">Insn Canonicalizations</a>, Previous: <a href="Jump-Patterns.html#Jump-Patterns" accesskey="p" rel="prev">Jump Patterns</a>, Up: <a href="Machine-Desc.html#Machine-Desc" accesskey="u" rel="up">Machine Desc</a> [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Option-Index.html#Option-Index" title="Index" rel="index">Index</a>]</p>
- </div>
- </body>
- </html>
|