Go to the next section.

Copyright (C) 1989, 1992, 1993, 1994 Free Software Foundation, Inc.

Permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and this permission notice are preserved on all copies.

Permission is granted to copy and distribute modified versions of this manual under the conditions for verbatim copying, provided that the entire resulting derived work is distributed under the terms of a permission notice identical to this one.

Permission is granted to copy and distribute translations of this manual into another language, under the above conditions for modified versions, except that this permission notice may be stated in a translation approved by the Foundation.

The indent Program

The indent program changes the appearance of a C program by inserting or deleting whitespace. It can be used to make code easier to read. It can also convert from one style of writing C to another.

indent understands a substantial amount about the syntax of C, but it also attempts to cope with incomplete and misformed syntax.

In version 1.2 and more recent versions, the GNU style of indenting is the default.

Invoking indent

As of version 1.3, the format of the indent command is:

indent [options] [input-files]

indent [options] [single-input-file] [-o output-file]

This format is different from earlier versions and other versions of indent.

In the first form, one or more input files are specified. indent makes a backup copy of each file, and the original file is replaced with its indented version. See section Backup Files, for an explanation of how backups are made.

In the second form, only one input file is specified. In this case, or when the standard input is used, you may specify an output file after the `-o' option.

To cause indent to write to standard output, use the `-st' option. This is only allowed when there is only one input file, or when the standard input is used.

If no input files are named, the standard input is read for input. Also, if a filename named `-' is specified, then the standard input is read.

As an example, each of the following commands will input the program `slithy_toves.c' and write its indented text to `slithy_toves.out':

indent slithy_toves.c -o slithy_toves.out

indent -st slithy_toves.c > slithy_toves.out

cat slithy_toves.c | indent -o slithy_toves.out

Most other options to indent control how programs are formatted. As of version 1.2, indent also recognizes a long name for each option name. Long options are prefixed by either `--' or `+'.(1) In most of this document, the traditional, short names are used for the sake of brevity. See section Option Summary, for a list of options, including both long and short names.

Here is another example:

indent -br test/metabolism.c -l85

This will indent the program `test/metabolism.c' using the `-br' and `-l85' options, write the output back to `test/metabolism.c', and write the original contents of `test/metabolism.c' to a backup file in the directory `test'.

Equivalent invocations using long option names for this example would be:

indent --braces-on-if-line --line-length185 test/metabolism.c

indent +braces-on-if-line +line-length185 test/metabolism.c

If you find that you often use indent with the same options, you may put those options into a file called `.indent.pro'. indent will first look for `.indent.pro' in the current directory and use that if found. Otherwise, indent will search your home directory for `.indent.pro' and use that file if it is found. This behaviour is different from that of other versions of indent, which load both files if they both exist.

Command line switches are handled after processing `.indent.pro'. Options specified later override arguments specified earlier, with one exception: Explicitly specified options always override background options (see section Common styles). You can prevent indent from reading an `.indent.pro' file by specifying the `-npro' option.

Backup Files

As of version 1.3, GNU indent makes GNU--style backup files, the same way GNU Emacs does. This means that either simple or numbered backup filenames may be made.

Simple backup file names are generated by appending a suffix to the original file name. The default for the this suffix is the one-character string `~' (tilde). Thus, the backup file for `python.c' would be `python.c~'.

Instead of the default, you may specify any string as a suffix by setting the environment variable SIMPLE_BACKUP_SUFFIX to your preferred suffix.

Numbered backup versions of a file `momewraths' look like `momewraths.c.~23~', where 23 is the version of this particular backup. When making a numbered backup of the file `src/momewrath.c', the backup file will be named `src/momewrath.c.~V~', where V is one greater than the highest version currently existing in the directory `src'.

The type of backup file made is controlled by the value of the environment variable VERSION_CONTROL. If it is the string `simple', then only simple backups will be made. If its value is the string `numbered', then numbered backups will be made. If its value is `numbered-existing', then numbered backups will be made if there already exist numbered backups for the file being indented; otherwise, a simple backup is made. If VERSION_CONTROL is not set, then indent assumes the behaviour of `numbered-existing'.

Other versions of indent use the suffix `.BAK' in naming backup files. This behaviour can be emulated by setting SIMPLE_BACKUP_SUFFIX to `.BAK'.

Note also that other versions of indent make backups in the current directory, rather than in the directory of the source file as GNU indent now does.

Common styles

There are several common styles of C code, including the GNU style, the Kernighan & Ritchie style, and the original Berkeley style. A style may be selected with a single background option, which specifies a set of values for all other options. However, explicitly specified options always override options implied by a background option.

As of version 1.2, the default style of GNU indent is the GNU style. Thus, it is no longer neccessary to specify the option `-gnu' to obtain this format, although doing so will not cause an error. Option settings which correspond to the GNU style are:

-nbad -bap -nbc -bl -bli2 -c33 -cd33 -ncdb -nce -cli0
-cp1 -di2 -nfc1 -nfca -i2 -ip5 -lp -pcs -psl -cs
-nsc -nsob -nss -ts8 -d0 -ci0 -l78

The GNU coding style is that preferred by the GNU project. It is the style that the GNU Emacs C mode encourages and which is used in the C portions of GNU Emacs. (People interested in writing programs for Project GNU should get a copy of The GNU Coding Standards, which also covers semantic and portability issues such as memory usage, the size of integers, etc.)

The Kernighan & Ritchie style is used throughout their well-known book The C Programming Language. It is enabled with the `-kr' option. The Kernighan & Ritchie style corresponds to the following set of options:

-nbad -bap -nbc -br -c33 -cd33 -ncdb -ce -ci4
-cli0 -cp33 -d0 -di1 -nfc1 -nfca -i4 -ip0 -l75 -lp
-npcs -npsl -nsc -nsob -nss -ts8 -cs

Kernighan & Ritchie style does not put comments to the right of code in the same column at all times (nor does it use only one space to the right of the code), so for this style indent has arbitrarily chosen column 33.

The style of the original Berkeley indent may be obtained by specifying `-orig' (or by specifyfying `--original', using the long option name). This style is equivalent to the following settings:

-nbap -nbad -bc -br -c33 -cd33 -cdb -ce -ci4
-cli0 -cp33 -d4 -di16 -fc1 -fca -i4 -ip4 -l75 -lp 
-npcs -psl -sc -nsob -nss -ts8 -ncs

Blank lines

Various programming styles use blank lines in different places. indent has a number of options to insert or delete blank lines in specific places.

The `-bad' option causes indent to force a blank line after every block of declarations. The `-nbad' option causes indent not to force such blank lines.

The `-bap' option forces a blank line after every procedure body. The `-nbap' option forces no such blank line.

The `-sob' option causes indent to swallow optional blank lines (that is, any optional blank lines present in the input will be removed from the output). If the `-nsob' is specified, any blank lines present in the input file will be copied to the output file.

For example, given the input

char *foo;
char *bar;
/* This separates blocks of declarations.  */
int baz;

indent -bad produces

char *foo;
char *bar;

/* This separates blocks of declarations.  */
int baz;

and indent -nbad produces

char *foo;
char *bar;
/* This separates blocks of declarations.  */
int baz;

The `-bap' option forces a blank line after every procedure body.

For example, given the input

int
foo ()
{
  puts("Hi");
}
/* The procedure bar is even less interesting.  */
char *
bar ()
{
  puts("Hello");
}

indent -bap produces

int
foo ()
{
  puts ("Hi");
}

/* The procedure bar is even less interesting.  */
char *
bar ()
{
  puts ("Hello");
}

and indent -nbap produces

int
foo ()
{
  puts ("Hi");
}
/* The procedure bar is even less interesting.  */
char *
bar ()
{
  puts ("Hello");
}

No blank line will be added after the procedure foo.

Comments

indent formats both C and C++ comments. C comments are begun with `/*' and terminated with `*/' and may contain newline characters. C++ comments begin with the delimiter `//' and end at the newline.

indent handles comments differently depending upon their context. indent attempts to distinguish amongst comments which follow statements, comments which follow declarations, comments following preprocessor directives, and comments which are not preceded by code of any sort, i.e., they begin the text of the line (although not neccessarily in column 1).

indent further attempts to leave boxed comments unmodified. The general idea of such a comment is that it is enclosed in a rectangle or "box" of stars or dashes to visually set it apart. More precisely, boxed comments are defined as those in which the initial `/*' is followed immediately by the character `*', `=', `_', or `-', or those in which the beginning comment delimiter (`/*') is on a line by itself, and the following line begins with a `*' in the same column as the star of the opening delimiter.

Examples of boxed comments are:

/**********************
 * Comment in a box!! *
 **********************/

       /*
        * A different kind of scent,
        * for a different kind of comment.
        */

indent attempts to leave boxed comments exactly as they are found in the source file. Thus the indentation of the comment is unchanged, and its length is not checked in any way. The only alteration made is that an embedded tab character may be converted into the appropriate number of spaces.

Comments which are not boxed may be formatted, which means that the line is broken to fit within a right margin and left-filled with whitespace. Single newlines are equivalent to a space, but blank lines (two or more newlines in a row) are taken to mean a paragraph break. Formatting of comments which begin after the first column is enabled with the `-fca' option. To format those beginning in column one, specify `-fc1'. Such formatting is disabled by default.

The right margin for formatting defaults to 78, but may be changed with the `-lc' or the `-l' option. `-l' specifies the right margin for all code, and `-lc' specifies the margin for only for comments. If `-l' is used alone, comments will be formatted according to the margin specified with that option.

If the margin specified does not allow the comment to be printed, the margin will be automatically extended for the duration of that comment. The margin is not respected if the comment is not being formatted.

If the comment begins a line (i.e., there is no program text to its left), it will be indented to the column it was found in unless the comment is within a block of code. In that case, such a comment will be aligned with the indented code of that block. This alignment may be affected by the `-d' option, which specifies an amount by which such comments are moved to the left, or unindented. For example, `-d2' places comments two spaces to the left of code. By default, comments are aligned with code.

Comments to the right of code will appear by default in column 33. This may be changed with one of three options. `-c' will specify the column for comments following code, `-cd' specifies the column for comments following declarations, and `-cp' specifies the column for comments following preprocessor directives #else and #endif.

If the code to the left of the comment exceeds the beginning column, the comment column will be extended to the next tabstop column past the end of the code, or in the case of preprocessor directives, to one space past the end of the directive. This extension lasts only for the output of that particular comment.

The `-cdb' option places the comment delimiters on blank lines. Thus, a single line comment like /* Claustrophobia */ can be transformed into:

/*
   Claustrophobia
 */

Stars can be placed at the beginning of multi-line comments with the `-sc' option. Thus, the single-line comment above can be transformed (with `-cdb -sc') into:

/*
 * Claustrophobia
 */

Statements

The `-br' or `-bl' option specifies how to format braces.

The `-br' option formats statement braces like this:

if (x > 0) {
  x--;
}

The `-bl' option formats them like this:

if (x > 0)
  {
    x--;
  }

These options also affect structure and enumeration declarations. The `-br' option produces structure declarations like the following:

struct Sname {
    int i;
    char chp;
} Vname;

The default behaviour, also obtained by specifying `-bl', would yield the following format for the same declaration:

struct Sname
  {
     int i;
     char chp;
  }
Vname;

If you use the `-bl' option, you may also want to specify the `-bli' option. This option specifies the number of spaces by which braces are indented. `-bli2', the default, gives the result shown above. `-bli0' results in the following:

if (x > 0)
{
  x--;
}

If you are using the `-br' option, you probably want to also use the `-ce' option. This causes the else in an if-then-else construct to cuddle up to the immediately preceding `}'. For example, with `-br -ce' you get the following:

if (x > 0) {
  x--;
} else {
  fprintf (stderr, "...something wrong?\n");
}

With `-br -nce' that code would appear as

if (x > 0) {
  x--;
}
else {
  fprintf (stderr, "...something wrong?\n");
}

The `-cli' option specifies the number of spaces that case labels should be indented to the right of the containing `switch' statement.

If a semicolon is on the same line as a for or while statement, the `-ss' option will cause a space to be placed before the semicolon. This emphasizes the semicolon, making it clear that the body of the for or while statement is an empty statement. -nss disables this feature.

The `-pcs' option causes a space to be placed between the name of the procedure being called and the `(' (for example, puts ("Hi");. The `-npcs' option would give puts("Hi");).

If the `-cs' option is specified, indent puts a space after a cast operator.

The `-bs' option ensures that there is a space between the keyword sizeof and its argument. In some versions, this is known as the `Bill_Shannon' option.

Declarations

By default indent will line up identifiers, in the column specified by the `-di' option. For example, `-di16' makes things look like:

int             foo;
char           *bar;

Using a small value (such as one or two) for the `-di' option can be used to cause the indentifiers to be placed in the first available position, for example

int foo;
char *bar;

The value given to the `-di' option will still affect variables which are put on separate lines from their types, for example `-di2' will lead to

int
  foo;

If the `-bc' option is specified, a newline is forced after each comma in a declaration. For example,

int a,
  b,
  c;

With the `-nbc' option this would look like

int a, b, c;

The `-psl' option causes the type of a procedure being defined to be placed on the line before the name of the procedure. This style is required for the etags program to work correctly, as well as some of the c-mode functions of Emacs.

If you are not using the `-di1' option to place variables being declared immediately after their type, you need to use the `-T' option to tell indent the name of all the typenames in your program that are defined by typedef. `-T' can be specified more than once, and all names specified are used. For example, if your program contains

typedef unsigned long CODE_ADDR;
typedef enum {red, blue, green} COLOR;

you would use the options `-T CODE_ADDR -T COLOR'.

Indentation

One issue in the formatting of code is how far each line should be indented from the left margin. When the beginning of a statement such as if or for is encountered, the indentation level is increased by the value specified by the `-i' option. For example, use `-i8' to specify an eight character indentation for each level. When a statement is broken across two lines, the second line is indented by a number of additional spaces specified by the `-ci' option. `-ci' defaults to 0. However, if the `-lp' option is specified, and a line has a left parenthesis which is not closed on that line, then continuation lines will be lined up to start at the character position just after the left parenthesis. This processing also applies to `[' and applies to `{' when it occurs in initialization lists. For example, a piece of continued code might look like this with `-nlp -ci3' in effect:

  p1 = first_procedure (second_procedure (p2, p3),
     third_procedure (p4, p5));

With `-lp' in effect the code looks somewhat clearer:

  p1 = first_procedure (second_procedure (p2, p3),
                        third_procedure (p4, p5));

indent assumes that tabs are placed at regular intervals of both input and output character streams. These intervals are by default 8 columns wide, but (as of version 1.2) may be changed by the `-ts' option. Tabs are treated as the equivalent number of spaces.

The indentation of type declarations in old-style function definitions is controlled by the `-ip' parameter. This is a numeric parameter specifying how many spaces to indent type declarations. For example, the default `-ip5' makes definitions look like this:

char *
create_world (x, y, scale)
     int x;
     int y;
     float scale;
{
  . . .
}

For compatibility with other versions of indent, the option `-nip' is provided, which is equivalent to `-ip0'.

ASCII C allows white space to be placed on preprocessor command lines between the character `#' and the command name. By default, indent removes this space, but specifying the `-lps' option directs indent to leave this space unmodified.

Disabling Formatting

Formatting of C code may be disabled for portions of a program by embedding special control comments in the program. To turn off formatting for a section of a program, place the disabling control comment /* *INDENT-OFF* */ on a line by itself just before that section. Program text scanned after this control comment is output precisely as input with no modifications until the corresponding enabling comment is scanned on a line by itself. The disabling control comment is /* *INDENT-ON* */, and any text following the comment on the line is also output unformatted. Formatting begins again with the input line following the enabling control comment.

More precisely, indent does not attempt to verify the closing delimiter (*/) for these C comments, and any whitespace on the line is totally transparent.

These control comments also function in their C++ formats, namely // *INDENT-OFF* and // *INDENT-ON*.

It should be noted that the internal state of indent remains unchanged over the course of the unformatted section. Thus, for example, turning off formatting in the middle of a function and continuing it after the end of the function may lead to bizarre results. It is therefore wise to be somewhat modular in selecting code to be left unformatted.

As a historical note, some earlier versions of indent produced error messages beginning with *INDENT**. These versions of indent were written to ignore any input text lines which began with such error messages. I have removed this incestuous feature from GNU indent.

Miscellaneous options

To find out what version of indent you have, use the command indent -version. This will report the version number of indent, without doing any of the normal processing.

The `-v' option can be used to turn on verbose mode. When in verbose mode, indent reports when it splits one line of input into two more more lines of output, and gives some size statistics at completion.

Bugs

The "-troff" option is strongly deprecated, and is not supported. A good thing for someone to do is to rewrite `indent' to generate TeX source as a hardcopy output option, amoung other things.

Copyright

The following copyright notice applies to the indent program. The copyright and copying permissions for this manual appear near the beginning of this document.

Copyright (c) 1989, 1992 Free Software Foundation
Copyright (c) 1985 Sun Microsystems, Inc.
Copyright (c) 1980 The Regents of the University of California.
Copyright (c) 1976 Board of Trustees of the University of Illinois.
All rights reserved.

Redistribution and use in source and binary forms are permitted
provided that the above copyright notice and this paragraph are
duplicated in all such forms and that any documentation,
advertising materials, and other materials related to such
distribution and use acknowledge that the software was developed
by the University of California, Berkeley, the University of Illinois,
Urbana, and Sun Microsystems, Inc.  The name of either University
or Sun Microsystems may not be used to endorse or promote products
derived from this software without specific prior written permission.
THIS SOFTWARE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR
IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED
WARRANTIES OF MERCHANTIBILITY AND FITNESS FOR A PARTICULAR
PURPOSE.

Go to the next section.