Top Qs
Timeline
Chat
Perspective

Pretty-printing

Formatting to make code or markup easier to read From Wikipedia, the free encyclopedia

Remove ads

Pretty-printing (or prettyprinting) is the application of any of various stylistic formatting conventions to text files, such as source code, markup, and similar kinds of content. These formatting conventions may entail adhering to an indentation style, using different color and typeface to highlight syntactic elements of source code, or adjusting size, to make the content easier for people to read, and understand. Pretty-printers for source code are sometimes called code formatters or beautifiers.

Remove ads

Pretty-printing mathematics

Thumb
A typeset mathematical expression

Pretty-printing usually refers to displaying mathematical expressions similar to the way they would be typeset professionally. For example, in computer algebra systems such as Maxima or Mathematica the system may write output like "x ^ 2 + 3 * x" as "". Some graphing calculators can perform pretty-printing. Such calculators include the Casio 9860 series, HP-49/50 series and HP Prime, TI-84 Plus, TI-89, TI-Nspire, and the TI-83 Plus with the PrettyPt[1] add-on. Similarly, the TI-84 Plus with the PrettPt add-on or with "MathPrint"-enabled operating systems. Additionally, a number of newer scientific calculators are equipped with dot matrix screens capable of pretty-printing such as the Casio FX-ES series (Natural Display), Sharp EL-W series (WriteView), HP SmartCalc 300s, TI-30XB, and Numworks.

Many text formatting programs can also typeset mathematics: TeX was developed specifically for high-quality mathematical typesetting.

Remove ads

Pretty-printing markup and tag-based code

Summarize
Perspective
Thumb
HTML source code, pretty-printed to better show the hierarchical relationships of its elements (called tags)

Pretty-printing in markup language instances is most typically associated with indentation of tags and string content to visually determine hierarchy and nesting. Although the syntactical structures of tag-based languages do not significantly vary, the indentation may vary significantly due to how a markup language is interpreted or due to the data it describes.

In MathML, whitespace characters do not reflect data, meaning, or syntax above what is required by XML syntax. In HTML, whitespace characters between tags are considered text and are parsed as text nodes into the parsed result.[2] While indentation may be generously applied to a MathML document, sufficient additional care must be taken in pretty-printing an HTML document to ensure additional text nodes are not created or destroyed in general proximity to the content or content-reflective tag elements. This difference in complexity is non-trivial from the perspective of an automated pretty-print operation where no special rules or edge cases are necessary, as in the more simple MathML example. The HTML example may require a series of progressive interrelated algorithms to account for various patterns of tag elements and content that conforms to a uniform style and is consistent in application across various instances, as evidenced by the markup.ts[3] application component used to beautify HTML, XML, and related technologies for the Pretty Diff tool.

Remove ads

Formatting of program source code

Summarize
Perspective

Programmers are taught to adhere to a formatting style when writing program source code, to make code easier to read and understand. Styles vary. They cover matters such as indentation, whitespace surrounding keywords and punctuation marks, and positioning of braces.

Tools exist to automate (re-)formatting. They are called beautifiers, code formatters, or code indenters. These tools convert source code from one format style to another. The implementation of such tools is relatively straightforward, because the syntax of programming languages is unambiguous. The tool reads the source code, recognizes the code's component structures (such as assignment statements, conditionals, and loops -- see control flow), then displays the source code, formatted in the style specified in a configuration file.

Some code beautifiers are standalone applications. Others are built into text editors and integrated development environments. For example, the Emacs editor has settings to specify a programming language ("Emacs language mode") and to indent blocks of code accordingly.[4]

HTML

Lisp pretty-printer

An early example of pretty-printing was Bill Gosper's "GRINDEF" (i.e. 'grind function') program (c. 1967), which used combinatorial search with pruning, to format LISP programs. Early versions operated on the executable (list structure) form of the Lisp program and were oblivious to the special meanings of various functions. Later versions had special read conventions for incorporating non-executable comments and also for preserving read macros in unexpanded form. They also allowed special indentation conventions for special functions such as if.[5][6] The term "grind" was used in some Lisp circles as a synonym for pretty-printing.[7]

Project style rules

Many free software projects have established rules for code layout. The most widely known are the GNU formatting[8] and the BSD style.[9] The biggest difference between the two is the positioning of braces: in the GNU style, each brace is placed on a line by itself, and matched pairs (opening and closing brace) are indented to the same level. The BSD style places an opening brace at the end of a line of code, and allows the closing brace to be followed by an else keyword. The size of indent and location of whitespace also differs.

Example of formatting and beautifying code

The following example shows some typical C structures and how various indentation style rules format them. Without any formatting at all, it looks like this:

int foo(int k){if(k<1||k>2){printf("out of range\n");
printf("this function requires a value of 1 or 2\n");}else{
printf("Switching\n");switch(k){case 1:printf("1\n");break;case
2:printf("2\n");break;}}}

The GNU indent program produces the following output when asked to indent according to the GNU rules:

int
foo (int k)
{
  if (k < 1 || k > 2)
    {
      printf ("out of range\n");
      printf ("this function requires a value of 1 or 2\n");
    }
  else
    {
      printf ("Switching\n");
      switch (k)
        {
        case 1:
          printf ("1\n");
          break;
        case 2:
          printf ("2\n");
          break;
        }
    }
}

It produces this output when formatting according to BSD rules:

int
foo(int k) {
	if (k < 1 || k > 2) {
		printf("out of range\n");
		printf("this function requires a value of 1 or 2\n");
	} else {
		printf("Switching\n");
		switch (k) {
		case 1:
			printf("1\n");
			break;
		case 2:
			printf("2\n");
			break;
		}
	}
}
Remove ads

See also

Related concepts

  • Elastic tabstop, a feature of many source code editors that detects and maintains aligned indents
  • Minification, making source code compact, even if it becomes harder for humans to understand
  • Obfuscation, deliberately making source code very difficult for humans to understand - especially as it becomes more convoluted

Utilities

  • enscript, a text-to-PostScript converter, with pretty-printing features
Remove ads

References

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.

Remove ads