Rendering (computer graphics)

Rendering is the process of generating a photorealistic or non-photorealistic image from input data such as 3D models. The word "rendering" (in one of its senses) originally meant the task performed by an artist when depicting a real or imaginary thing (the finished artwork is also called a "rendering"). Today, to "render" commonly means to generate an image or video from a precise description (often created by an artist) using a computer program.^[1]^[2]^[3]^[4]

A software application or component that performs rendering is called a rendering engine,^[5] render engine, rendering system, graphics engine, or simply a renderer.

A distinction is made between real-time rendering, in which images are generated and displayed immediately (ideally fast enough to give the impression of motion or animation), and offline rendering (sometimes called pre-rendering) in which images, or film or video frames, are generated for later viewing. Offline rendering can use a slower and higher-quality renderer. Interactive applications such as games must primarily use real-time rendering, although they may incorporate pre-rendered content.

Rendering can produce images of scenes or objects defined using coordinates in 3D space, seen from a particular viewpoint. Such 3D rendering uses knowledge and ideas from optics, the study of visual perception, mathematics, and software engineering, and it has applications such as video games, simulators, visual effects for films and television, design visualization, and medical diagnosis. Realistic 3D rendering requires modeling the propagation of light in an environment, e.g. by applying the rendering equation.

Real-time rendering uses high-performance rasterization algorithms that process a list of shapes and determine which pixels are covered by each shape. When more realism is required (e.g. for architectural visualization or visual effects) slower pixel-by-pixel algorithms such as ray tracing are used instead. (Ray tracing can also be used selectively during rasterized rendering to improve the realism of lighting and reflections.) A type of ray tracing called path tracing is currently the most common technique for photorealistic rendering. Path tracing is also popular for generating high-quality non-photorealistic images, such as frames for 3D animated films. Both rasterization and ray tracing can be sped up ("accelerated") by specially designed microprocessors called GPUs.

Rasterization algorithms are also used to render images containing only 2D shapes such as polygons and text. Applications of this type of rendering include digital illustration, graphic design, 2D animation, desktop publishing and the display of user interfaces.

Historically, rendering was called image synthesis^[6]^: xxi but today this term is likely to mean AI image generation.^[7] The term "neural rendering" is sometimes used when a neural network is the primary means of generating an image but some degree of control over the output image is provided.^[8] Neural networks can also assist rendering without replacing traditional algorithms, e.g. by removing noise from path traced images.

Remove ads

Features

Summarize

Perspective

Photorealistic rendering

A large proportion of computer graphics research has worked towards producing images that resemble photographs. Fundamental techniques that make this possible were invented in the 1980s, but at the end of the decade, photorealism for complex scenes was still considered a distant goal.^[9]^: x Today, photorealism is routinely achievable for offline rendering, but remains difficult for real-time rendering.^[10]^: 1–2

In order to produce realistic images, rendering must simulate how light travels from light sources, is reflected, refracted, and scattered (often many times) by objects in the scene, passes through a camera lens, and finally reaches the film or sensor of the camera. The physics used in these simulations is primarily geometrical optics, in which particles of light follow (usually straight) lines called rays, but in some situations (such as when rendering thin films, like the surface of soap bubbles) the wave nature of light must be taken into account.^[11]^[12]

Effects that may need to be simulated include:

Shadows, including both shadows with sharp edges and soft shadows with umbra and penumbra
Reflections in mirrors and smooth surfaces, as well as rough or rippled reflective surfaces
Refraction – the bending of light when it crosses a boundary between two transparent materials such as air and glass. The amount of bending varies with the wavelength of the light, which may cause colored fringes or "rainbows" to appear.
Volumetric effects – absorption and scattering when light travels through partially transparent or translucent substances (called participating media because they modify the light rather than simply allow rays to pass through)^[13]^: 140^[11]
Caustics – bright patches, sometimes with distinct filaments and a folded or twisted appearance, resulting when light is reflected or refracted before illuminating an object.^[13]^: 109

In realistic scenes, objects are illuminated both by light that arrives directly from a light source (after passing mostly unimpeded through air), and light that has bounced off other objects in the scene. The simulation of this complex lighting is called global illumination. In the past, indirect lighting was often faked (especially when rendering animated films) by placing additional hidden lights in the scene, but today path tracing is used to render it accurately.^[14]^: 3^[13]^: 108

For true photorealism, the camera used to take the photograph must be simulated. The thin lens approximation allows combining perspective projection with depth of field (and bokeh) emulation. Camera lens simulations can be made more realistic by modeling the way light is refracted by the components of the lens. Motion blur is often simulated if film or video frames are being rendered.^[11]^[15] Simulated lens flare and bloom are sometimes added to make the image appear subjectively brighter (although the design of real cameras tries to reduce these effects).^[16]^: 12.4

Realistic rendering uses mathematical descriptions of how different surface materials reflect light, called reflectance models or (when physically plausible) bidirectional reflectance distribution functions (BRDFs).^[11] Rendering materials such as marble, plant leaves, and human skin requires simulating an effect called subsurface scattering, in which a portion of the light travels into the material, is scattered, and then travels back out again.^[13]^: 143 The way color, and properties such as roughness, vary over a surface can be represented efficiently using texture mapping.^[16]^: 6.1

Other styles of 3D rendering

For some applications (including early stages of 3D modeling), simplified rendering styles such as wireframe rendering may be appropriate, particularly when the material and surface details have not been defined and only the shape of an object is known.^[17]^: 5.3 Games and other real-time applications may use simpler and less realistic rendering techniques as an artistic or design choice, or to allow higher frame rates on lower-end hardware.

Orthographic and isometric projections can be used for a stylized effect or to ensure that parallel lines are depicted as parallel in CAD rendering.^[16]^: 4.7^[17]^: 3.7

Non-photorealistic rendering (NPR) uses techniques like edge detection and posterization to produce 3D images that resemble technical illustrations, cartoons, or other styles of drawing or painting.^[16]^{: ch 15}

2D rendering

In 2D computer graphics the positions and sizes of shapes are specified using 2D coordinates (x and y) instead of 3D coordinates (x, y, and z). 2D rendering APIs often use a resolution-independent coordinate system, with a viewport determining how to convert coordinates to pixel indexes called device coordinates.^[18]^: 46 Transformations such as scaling, translation, and rotation may be applied before rendering the shapes. These affine transformations are often represented by 3 × 3 matrices, allowing easier composition of transformations.^[18]^{: 245–255}

Higher-quality 2D rendering engines such as SVG renderers usually implement anti-aliasing to reduce the jagged appearance of rasterized lines and shape edges.^[19]^: 2.5.6 When rendering overlapping shapes, renderers commonly use a "painter's model" in which the shapes are drawn in some determined order, or their contributions to each pixel are composited using blending operations that may depend on the order of the inputs.^[19]^: 3.3 Renderers may allow giving shapes a "z index" or "stacking order" to specify the rendering or blending order (unlike the z coordinate used in 3D rendering, this third coordinate only indicates an order, not a distance, and cannot be meaningfully rotated together with the x and y coordinates).^[19]^: 3.4

2D rendering typically does not simulate light propagation (which would likely require specifying 3D positions or thickness for the shapes). Effects such as drop shadows and transparency are defined by mathematical functions with no physical basis.^[20]^: 9.12^[21]^: 5

2D rendering for print output may need to support very high resolutions, e.g. 600 or 1200 DPI for a typical laser printer, or 2400 DPI or higher for an imagesetter or platesetter. Grayscale and color images require halftones (or some form of dithering or stochastic screening) and color separations. A rendering engine called a raster image processor (RIP) converts input data such as PDF files into the high-resolution bitmap images used by the printer.^[22]^: Ch1–2

Remove ads

Inputs

Summarize

Perspective

Before a 3D scene or 2D image can be rendered, it must be described in a way that the rendering software can understand. Historically, inputs for both 2D and 3D rendering were usually text files, which are easier than binary files for humans to edit and debug. For 3D graphics, text formats have largely been supplanted by more efficient binary formats, and by APIs which allow interactive applications to communicate directly with a rendering component without generating a file on disk (although a scene description is usually still created in memory prior to rendering).^[23]^{: 1.2, 3.2.6, 3.3.1, 3.3.7}

Traditional rendering algorithms use geometric descriptions of 3D scenes or 2D images. Applications and algorithms that render visualizations of data scanned from the real world, or scientific simulations, may require different types of input data.

The PostScript format (which is often credited with the rise of desktop publishing) provides a standardized, interoperable way to describe 2D graphics and page layout. The Scalable Vector Graphics (SVG) format is also text-based, and the PDF format uses the PostScript language internally. In contrast, although many 3D graphics file formats have been standardized (including text-based formats such as VRML and X3D), different rendering applications typically use formats tailored to their needs, and this has led to a proliferation of proprietary and open formats, with binary files being more common.^[23]^{: 3.2.3, 3.2.5, 3.3.7}^[24]^: vii^[25]^[26]^: 16.5.2.^[27]

2D vector graphics

A vector graphics image description may include:^[24]^[25]

Coordinates and curvature information for line segments, arcs, and Bézier curves (which may be used as boundaries of filled shapes)
Center coordinates, width, and height (or bounding rectangle coordinates) of basic shapes such as rectangles, circles and ellipses
Color, width and pattern (such as dashed or dotted) for rendering lines
Colors, patterns, and gradients for filling shapes
Bitmap image data (either embedded or in an external file) along with scale and position information
Text to be rendered (along with size, position, orientation, color, and font)
Clipping information, if only part of a shape or bitmap image should be rendered
Transparency and compositing information for rendering overlapping shapes
Color space information, allowing the image to be rendered consistently on different displays and printers

3D geometry

A geometric scene description may include:^[23]^{: Ch. 4-7, 8.7}^[28]

Size, position, and orientation of geometric primitives such as spheres and cones (which may be combined in various ways to create more complex objects)
Vertex coordinates and surface normal vectors for meshes of triangles or polygons (often rendered as smooth surfaces by subdividing the mesh)
Transformations for positioning, rotating, and scaling objects within a scene (allowing parts of the scene to use different local coordinate systems).
"Camera" information describing how the scene is being viewed (position, direction, focal length, and field of view)
Light information (location, type, brightness, and color)
Optical properties of surfaces, such as albedo, roughness, and refractive index,
Optical properties of media through which light passes (transparent solids, liquids, clouds, smoke), e.g. absorption and scattering cross sections
Bitmap image data used as texture maps for surfaces
Small scripts or programs for generating complex 3D shapes or scenes procedurally
Description of how object and camera locations and other information change over time, for rendering an animation

Many file formats exist for storing individual 3D objects or "models". These can be imported into a larger scene, or loaded on-demand by rendering software or games. A realistic scene may require hundreds of items like household objects, vehicles, and trees, and 3D artists often utilize large libraries of models. In game production, these models (along with other data such as textures, audio files, and animations) are referred to as "assets".^[27]^[29]^{: Ch. 4}

Volumetric data

Scientific and engineering visualization often requires rendering volumetric data generated by 3D scans or simulations. Perhaps the most common source of such data is medical CT and MRI scans, which need to be rendered for diagnosis. Volumetric data can be extremely large, and requires specialized data formats to store it efficiently, particularly if the volume is sparse (with empty regions that do not contain data).^[16]^: 14.3.1^[30]^[31]

Before rendering, level sets for volumetric data can be extracted and converted into a mesh of triangles, e.g. by using the marching cubes algorithm. Algorithms have also been developed that work directly with volumetric data, for example to render realistic depictions of the way light is scattered and absorbed by clouds and smoke, and this type of volumetric rendering is used extensively in visual effects for movies. When rendering lower-resolution volumetric data without interpolation, the individual cubes or "voxels" may be visible, an effect sometimes used deliberately for game graphics.^[32]^: 4.6^[16]^{: 13.10, Ch. 14, 16.1}

Photogrammetry and scanning

Photographs of real world objects can be incorporated into a rendered scene by using them as textures for 3D objects. Photos of a scene can also be stitched together to create panoramic images or environment maps, which allow the scene to be rendered very efficiently but only from a single viewpoint. Scanning of real objects and scenes using structured light or lidar produces point clouds consisting of the coordinates of millions of individual points in space, sometimes along with color information. These point clouds may either be rendered directly or converted into meshes before rendering. (Note: "point cloud" sometimes also refers to a minimalist rendering style that can be used for any 3D geometry, similar to wireframe rendering.)^[16]^{: 13.3, 13.9}^[23]^: 1.3

Neural approximations and light fields

A more recent, experimental approach is description of scenes using radiance fields which define the color, intensity, and direction of incoming light at each point in space. (This is conceptually similar to, but not identical to, the light field recorded by a hologram.) For any useful resolution, the amount of data in a radiance field is so large that it is impractical to represent it directly as volumetric data, and an approximation function must be found. Neural networks are typically used to generate and evaluate these approximations, sometimes using video frames, or a collection of photographs of a scene taken at different angles, as "training data".^[33]^[34]

Algorithms related to neural networks have recently been used to find approximations of a scene as 3D Gaussians. The resulting representation is similar to a point cloud, except that it uses fuzzy, partially-transparent blobs of varying dimensions and orientations instead of points. As with neural radiance fields, these approximations are often generated from photographs or video frames.^[35]

Remove ads

Outputs

Summarize

Perspective

The output of rendering may be displayed immediately on the screen (many times a second, in the case of real-time rendering such as games) or saved in a raster graphics file format such as JPEG or PNG. High-end rendering applications commonly use the OpenEXR file format, which can represent finer gradations of colors and high dynamic range lighting, allowing tone mapping or other adjustments to be applied afterwards without loss of quality.^[36]^[37]^{: Ch. 14, Ap. B}

Quickly rendered animations can be saved directly as video files, but for high-quality rendering, individual frames (which may be rendered by different computers in a cluster or render farm and may take hours or even days to render) are output as separate files and combined later into a video clip.^[38]^[29]^{: 1.5, 3.11, 8.11}

The output of a renderer sometimes includes more than just RGB color values. For example, the spectrum can be sampled using multiple wavelengths of light, or additional information such as depth (distance from camera) or the material of each point in the image can be included (this data can be used during compositing or when generating texture maps for real-time rendering, or used to assist in removing noise from a path-traced image). Transparency information can be included, allowing rendered foreground objects to be composited with photographs or video. It is also sometimes useful to store the contributions of different lights, or of specular and diffuse lighting, as separate channels, so lighting can be adjusted after rendering. The OpenEXR format allows storing many channels of data in a single file. Renderers such as Blender and Pixar RenderMan support a large variety of configurable values called Arbitrary Output Variables (AOVs).^[36]^[37]^{: Ch. 14, Ap. B}^[39]

Techniques

Summarize

Perspective

Choosing how to render a 3D scene usually involves trade-offs between speed, memory usage, and realism (although realism is not always desired). The algorithms developed over the years follow a loose progression, with more advanced methods becoming practical as computing power and memory capacity increased. Multiple techniques may be used for a single final image.

An important distinction is between image order algorithms, which iterate over pixels in the image, and object order algorithms, which iterate over objects in the scene. For simple scenes, object order is usually more efficient, as there are fewer objects than pixels.^[40]^{: Ch. 4}

2D vector graphics: The vector displays of the 1960s-1970s used deflection of an electron beam to draw line segments directly on the screen. Nowadays, vector graphics are rendered by rasterization algorithms that also support filled shapes. In principle, any 2D vector graphics renderer can be used to render 3D objects by first projecting them onto a 2D image plane. ^[18]^{: 93, 431, 505, 553}
3D rasterization: Adapts 2D rasterization algorithms so they can be used more efficiently for 3D rendering, handling hidden surface removal via scanline or z-buffer techniques. Different realistic or stylized effects can be obtained by coloring the pixels covered by the objects in different ways. Surfaces are typically divided into meshes of triangles before being rasterized. Rasterization is usually synonymous with "object order" rendering (as described above).^[18]^{: 560-561, 575-590}^[23]^: 8.5^[40]^{: Ch. 9}
Ray casting: Uses geometric formulas to compute the first object that a ray intersects.^[41]^: 8 It can be used to implement "image order" rendering by casting a ray for each pixel, and finding a corresponding point in the scene. Ray casting is a fundamental operation used for both graphical and non-graphical purposes,^[10]^: 6 e.g. determining whether a point is in shadow, or checking what an enemy can see in a game.
Ray tracing: Simulates the bouncing paths of light caused by specular reflection and refraction, requiring a varying number of ray casting operations for each path. Advanced forms use Monte Carlo techniques to render effects such as area lights, depth of field, blurry reflections, and soft shadows, but computing global illumination is usually in the domain of path tracing.^[41]^: 9-13^[42]
Radiosity: A finite element analysis approach that breaks surfaces in the scene into pieces, and estimates the amount of light that each piece receives from light sources, or indirectly from other surfaces. Once the irradiance of each surface is known, the scene can be rendered using rasterization or ray tracing.^[6]^{: 888-890, 1044-1045}
Path tracing: Uses Monte Carlo integration with a simplified form of ray tracing, computing the average brightness of a sample of the possible paths that a photon could take when traveling from a light source to the camera (for some images, thousands of paths need to be sampled per pixel^[10]^: 8). It was introduced as a statistically unbiased way to solve the rendering equation, giving ray tracing a rigorous mathematical foundation.^[43]^[41]^: 11-13

Each of the above approaches has many variations, and there is some overlap. Path tracing may be considered either a distinct technique or a particular type of ray tracing.^[6]^{: 846, 1021} Note that the usage of terminology related to ray tracing and path tracing has changed significantly over time.^[41]^: 7

Ray marching is a family of algorithms, used by ray casting, for finding intersections between a ray and a complex object, such as a volumetric dataset or a surface defined by a signed distance function. It is not, by itself, a rendering method, but it can be incorporated into ray tracing and path tracing, and is used by rasterization to implement screen-space reflection and other effects.^[41]^: 13

A technique called photon mapping traces paths of photons from a light source to an object, accumulating data about irradiance which is then used during conventional ray tracing or path tracing.^[6]^{: 1037-1039} Rendering a scene using only rays traced from the light source to the camera is impractical, even though it corresponds more closely to reality, because a huge number of photons would need to be simulated, only a tiny fraction of which actually hit the camera.^[44]^: 7-9^[18]^: 587

Some authors call conventional ray tracing "backward" ray tracing because it traces the paths of photons backwards from the camera to the light source, and call following paths from the light source (as in photon mapping) "forward" ray tracing.^[44]^: 7-9 However, sometimes the meaning of these terms is reversed.^[45] Tracing rays starting at the light source can also be called particle tracing or light tracing, which avoids this ambiguity.^[14]^: 92^[46]^: 4.5.4

Real-time rendering, including video game graphics, typically uses rasterization, but increasingly combines it with ray tracing and path tracing.^[10]^: 2 To enable realistic global illumination, real-time rendering often relies on pre-rendered ("baked") lighting for stationary objects. For moving objects, it may use a technique called light probes, in which lighting is recorded by rendering omnidirectional views of the scene at chosen points in space (often points on a grid to allow easier interpolation). These are similar to environment maps, but typically use a very low resolution or an approximation such as spherical harmonics.^[47] (Note: Blender uses the term 'light probes' for a more general class of pre-recorded lighting data, including reflection maps.^[48])

Examples comparing different rendering techniques

A low quality rasterized image, rendered by Blender's EEVEE renderer with low shadow map resolution and a low-resolution mesh
A low quality path traced image, rendered by Blender's Cycles renderer with only 16 sampled paths per pixel and a low-resolution mesh
A ray traced image, using the POV-Ray program (using only its ray tracing features) with a low-resolution mesh
A higher quality rasterized image, using Blender's EEVEE renderer with light probes
A higher quality path traced image, using Blender's Cycles renderer with 2000 sampled paths per pixel
An image rendered using POV-Ray's ray tracing, radiosity and photon mapping features
A more realistic path traced image, using Blender's Cycles renderer with image-based lighting
A spectral rendered image, using POV-Ray's ray tracing, radiosity and photon mapping features

Rasterization

The term rasterization (in a broad sense) encompasses many techniques used for 2D rendering and real-time 3D rendering. 3D animated films were rendered by rasterization before ray tracing and path tracing became practical.

A renderer combines rasterization with geometry processing (which is not specific to rasterization) and pixel processing which computes the RGB color values to be placed in the framebuffer for display.^[16]^: 2.1^[40]^: 9

The main tasks of rasterization (including pixel processing) are:^[16]^{: 2, 3.8, 23.1.1}

Determining which pixels are covered by each geometric shape in the 3D scene or 2D image (this is the actual rasterization step, in the strictest sense)
Blending between colors and depths defined at the vertices of shapes, e.g. using barycentric coordinates (interpolation)
Determining if parts of shapes are hidden by other shapes, due to 2D layering or 3D depth (hidden surface removal)
Evaluating a function for each pixel covered by a shape (shading)
Smoothing edges of shapes so pixels are less visible (anti-aliasing)
Blending overlapping transparent shapes (compositing)

3D rasterization is typically part of a graphics pipeline in which an application provides lists of triangles to be rendered, and the rendering system transforms and projects their coordinates, determines which triangles are potentially visible in the viewport, and performs the above rasterization and pixel processing tasks before displaying the final result on the screen.^[16]^: 2.1^[40]^: 9

Historically, 3D rasterization used algorithms like the Warnock algorithm and scanline rendering (also called "scan-conversion"), which can handle arbitrary polygons and can rasterize many shapes simultaneously. Although such algorithms are still important for 2D rendering, 3D rendering now usually divides shapes into triangles and rasterizes them individually using simpler methods.^[49]^[50]^[18]^{: 456, 561–569}

High-performance algorithms exist for rasterizing 2D lines, including anti-aliased lines, as well as ellipses and filled triangles. An important special case of 2D rasterization is text rendering, which requires careful anti-aliasing and rounding of coordinates to avoid distorting the letterforms and preserve spacing, density, and sharpness.^[40]^: 9.1.1^[51]

After 3D coordinates have been projected onto the image plane, rasterization is primarily a 2D problem, but the 3rd dimension necessitates hidden surface removal. Early computer graphics used geometric algorithms or ray casting to remove the hidden portions of shapes, or used the painter's algorithm, which sorts shapes by depth (distance from camera) and renders them from back to front. Depth sorting was later avoided by incorporating depth comparison into the scanline rendering algorithm. The z-buffer algorithm performs the comparisons indirectly by including a depth or "z" value in the framebuffer. A pixel is only covered by a shape if that shape's z value is lower (indicating closer to the camera) than the z value currently in the buffer. The z-buffer requires additional memory (an expensive resource at the time it was invented) but simplifies the rasterization code and permits multiple passes. Memory is now faster and more plentiful, and a z-buffer is almost always used for real-time rendering.^[52]^[53]^[18]^{: 553–570}^[16]^: 2.5.2

A drawback of the basic z-buffer algorithm is that each pixel ends up either entirely covered by a single object or filled with the background color, causing jagged edges in the final image. Early anti-aliasing approaches addressed this by detecting when a pixel is partially covered by a shape, and calculating the covered area. The A-buffer (and other supersampling and multi-sampling techniques) solve the problem less precisely but with higher performance. For real-time 3D graphics, it has become common to use complicated heuristics (and even neural-networks) to perform anti-aliasing.^[53]^[54]^[40]^: 9.3^[16]^: 5.4.2

In 3D rasterization, color is usually determined by a pixel shader or fragment shader, a small program that is run for each pixel. The shader does not (or cannot) directly access 3D data for the entire scene (this would be very slow, and would result in an algorithm similar to ray tracing) and a variety of techniques have been developed to render effects like shadows and reflections using only texture mapping and multiple passes.^[40]^: 17.8

Older and more basic 3D rasterization implementations did not support shaders, and used simple shading techniques such as flat shading (lighting is computed once for each triangle, which is then rendered entirely in one color), Gouraud shading (lighting is computed using normal vectors defined at vertices and then colors are interpolated across each triangle), or Phong shading (normal vectors are interpolated across each triangle and lighting is computed for each pixel).^[40]^: 9.2

Until relatively recently, Pixar used rasterization for rendering its animated films. Unlike the renderers commonly used for real-time graphics, the Reyes rendering system in Pixar's RenderMan software was optimized for rendering very small (pixel-sized) polygons, and incorporated stochastic sampling techniques more typically associated with ray tracing.^[23]^{: 2, 6.3}^[55]

Ray casting

One of the simplest ways to render a 3D scene is to test if a ray starting at the viewpoint (the "eye" or "camera") intersects any of the geometric shapes in the scene, repeating this test using a different ray direction for each pixel. This method, called ray casting, was important in early computer graphics, and is a fundamental building block for more advanced algorithms. Ray casting can be used to render shapes defined by constructive solid geometry (CSG) operations.^[41]^: 8-9^[56]^{: 246–249}

Early ray casting experiments include the work of Arthur Appel in the 1960s. Appel rendered shadows by casting an additional ray from each visible surface point towards a light source. He also tried rendering the density of illumination by casting random rays from the light source towards the object and plotting the intersection points (similar to the later technique called photon mapping).^[57]

When rendering scenes containing many objects, testing the intersection of a ray with every object becomes very expensive. Special data structures are used to speed up this process by allowing large numbers of objects to be excluded quickly (such as objects behind the camera). These structures are analogous to database indexes for finding the relevant objects. The most common are the bounding volume hierarchy (BVH), which stores a pre-computed bounding box or sphere for each branch of a tree of objects, and the k-d tree which recursively divides space into two parts. Recent GPUs include hardware acceleration for BVH intersection tests. K-d trees are a special case of binary space partitioning, which was frequently used in early computer graphics (it can also generate a rasterization order for the painter's algorithm). Octrees, another historically popular technique, are still often used for volumetric data.^[10]^: 16–17^[58]^[56]^[26]^: 36.2

Geometric formulas are sufficient for finding the intersection of a ray with shapes like spheres, polygons, and polyhedra, but for most curved surfaces there is no analytic solution, or the intersection is difficult to compute accurately using limited precision floating point numbers. Root-finding algorithms such as Newton's method can sometimes be used. To avoid these complications, curved surfaces are often approximated as meshes of triangles. Volume rendering (e.g. rendering clouds and smoke), and some surfaces such as fractals, may require ray marching instead of basic ray casting.^[59]^[41]^: 13^[16]^{: 14, 17.3}

Ray tracing

Ray casting can be used to render an image by tracing light rays backwards from a simulated camera. After finding a point on a surface where a ray originated, another ray is traced towards the light source to determine if anything is casting a shadow on that point. If not, a reflectance model (such as Lambertian reflectance for matte surfaces, or the Phong reflection model for glossy surfaces) is used to compute the probability that a photon arriving from the light would be reflected towards the camera, and this is multiplied by the brightness of the light to determine the pixel brightness. If there are multiple light sources, brightness contributions of the lights are added together. For color images, calculations are repeated for multiple wavelengths of light (e.g. red, green, and blue).^[16]^: 11.2.2^[41]^: 8

Classical ray tracing (also called Whitted-style or recursive ray tracing) extends this method so it can render mirrors and transparent objects. If a ray traced backwards from the camera originates at a point on a mirror, the reflection formula from geometric optics is used to calculate the direction the reflected ray came from, and another ray is cast backwards in that direction. If a ray originates at a transparent surface, rays are cast backwards for both reflected and refracted rays (using Snell's law to compute the refracted direction), and so ray tracing needs to support a branching "tree" of rays. In simple implementations, a recursive function is called to trace each ray.^[16]^: 11.2.2^[41]^: 9

Ray tracing usually performs anti-aliasing by taking the average of multiple samples for each pixel. It may also use multiple samples for effects like depth of field and motion blur. If evenly spaced ray directions or times are used for each of these features, many rays are required, and some aliasing will remain. Cook-style, stochastic, or Monte Carlo ray tracing avoids this problem by using random sampling instead of evenly spaced samples. This type of ray tracing is commonly called distributed ray tracing, or distribution ray tracing because it samples rays from probability distributions. Distribution ray tracing can also render realistic "soft" shadows from large lights by using a random sample of points on the light when testing for shadowing, and it can simulate chromatic aberration by sampling multiple wavelengths from the spectrum of light.^[41]^: 10^[44]^: 25

Real surface materials reflect small amounts of light in almost every direction because they have small (or microscopic) bumps and grooves. A distribution ray tracer can simulate this by sampling possible ray directions, which allows rendering blurry reflections from glossy and metallic surfaces. However, if this procedure is repeated recursively to simulate realistic indirect lighting, and if more than one sample is taken at each surface point, the tree of rays quickly becomes huge. Another kind of ray tracing, called path tracing, handles indirect light more efficiently, avoiding branching, and ensures that the distribution of all possible paths from a light source to the camera is sampled in an unbiased way.^[44]^: 25–27^[43]

Ray tracing was often used for rendering reflections in animated films, until path tracing became standard for film rendering. Films such as Shrek 2 and Monsters University also used distribution ray tracing or path tracing to precompute indirect illumination for a scene or frame prior to rendering it using rasterization.^[13]^{: 118–121}

Advances in GPU technology have made real-time ray tracing possible in games, although it is currently almost always used in combination with rasterization.^[10]^: 2 This enables visual effects that are difficult with only rasterization, including reflection from curved surfaces and interreflective objects,^[60]^: 305 and shadows that are accurate over a wide range of distances and surface orientations.^[61]^: 159-160 Ray tracing support is included in recent versions of the graphics APIs used by games, such as DirectX, Metal, and Vulkan.^[62]

Ray tracing has been used to render simulated black holes, and the appearance of objects moving at close to the speed of light, by taking spacetime curvature and relativistic effects into account during light ray simulation.^[63]^[64]

Radiosity

Radiosity (named after the radiometric quantity of the same name) is a method for rendering objects illuminated by light bouncing off rough or matte surfaces. This type of illumination is called indirect light, environment lighting, diffuse lighting, or diffuse interreflection, and the problem of rendering it realistically is called global illumination. Rasterization and basic forms of ray tracing (other than distribution ray tracing and path tracing) can only roughly approximate indirect light, e.g. by adding a uniform "ambient" lighting amount chosen by the artist. Radiosity techniques are also suited to rendering scenes with area lights such as rectangular fluorescent lighting panels, which are difficult for rasterization and traditional ray tracing. Radiosity is considered a physically-based method, meaning that it aims to simulate the flow of light in an environment using equations and experimental data from physics, however it often assumes that all surfaces are opaque and perfectly Lambertian, which reduces realism and limits its applicability.^[16]^{: 10, 11.2.1}^[6]^{: 888, 893}^[65]^[66]^: 6

In the original radiosity method (first proposed in 1984) now called classical radiosity, surfaces and lights in the scene are split into pieces called patches, a process called meshing (this step makes it a finite element method). The rendering code must then determine what fraction of the light being emitted or diffusely reflected (scattered) by each patch is received by each other patch. These fractions are called form factors or view factors (first used in engineering to model radiative heat transfer). The form factors are multiplied by the albedo of the receiving surface and put in a matrix. The lighting in the scene can then be expressed as a matrix equation (or equivalently a system of linear equations) that can be solved by methods from linear algebra.^[65]^[67]^: 46^[6]^{: 888, 896}

Solving the radiosity equation gives the total amount of light emitted and reflected by each patch, which is divided by area to get a value called radiosity that can be used when rasterizing or ray tracing to determine the color of pixels corresponding to visible parts of the patch. For real-time rendering, this value (or more commonly the irradiance, which does not depend on local surface albedo) can be pre-computed and stored in a texture (called an irradiance map) or stored as vertex data for 3D models. This feature was used in architectural visualization software to allow real-time walk-throughs of a building interior after computing the lighting.^[6]^: 890^[16]^: 11.5.1^[66]^: 332

The large size of the matrices used in classical radiosity (the square of the number of patches) causes problems for realistic scenes. Practical implementations may use Jacobi or Gauss-Seidel iterations, which is equivalent (at least in the Jacobi case) to simulating the propagation of light one bounce at a time until the amount of light remaining (not yet absorbed by surfaces) is insignificant. The number of iterations (bounces) required is dependent on the scene, not the number of patches, so the total work is proportional to the square of the number of patches (in contrast, solving the matrix equation using Gaussian elimination requires work proportional to the cube of the number of patches). Form factors may be recomputed when they are needed, to avoid storing a complete matrix in memory.^[6]^{: 901, 907}

The quality of rendering is often determined by the size of the patches, e.g. very fine meshes are needed to depict the edges of shadows accurately. An important improvement is hierarchical radiosity, which uses a coarser mesh (larger patches) for simulating the transfer of light between surfaces that are far away from one another, and adaptively sub-divides the patches as needed. This allows radiosity to be used for much larger and more complex scenes.^[6]^{: 975, 939}

Alternative and extended versions of the radiosity method support non-Lambertian surfaces, such as glossy surfaces and mirrors, and sometimes use volumes or "clusters" of objects as well as surface patches. Stochastic or Monte Carlo radiosity uses random sampling in various ways, e.g. taking samples of incident light instead of integrating over all patches, which can improve performance but adds noise (this noise can be reduced by using deterministic iterations as a final step, unlike path tracing noise). Simplified and partially precomputed versions of radiosity are widely used for real-time rendering, combined with techniques such as octree radiosity that store approximations of the light field.^[6]^{: 979, 982}^[67]^: 49^[68]^[16]^: 11.5

Path tracing

As part of the approach known as physically based rendering, path tracing has become the dominant technique for rendering realistic scenes, including effects for movies.^[69] For example, the popular open source 3D software Blender uses path tracing in its Cycles renderer.^[70] Images produced using path tracing for global illumination are generally noisier than when using radiosity (the main competing algorithm for realistic lighting), but radiosity can be difficult to apply to complex scenes and is prone to artifacts that arise from using a tessellated representation of irradiance.^[69]^[6]^{: 975-976, 1045}

Like distributed ray tracing, path tracing is a kind of stochastic or randomized ray tracing that uses Monte Carlo or Quasi-Monte Carlo integration. It was proposed and named in 1986 by Jim Kajiya in the same paper as the rendering equation. Kajiya observed that much of the complexity of distributed ray tracing could be avoided by only tracing a single path from the camera at a time (in Kajiya's implementation, this "no branching" rule was broken by tracing additional rays from each surface intersection point to randomly chosen points on each light source). Kajiya suggested reducing the noise present in the output images by using stratified sampling and importance sampling for making random decisions such as choosing which ray to follow at each step of a path. Even with these techniques, path tracing would not have been practical for film rendering, using computers available at the time, because the computational cost of generating enough samples to reduce variance to an acceptable level was too high. Monster House, the first feature film rendered entirely using path tracing, was not released until 20 years later.^[43]^[69]^[71]

In its basic form, path tracing is inefficient (requiring too many samples) for rendering caustics and scenes where light enters indirectly through narrow spaces. Attempts were made to address these weaknesses in the 1990s. Bidirectional path tracing has similarities to photon mapping, tracing rays from the light source and the camera separately, and then finding ways to connect these paths (but unlike photon mapping it usually samples new light paths for each pixel rather than using the same cached data for all pixels). Metropolis light transport samples paths by modifying paths that were previously traced, spending more time exploring paths that are similar to other "bright" paths, which increases the chance of discovering even brighter paths. Multiple importance sampling provides a way to reduce variance when combining samples from more than one sampling method, particularly when some samples are much noisier than the others.^[69]^[14]

This later work was summarized and expanded upon in Eric Veach's 1997 PhD thesis, which helped raise interest in path tracing in the computer graphics community. The Arnold renderer, first released in 1998, proved that path tracing was practical for rendering frames for films, and that there was a demand for unbiased and physically based rendering in the film industry; other commercial and open source path tracing renderers began appearing. Computational cost was addressed by rapid advances in CPU and cluster performance.^[69]

Path tracing's relative simplicity and its nature as a Monte Carlo method (sampling hundreds or thousands of paths per pixel) have made it attractive to implement on a GPU, especially on recent GPUs that support ray tracing acceleration technology such as Nvidia's RTX and OptiX.^[72] However bidirectional path tracing and Metropolis light transport are more difficult to implement efficiently on a GPU.^[73]^[74]

Techniques have been developed to denoise the output of path tracing, reducing the number of paths required to achieve acceptable quality, at the risk of losing some detail or introducing small-scale artifacts that are more objectionable than noise.^[75]^[76] Neural networks are now widely used for this purpose.^[77]^[78]^[79]

Research into improving path tracing continues. Many variations of bidirectional path tracing and Metropolis light transport have been explored, and ways of combining path tracing with photon mapping.^[80]^[81] Recent path guiding approaches construct approximations of the light field probability distribution in each volume of space, so paths can be sampled more effectively.^[81]

By combining denoising and hardware ray tracing acceleration, it is now practical to use path tracing for real-time rendering. Due to performance constraints, biased techniques such as a radiance cache may be incorporated. Spatiotemporal reservoir resampling (ReSTIR) aims to improve the quality of real-time path tracing and allow more complex lighting by reusing samples (paths) from previous frames and adjacent pixels.^[82]^: 16.2.4^[83]

Neural rendering

This section has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these messages)

This section needs expansion. You can help by adding to it. (February 2022)

This section needs to be updated. (January 2025)

Neural rendering is a rendering method using artificial neural networks.^[84]^[85] Neural rendering includes image-based rendering methods that are used to reconstruct 3D models from 2-dimensional images.^[84] One of these methods are photogrammetry, which is a method in which a collection of images from multiple angles of an object are turned into a 3D model. There have also been recent developments in generating and rendering 3D models from text and coarse paintings by notably Nvidia, Google and various other companies.

Remove ads

Hardware acceleration

Summarize

Perspective

Rendering is usually limited by available computing power and memory bandwidth, and so specialized hardware has been developed to speed it up ("accelerate" it), particularly for real-time rendering. Hardware features such as a framebuffer for raster graphics are required to display the output of rendering smoothly in real time.

Hardware acceleration does not replace the use of software for rendering, rather it speeds up selected operations or calculations using dedicated circuits, or runs portions of the software's code on a different type of processor.

History

In the era of vector monitors (also called calligraphic displays), a display processing unit (DPU) was a dedicated CPU or coprocessor that maintained a list of visual elements and redrew them continuously on the screen by controlling an electron beam. Advanced DPUs such as Evans & Sutherland's Line Drawing System-1 (and later models produced into the 1980s) incorporated 3D coordinate transformation features to accelerate rendering of wire-frame images.^[18]^{: 93–94, 404–421}^[86] Evans & Sutherland also made the Digistar planetarium projection system, which was a vector display that could render both stars and wire-frame graphics (the vector-based Digistar and Digistar II were used in many planetariums, and a few may still be in operation).^[87]^[88]^[89] A Digistar prototype was used for rendering 3D star fields for the film Star Trek II: The Wrath of Khan – some of the first 3D computer graphics sequences ever seen in a feature film.^[90]

Shaded 3D graphics rendering in the 1970s and early 1980s was usually implemented on general-purpose computers, such as the PDP-10 used by researchers at the University of Utah^[91]^[53]. It was difficult to speed up using specialized hardware because it involves a pipeline of complex steps, requiring data addressing, decision-making, and computation capabilities typically only provided by CPUs (although dedicated circuits for speeding up particular operations were proposed ^[91]). Supercomputers or specially designed multi-CPU computers or clusters were sometimes used for ray tracing.^[56] In 1981, James H. Clark and Marc Hannah designed the Geometry Engine, a VLSI chip for performing some of the steps of the 3D rasterization pipeline, and started the company Silicon Graphics (SGI) to commercialize this technology.^[92]^[93]

Home computers and game consoles in the 1980s contained graphics coprocessors that were capable of scrolling and filling areas of the display, and drawing sprites and lines, though they were not useful for rendering realistic images.^[94]^[95] Towards the end of the 1980s PC graphics cards and arcade games with 3D rendering acceleration began to appear, and in the 1990s such technology became commonplace. Today, even low-power mobile processors typically incorporate 3D graphics acceleration features.^[92]^[96]

GPUs

The 3D graphics accelerators of the 1990s evolved into modern GPUs. GPUs are general-purpose processors, like CPUs, but they are designed for tasks that can be broken into many small, similar, mostly independent sub-tasks (such as rendering individual pixels) and performed in parallel. This means that a GPU can speed up any rendering algorithm that can be split into subtasks in this way, in contrast to 1990s 3D accelerators which were only designed to speed up specific rasterization algorithms and simple shading and lighting effects (although tricks could be used to perform more general computations).^[16]^: ch3^[97]

Due to their origins, GPUs typically still provide specialized hardware acceleration for some steps of a traditional 3D rasterization pipeline, including hidden surface removal using a z-buffer, and texture mapping with mipmaps, but these features are no longer always used.^[16]^: ch3 Recent GPUs have features to accelerate finding the intersections of rays with a bounding volume hierarchy, to help speed up all variants of ray tracing and path tracing,^[58] as well as neural network acceleration features sometimes useful for rendering.^[98]

GPUs are usually integrated with high-bandwidth memory systems to support the read and write bandwidth requirements of high-resolution, real-time rendering, particularly when multiple passes are required to render a frame, however memory latency may be higher than on a CPU, which can be a problem if the critical path in an algorithm involves many memory accesses. GPU design accepts high latency as inevitable (in part because a large number of threads are sharing the memory bus) and attempts to "hide" it by efficiently switching between threads, so a different thread can be performing computations while the first thread is waiting for a read or write to complete.^[16]^: ch3^[99]^[100]

Rendering algorithms will run efficiently on a GPU only if they can be implemented using small groups of threads that perform mostly the same operations. As an example of code that meets this requirement: when rendering a small square of pixels in a simple ray-traced image, all threads will likely be intersecting rays with the same object and performing the same lighting computations. For performance and architectural reasons, GPUs run groups of around 16-64 threads called warps or wavefronts in lock-step (all threads in the group are executing the same instructions at the same time). If not all threads in the group need to run particular blocks of code (due to conditions) then some threads will be idle, or the results of their computations will be discarded, causing degraded performance.^[16]^: ch3^[100]

Hardware and software rendering

Historically, the term hardware rendering (possibly an abbreviation of "hardware accelerated rendering" or "hardware-assisted rendering") was sometimes used to mean rendering using a hardware-accelerated rasterization pipeline (typically for real-time rendering).^[66]^{: 243, 249, 284, 324} In contrast, software rendering meant offline rendering using software that was not limited by the capabilities of graphics hardware, and could use more realistic and higher quality techniques.^[23]^: 1.5 Both hardware and software have evolved, and although these terms are still used, their meaning may now be context-dependent.

When OpenGL and Direct3D were introduced in the 1990s, there was a need to use these APIs on computers that did not have hardware acceleration for 3D graphics. "Fallback" rendering implementations were provided that did not require special hardware, and these are sometimes called "software renderers" (today hardware acceleration is almost always available, and these CPU-only implementations are used primarily for testing).^[101]^: Ch2,13 More recently, "software rendering" may also mean rendering that does not use graphics APIs such as OpenGL, Metal, Direct3D, or Vulkan.^[102]^: Intr.

Types of renderers that in the past might have been called "software renderers" (e.g. path tracing renderers used for offline rendering for movies) now commonly use GPU acceleration, often via APIs such as CUDA or OpenCL, which are not graphics-specific. Since these latter APIs allow running C++ code on a GPU, it is now possible to run the same rendering code on either a CPU or GPU.^[72]

Remove ads

Chronology of algorithms and techniques

Summarize

Perspective

The following is a rough timeline of frequently mentioned rendering techniques, including areas of current research. Note that even in cases where an idea was named in a specific paper, there were almost always multiple researchers or teams working in the same area (including earlier related work). When a method is first proposed it is often very inefficient, and it takes additional research and practical efforts to turn it into a useful technique.^[6]^: 887

The list focuses on academic research and does not include hardware. (For more history see #External links, as well as Computer graphics#History and Golden age of arcade video games#Technology.)

1760 – Lambertian reflectance model^[16]^: 5.2
1931 – Standardized RGB representation of color
1967 – Torrance-Sparrow reflectance model^[103]
1968 – Ray casting^[57]
1968 – Warnock hidden surface removal^[104]^[49]
1970 – Scanline rendering^[50]^[52]
1971 – Gouraud shading^[105]
1973 – Phong shading^[106]^[107]
1973 – Phong reflectance model^[106]^[107]
1974 – Texture mapping^[53]
1974 – Z-buffering^[53]
1976 – Environment mapping^[108]
1977 – Blinn–Phong reflectance model^[109]
1977 – Shadow volumes^[110]
1978 – Shadow mapping^[111]
1978 – Bump mapping^[112]
1980 – BSP trees^[113]
1980 – Ray tracing^[114]
1981 – Cook-Torrance reflectance model^[115]
1983 – MIP maps^[116]
1984 – Octree ray tracing^[117]
1984 – Alpha compositing^[118]
1984 – Distributed ray tracing^[119]
1984 – Radiosity (method for non-trivial scenes in 1985)^[120]^[121]
1984 – A-buffer^[54]
1985 – Hemicube radiosity^[122]
1986 – Light source tracing^[45]
1986 – Rendering equation^[43]
1986 – Path tracing^[43]
1987 – Reyes rendering^[55]
1988 – Irradiance caching^[123]^[124]^[81]
1991 – Xiaolin Wu line anti-aliasing^[125]^[126]
1991 – Hierarchical radiosity^[127]
1993 – Oren–Nayar reflectance model^[128]
1993 – Tone mapping^[129]
1993 – Subsurface scattering^[130]
1993 – Bidirectional path tracing (Lafortune & Willems formulation)^[131]
1994 – Ambient occlusion^[132]
1995 – Photon mapping^[133]
1995 – Multiple importance sampling^[134]
1997 – Bidirectional path tracing (Veach & Guibas formulation)^[135]
1997 – Metropolis light transport^[135]
1997 – Instant Radiosity^[136]
2002 – Precomputed Radiance Transfer^[137]
2002 – Primary sample space Metropolis light transport^[138]^[81]
2003 – MERL BRDF database^[139]
2005 – Lightcuts^[140]^[81]
2005 – Radiance caching^[141]^[81]
2009 – Stochastic progressive photon mapping (SPPM)^[142]^[81]
2012 – Vertex connection and merging (VCM) (also called unified path sampling)^[143]^[144]^[81]^[145]
2012 – Manifold exploration^[146]^[81]
2013 – Gradient-domain rendering^[147]^[81]
2014 – Multiplexed Metropolis light transport^[148]^[81]
2014 – Differentiable rendering^[149]
2015 – Manifold next event estimation (MNEE)^[150]^[81]^[151]
2017 – Path guiding (using adaptive SD-tree)^[152]^[151]
2020 – Spatiotemporal reservoir resampling (ReSTIR)^[153]
2020 – Neural radiance fields^[34]
2023 – 3D Gaussian splatting^[35]

Remove ads

References

Loading content...

External links

Loading content...

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.

Remove ads

Rendering (computer graphics)

Features

Photorealistic rendering

Other styles of 3D rendering

2D rendering

Inputs

2D vector graphics

3D geometry

Volumetric data

Photogrammetry and scanning

Neural approximations and light fields

Outputs

Techniques

Rasterization

Ray casting

Ray tracing

Radiosity

Path tracing

Neural rendering

Hardware acceleration

History

GPUs

Hardware and software rendering

Chronology of algorithms and techniques

See also

References

Further reading

External links

Wikiwand - on