Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Graphics Pipeline: Concept Structure

Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

9/13/2019 Graphics pipeline - Wikipedia

Graphics pipeline
In computer graphics, a computer graphics pipeline, rendering pipeline or simply graphics pipeline, is a conceptual model that describes what steps a graphics system
needs to perform to render a 3D scene to a 2D screen.[1] Once a 3D model has been created, for instance in a video game or any other 3D computer animation, the graphics pipeline
is the process of turning that 3D model into what the computer displays.[2] Because the steps required for this operation depend on the software and hardware used and the
desired display characteristics, there is no universal graphics pipeline suitable for all cases. However, graphics application programming interfaces (APIs) such as Direct3D and
OpenGL were created to unify similar steps and to control the graphics pipeline of a given hardware accelerator. These APIs abstract the underlying hardware and keep the
programmer away from writing code to manipulate the graphics hardware accelerators (AMD/Intel/NVIDIA etc.).

The model of the graphics pipeline is usually used in real-time rendering. Often, most of the pipeline steps are implemented in hardware, which allows for special optimizations.
The term "pipeline" is used in a similar sense to the pipeline in processors: the individual steps of the pipeline run parallel but are blocked until the slowest step has been
completed.

Contents
Concept
Structure
Application
Geometry
Definitions
The World Coordinate System
Camera Transformation
Projection
Lighting
Clipping
Window-Viewport transformation
Rasterisation
Inverse
Shader
See also
Sources
References
External links

Concept
The 3D pipeline usually refers to the most common form of computer 3D rendering called 3D polygon rendering, distinct from raytracing and raycasting. In raycasting, a ray
originates at the point where the camera resides, and if that ray hits a surface, the color and lighting of the point on the surface where the ray hit is calculated. In 3D polygon
rendering the reverse happens - the area that is in view of the camera is calculated, and then rays are created from every part of every surface in view of the camera and traced back
to the camera.[3]

Structure
A graphics pipeline can be divided into three main parts: Application, Geometry and Rasterization.[4]

Application
The application step is executed by the software on the main processor (CPU). In the application step, changes are made to the scene as required, for example, by user interaction
by means of input devices or during an animation. In a modern Game Engine such as Unity, the programmer deals almost exclusively with the application step, and uses a high-
level language such as C#, as opposed to C or C++. The new scene with all its primitives, usually triangles, lines and points, is then passed on to the next step in the pipeline.

Examples of tasks that are typically done in the application step are collision detection, animation, morphing, and acceleration techniques using spatial subdivision schemes such
as Quadtrees or Octrees. These are also used to reduce the amount of main memory required at a given time. The "world" of a modern computer game is far larger than what could
fit into memory at once.

Geometry
The geometry step (with Geometry pipeline), which is responsible for the majority of the operations with polygons and their vertices (with Vertex pipeline), can be divided into the
following five tasks. It depends on the particular implementation of how these tasks are organized as actual parallel pipeline steps.

Definitions

https://en.wikipedia.org/wiki/Graphics_pipeline 1/5
9/13/2019 Graphics pipeline - Wikipedia
A vertex (plural: vertices) is a point in the world. Many points are used to join the surfaces. In special cases, point clouds are drawn directly, but this is still the exception.

A triangle is the most common geometric primitive of computer graphics. It is defined by its three vertices and a normal vector - the normal vector serves to indicate the front face
of the triangle and is a vector that is perpendicular to the surface. The triangle may be provided with a color or with a texture (image "glued" on top of it). Triangles always exist on
a single plane, therefore they're preferred over rectangles.

The World Coordinate System


The world coordinate system is the coordinate system in which the virtual world is created. This should meet a few conditions for the following mathematics to be easily applicable:

It must be a rectangular Cartesian coordinate system in which all axes are equally scaled.
How the unit of the coordinate system is defined, is left to the developer. Whether, therefore, the unit vector of the system is to correspond in reality to one meter or an Ångström
depends on the application.

Whether a right-handed or a left-handed coordinate system is to be used may be determined by the graphic library to be used.

Example: If we are to develop a flight simulator, we can choose the world coordinate system so that the origin is in the middle of the earth and
the unit is set to one meter. In addition, in order to make the reference to reality easier, we define that the X axis should intersect the equator on
the zero meridian, and the Z axis passes through the poles. In a Right-handed system, the Y-axis runs through the 90°-East meridian
(somewhere in the Indian Ocean). Now we have a coordinate system that describes every point on Earth in three-dimensional Cartesian
coordinates. In this coordinate system, we are now modeling the principles of our world, mountains, valleys and oceans.

Note: Aside from computer geometry, geographic coordinates are used for the earth, ie, latitude and longitude, as well as altitudes above sea
level. The approximate conversion - if one does not consider the fact that the earth is not an exact sphere - is simple:

with R=Radius of the earth [6.378.137m], lat=Latitude, long=Longitude, hasl=height above

sea level.
All of the following examples apply in a right-handed system. For a left-handed system the signs may need to be interchanged.

The objects contained within the scene (houses, trees, cars) are often designed in their own object coordinate system (also called model coordinate system or local coordinate
system) for reasons of simpler modeling. To assign these objects to coordinates in the world coordinate system or global coordinate system of the entire scene, the object
coordinates are transformed by means of translation, rotation or scaling. This is done by multiplying the corresponding transformation matrices. In addition, several differently
transformed copies can be formed from one object, for example a forest from a tree; This technique is called instancing.

In order to place a model of an aircraft in the world, we first determine four matrices. Since we work in three-dimensional space, we need four-
dimensional homogeneous matrices for our calculations.

First, we need three rotation matrices, namely one for each of the three aircraft axes (vertical axis, transverse axis, longitudinal axis).

Around the X axis (usually defined as a longitudinal axis in the object coordinate system)

Around the Y axis (usually defined as the transverse axis in the object coordinate system)

Around the Z axis (usually defined as vertical axis in the object coordinate system)

We also use a translation matrix that moves the aircraft to the desired point in our world: .

Remark: The above matrices are transposed with respect to the ones in the article rotation matrix. See further down for an explanation why.

Now we could calculate the position of the vertices of the aircraft in world coordinates by multiplying each point successively with these four matrices. Since the multiplication of a
matrix with a vector is quite expensive (time consuming), one usually takes another path and first multiplies the four matrices together. The multiplication of two matrices is even
more expensive, but must be executed only once for the whole object. The multiplications and are equivalent.
Thereafter, the resulting matrix could be applied to the vertices. In practice, however, the multiplication with the vertices is still not applied, but the camera matrices - see below -
are determined first.

For our example from above, however, the translation has to be determined somewhat differently, since the common meaning of Up - apart
from at the North Pole - does not coincide with our definition of the positive Z axis and therefore the model must also be rotated around the
center of the earth: The first step pushes the origin of the model to the correct
height above the earth's surface, then it is rotated by latitude and longitude.

The order in which the matrices are applied is important, because the matrix multiplication is not commutative. This also applies to the three rotations, as can be demonstrated by
an example: The point (1, 0, 0) lies on the X-axis, if one rotates it first by 90° around the X- and then around The Y axis, it ends up on the Z axis (the rotation around the X axis has
no effect on a point that is on the axis). If, on the other hand, one rotates around the Y axis first and then around the X axis, the resulting point is located on the Y axis. The

https://en.wikipedia.org/wiki/Graphics_pipeline 2/5
9/13/2019 Graphics pipeline - Wikipedia
sequence itself is arbitrary as long as it is always the same. The sequence with x, then y, then z (roll, pitch, heading) is often the most intuitive, because the rotation causes the
compass direction to coincide with the direction of the "nose".

There are also two conventions to define these matrices, depending on whether you want to work with column vectors or row vectors. Different graphics libraries have different
preferences here. OpenGL prefers column vectors, DirectX row vectors. The decision determines from which side the point vectors are to be multiplied by the transformation
matrices. For column vectors, the multiplication is performed from the right, i.e. , where vout and vin are 4x1 column vectors. The concatenation of the matrices also
is done from the right to left, i.e., for example , when first rotating and then shifting.

In the case of row vectors, this works exactly the other way round. The multiplication now takes place from left as with 1x4-row vectors and the concatenation is
when we also first rotate and then move. The matrices shown above are valid for the second case, while those for column vectors are transposed. The rule
[5] applies, which for multiplication with vectors means that you can switch the multiplication order by transposing the matrix.

The interesting thing about this matrix chaining is that a new coordinate system is defined by each such transformation. This can be extended as desired. For example, the
propeller of the aircraft may be a separate model, which is then placed by translation to the aircraft nose. This translation only needs to describe the shift from the model
coordinate system to the propeller coordinate system. In order to draw the entire aircraft, the transformation matrix for the aircraft is first determined, the points are transformed,
and then the propeller model matrix is multiplied to the matrix of the aircraft, and then the propeller points are transformed.

The matrix calculated in this way is also called the world matrix. It must be determined for each object in the world before rendering. The application can introduce changes here,
for example change the position of the aircraft according to the speed after each frame.

Camera Transformation
In addition to the objects, the scene also defines a virtual camera or viewer that indicates the position and direction of
view from which the scene is to be rendered. To simplify later projection and clipping, the scene is transformed so
that the camera is at the origin, looking along the Z axis. The resulting coordinate system is called the camera
coordinate system and the transformation is called camera transformation or View Transformation.

The view matrix is usually determined from camera position, target point (where the camera
looks) and an "up vector" ("up" from the viewer's viewpoint). First three auxiliary vectors are
required:
Zaxis = normal(cameraPosition - cameraTarget)
Xaxis = normal(cross(cameraUpVector, zaxis)) Left: Position and direction of the virtual viewer
Yaxis = cross(zaxis, xaxis) (camera), as defined by the user. Right: Positioning the
With normal(v) = normalization of the vector v; objects after the camera transformation. The light gray
cross(v1, v2) = cross product of v1 and v2. area is the visible volume.

Finally, the matrix:

with dot(v1, v2) = dot product of v1 and v2.

Projection
The 3D projection step transforms the view volume into a cube with the corner point coordinates (-1, -1, 0) and (1, 1, 1); Occasionally other target volumes are also used. This step is
called projection, even though it transforms a volume into another volume, since the resulting Z coordinates are not stored in the image, but are only used in Z-buffering in the
later rastering step. In a perspective illustration, a central projection is used. To limit the number of displayed objects, two additional clipping planes are used; The visual volume is
therefore a truncated pyramid (frustum). The parallel or orthogonal projection is used, for example, for technical representations because it has the advantage that all parallels in
the object space are also parallel in the image space, and the surfaces and volumes are the same size regardless of the distance from the viewer. Maps use, for example, an
orthogonal projection (so-called orthophoto), but oblique images of a landscape cannot be used in this way - although they can technically be rendered, they seem so distorted that
we cannot make any use of them. The formula for calculating a perspective mapping matrix is:

With h = cot (fieldOfView / 2.0) (aperture angle of the camera); w = h / aspectRatio (aspect ratio of the target image); near = Smallest distance
to be visible; far = The longest distance to be visible.

The reasons why the smallest and the greatest distance have to be given here are, on the one hand, that this distance is divided by in order to reach the scaling of the scene (more
distant objects are smaller in a perspective image than near objects), and on the other hand to scale the Z values to the range 0..1, for filling the Z-buffer. This buffer often has only
a resolution of 16 bits, which is why the near and far values should be chosen carefully. A too large difference between the near and the far value leads to so-called Z-fighting
because of the low resolution of the Z-buffer. It can also be seen from the formula that the near value cannot be 0, because this point is the focus point of the projection. There is no
picture at this point.

For the sake of completeness, the formula for parallel projection (orthogonal projection):

with w = width of the target cube (dimension in units of the world coordinate system); H = w / aspectRatio (aspect ratio of the target image);
near = Smallest distance to be visible; far = The longest distance to be visible.

For reasons of efficiency, the camera and projection matrix are usually combined into a transformation matrix so that the camera coordinate system is omitted. The resulting
matrix is usually the same for a single image, while the world matrix looks different for each object. In practice, therefore, view and projection are pre-calculated so that only the
world matrix has to be adapted during the display. However, more complex transformations such as vertex blending are possible. Freely programmable geometry shaders that

https://en.wikipedia.org/wiki/Graphics_pipeline 3/5
9/13/2019 Graphics pipeline - Wikipedia
modify the geometry can also be executed.

In the actual rendering step, the world matrix * camera matrix * projection matrix is calculated and then finally applied to every single point. Thus, the points of all objects are
transferred directly to the screen coordinate system (at least almost, the value range of the axes are still -1..1 for the visible range, see section "Window-Viewport-Transformation").

Lighting
Often a scene contains light sources placed at different positions to make the lighting of the objects appear more realistic. In this case, a gain factor for the texture is calculated for
each vertex based on the light sources and the material properties associated with the corresponding triangle. In the later rasterization step, the vertex values of a triangle are
interpolated over its surface. A general lighting (ambient light) is applied to all surfaces. It is the diffuse and thus direction-independent brightness of the scene. The sun is a
directed light source, which can be assumed to be infinitely far away. The illumination effected by the sun on a surface is determined by forming the scalar product of the
directional vector from the sun and the normal vector of the surface. If the value is negative, the surface is facing the sun.

Clipping
Only the primitives which are within the visual volume need to actually be rastered (drawn). This visual volume is defined as the inside
of a frustum, a shape in the form of a pyramid with a cut off top. Primitives which are completely outside the visual volume are
discarded; This is called frustum culling. Further culling methods such as backface culling, which reduce the number of primitives to be
considered, can theoretically be executed in any step of the graphics pipeline. Primitives which are only partially inside the cube must be
clipped against the cube. The advantage of the previous projection step is that the clipping always takes place against the same cube.
Clipping of primitives against the
Only the - possibly clipped - primitives, which are within the visual volume, are forwarded to the final step.
cube. The blue triangle is discarded
while the orange triangle is clipped,
Window-Viewport transformation creating two new vertices

In order to output the image to any target area (viewport) of the screen, another transformation, the Window-Viewport
transformation, must be applied. This is a shift, followed by scaling. The resulting coordinates are the device coordinates of the output
device. The viewport contains 6 values: height and width of the window in pixels, the upper left corner of the window in window
coordinates (usually 0, 0) and the minimum and maximum values for Z (usually 0 and 1).

Formally:

With vp=Viewport; v=Point after projection


Frustum
On modern hardware, most of the geometry computation steps are performed in the vertex shader. This is, in principle, freely
programmable, but generally performs at least the transformation of the points and the illumination calculation. For the DirectX
programming interface, the use of a custom vertex shader is necessary from version 10, while older versions still have a standard shader.

Rasterisation
The rasterisation step is the final step before the fragment shader pipeline that all primitives are rasterized with Pixel pipeline. In the
rasterisation step, discrete fragments are created from continuous primitives. Window-Viewport-Transformation

In this stage of the graphics pipeline, the grid points are also called fragments, for the sake of greater distinctiveness. Each fragment
corresponds to one pixel in the frame buffer and this corresponds to one pixel of the screen. These can be colored (and possibly illuminated). Furthermore, it is necessary to
determine the visible, closer to the observer fragment, in the case of overlapping polygons. A Z-buffer is usually used for this so-called hidden surface determination. The color of a
fragment depends on the illumination, texture, and other material properties of the visible primitive and is often interpolated using the triangle vertice properties. Where available,
a fragment shader (also called Pixel Shader) is run in the rastering step for each fragment of the object. If a fragment is visible, it can now be mixed with already existing color
values in the image if transparency or multi-sampling is used. In this step, one or more fragments become a pixel.

To prevent that the user sees the gradual rasterisation of the primitives, double buffering takes place. The rasterisation is carried out in a special memory area. Once the image has
been completely rasterised, it is copied to the visible area of the image memory.

Inverse
All matrices used are nonsingular and thus invertible. Since the multiplication of two nonsingular matrices creates another nonsingular matrix, the entire transformation matrix is
also invertible. The inverse is required to recalculate world coordinates from screen coordinates - for example, to determine from the mouse pointer position the clicked object.
However, since the screen and the mouse have only two dimensions, the third is unknown. Therefore, a ray is projected at the cursor position into the world and then the
intersection of this ray with the polygons in the world is determined.

Shader

https://en.wikipedia.org/wiki/Graphics_pipeline 4/5
9/13/2019 Graphics pipeline - Wikipedia
Classic graphics cards are still relatively close to the graphics pipeline. With increasing demands on the GPU, restrictions were gradually removed out to create more flexibility.
Modern graphics cards use a freely programmable, shader-controlled pipeline, which allows direct access to individual processing steps. To relieve the main processor, additional
processing steps have been moved to the pipeline and the GPU.

The most important shader units are pixel shaders, vertex shaders, and geometry shaders. The Unified Shader has been introduced to take full advantage of all units. This gives you
a single large pool of shader units. As required, the pool is divided into different groups of shaders. A strict separation between the shader types is therefore no longer useful.

It is also possible to use a so-called compute-shader to perform any calculations off the display of a graphic on the GPU. The advantage is that they run very parallel, but there are
limitations. These universal calculations are also called general-purpose computing on graphics processing units, or GPGPU for short.

See also
Pipeline (computing)

Instruction pipelining
Hardware acceleration

Sources
Tomas Akenine-Möller, Eric Haines: Real-Time Rendering. AK Peters, Natick, Mass. 2002, ISBN 1-56881-182-9.
Michael Bender, Manfred Brill: Computergrafik: ein anwendungsorientiertes Lehrbuch. Hanser, München 2006, ISBN 3-446-40434-1.
Fischer, Martin (2011-07-04). Pixel-Fabrik Wie Grafikchips Spielewelten auf den Schirm zaubern. c't Magazin für Computer Technik. Heise Zeitschriften Verlag. p. 180.
ISSN 0724-8679 (https://www.worldcat.org/issn/0724-8679). Invalid |script-title=: missing prefix (help)

References
1. "Graphics Pipeline" (https://msdn.microsoft.com/en-us/library/windows/desktop/ff476882%28v=vs.85%29.aspx). Microsoft. Retrieved 15 June 2015.
2. "Lecture: Graphics pipeline" (http://goanna.cs.rmit.edu.au/~gl/teaching/rtr&3dgp/notes/pipeline.html). Retrieved 15 June 2015.
3. https://web.archive.org/web/20161229102150/http://www.cs.virginia.edu/~gfx/Courses/2012/IntroGraphics/lectures/13-Pipeline.pdf
4. Tomas Akenine-Möller, Eric Haines: Real-Time Rendering, S. 11.
5. K. Nipp, D. Stoffer; Lineare Algebra; v/d/f Hochschulverlag der ETH Zürich; Zürich 1998, ISBN 3-7281-2649-7.

External links
ExtremeTech 3D Pipeline Tutorial (http://www.extremetech.com/computing/49076-extremetech-3d-pipeline-tutorial)

Retrieved from "https://en.wikipedia.org/w/index.php?title=Graphics_pipeline&oldid=914670443"

This page was last edited on 8 September 2019, at 19:07 (UTC).

Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy.
Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.

https://en.wikipedia.org/wiki/Graphics_pipeline 5/5

You might also like