Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Reducing shading on GPUs using quad-fragment merging

Published: 26 July 2010 Publication History

Abstract

Current GPUs perform a significant amount of redundant shading when surfaces are tessellated into small triangles. We address this inefficiency by augmenting the GPU pipeline to gather and merge rasterized fragments from adjacent triangles in a mesh. This approach has minimal impact on output image quality, is amenable to implementation in fixed-function hardware, and, when rendering pixel-sized triangles, requires only a small amount of buffering to reduce overall pipeline shading work by a factor of eight. We find that a fragment-shading pipeline with this optimization is competitive with the REYES pipeline approach of shading at micropolygon vertices and, in cases of complex occlusion, can perform up to two times less shading work.

Supplementary Material

JPG File (tp066-10.jpg)
Supplemental material. (067.zip)
MP4 File (tp066-10.mp4)

References

[1]
Akeley, K. 1993. RealityEngine graphics. In Proceedings of SIGGRAPH 93, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM, 109--116.
[2]
Apodaca, A. A., and Gritz, L. 2000. Advanced RenderMan: Creating CGI for Motion Pictures. Morgan Kaufmann.
[3]
Blythe, D. 2006. The Direct3D 10 system. ACM Transactions on Graphics 25, 3 (Aug), 724--734.
[4]
Cook, R., Carpenter, L., and Catmull, E. 1987. The Reyes image rendering architecture. In Computer Graphics (Proceedings of SIGGRAPH 87), ACM, vol. 27, 95--102.
[5]
Deering, M., Winner, S., Schediwy, B., Duffy, C., and Hunt, N. 1988. The triangle processor and normal vector shader: a VLSI system for high performance graphics. In Computer Graphics (Proceedings of SIGGRAPH 88), ACM, vol. 22, 21--30.
[6]
Fatahalian, K., Luong, E., Boulos, S., Akeley, K., Mark, W. R., and Hanrahan, P. 2009. Data-parallel rasterization of micropolygons with defocus and motion blur. In HPG '09: Proceedings of the Conference on High Performance Graphics 2009, ACM, 59--68.
[7]
Fisher, M., Fatahalian, K., Boulos, S., Akeley, K., Mark, W. R., and Hanrahan, P. 2009. DiagSplit: parallel, crack-free, adaptive tessellation for micropolygon rendering. ACM Transactions on Graphics 28, 5, 1--10.
[8]
Greene, N., Kass, M., and Miller, G. 1993. Hierarchical z-buffer visibility. In Proceedings of SIGGRAPH 93, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM, 231--238.
[9]
Kessenich, J., 2009. The OpenGL Shading Language Specification, language version 1.5.
[10]
Microsoft, 2010. Windows DirectX graphics documentation. http://msdn.microsoft.com/en-us/library/ee663301 {Online; accessed 27-April-2010}.
[11]
Molnar, S., Eyles, J., and Poulton, J. 1992. PixelFlow: high-speed rendering using image composition. In Computer Graphics (Proceedings of SIGGRAPH 92), ACM, vol. 26, 231--240.
[12]
Patney, A., and Owens, J. D. 2008. Real-time Reyes-style adaptive surface subdivision. ACM Transactions on Graphics 27, 5, 1--8.
[13]
Ragan-Kelley, J., Lehtinen, J., Chen, J., Doggett, M., and Durand, F. 2010. Decoupled sampling for real-time graphics pipelines. MIT Computer Science and Artificial Intelligence Laboratory Technical Report Series, MIT-CSAIL-TR-2010-015.
[14]
Wexler, D., Gritz, L., Enderton, E., and Rice, J. 2005. GPU-accelerated high-quality hidden surface removal. In HWWS '05: Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware, ACM, ACM, 7--14.
[15]
Zhou, K., Hou, Q., Ren, Z., Gong, M., Sun, X., and Guo, B. 2009. RenderAnts: interactive reyes rendering on gpus. ACM Transactions on Graphics 28, 5, 1--11.

Cited By

View all

Index Terms

  1. Reducing shading on GPUs using quad-fragment merging

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Graphics
    ACM Transactions on Graphics  Volume 29, Issue 4
    July 2010
    942 pages
    ISSN:0730-0301
    EISSN:1557-7368
    DOI:10.1145/1778765
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 July 2010
    Published in TOG Volume 29, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. GPU architecture
    2. micropolygons
    3. real-time rendering

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)10
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 01 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)High-Throughput Batch Rendering for Embodied AISIGGRAPH Asia 2024 Conference Papers10.1145/3680528.3687629(1-9)Online publication date: 3-Dec-2024
    • (2020)Tile Pair-Based Adaptive Multi-Rate Stereo ShadingIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2018.288331426:6(2303-2314)Online publication date: 1-Jun-2020
    • (2016)Masked software occlusion cullingProceedings of High Performance Graphics10.5555/2977336.2977340(23-31)Online publication date: 20-Jun-2016
    • (2016)Compressed Coverage Masks for Path Rendering on Mobile GPUsIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2016.251699022:10(2229-2238)Online publication date: 1-Oct-2016
    • (2016)Practical power consumption analysis with current smartphones2016 29th IEEE International System-on-Chip Conference (SOCC)10.1109/SOCC.2016.7905505(333-337)Online publication date: Sep-2016
    • (2015)Masked depth culling for graphics hardwareACM Transactions on Graphics10.1145/2816795.281813834:6(1-9)Online publication date: 2-Nov-2015
    • (2015)Time-lapse mining from internet photosACM Transactions on Graphics10.1145/276690334:4(1-8)Online publication date: 27-Jul-2015
    • (2015)Compressed coverage masks for path rendering on mobile GPUsProceedings of the 19th Symposium on Interactive 3D Graphics and Games10.1145/2699276.2699291(101-108)Online publication date: 27-Feb-2015
    • (2015)A performance and energy evaluation of many-light rendering algorithmsThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-014-1046-y31:12(1671-1681)Online publication date: 1-Dec-2015
    • (2014)Deep shading buffers on commodity GPUsACM Transactions on Graphics10.1145/2661229.266124533:6(1-12)Online publication date: 19-Nov-2014
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media