Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2018323.2018337acmconferencesArticle/Chapter ViewAbstractPublication PageshpgConference Proceedingsconference-collections
research-article

High-performance software rasterization on GPUs

Published: 05 August 2011 Publication History

Abstract

In this paper, we implement an efficient, completely software-based graphics pipeline on a GPU. Unlike previous approaches, we obey ordering constraints imposed by current graphics APIs, guarantee hole-free rasterization, and support multisample antialiasing. Our goal is to examine the performance implications of not exploiting the fixed-function graphics pipeline, and to discern which additional hardware support would benefit software-based graphics the most.
We present significant improvements over previous work in terms of scalability, performance, and capabilities. Our pipeline is malleable and easy to extend, and we demonstrate that in a wide variety of test cases its performance is within a factor of 2--8x compared to the hardware graphics pipeline on a top of the line GPU.
Our implementation is open sourced and available at http://code.google.com/p/cudaraster/

References

[1]
Aila, T., and Laine, S. 2009. Understanding the efficiency of ray traversal on GPUs. In Proc. High-Performance Graphics 2009, 145--149.
[2]
Akenine-Möller, T., Munkberg, J., and Hasselgren, J. 2007. Stochastic rasterization using time-continuous triangles. In Proc. Graphics Hardware, 7--16.
[3]
Fatahalian, K., Boulos, S., Hegarty, J., Akeley, K., Mark, W. R., Moreton, H., and Hanrahan, P. 2010. Reducing shading on GPUs using quad-fragment merging. ACM Trans. Graph. 29, 67:1--67:8.
[4]
Gascuel, J.-D., Holzschuch, N., Fournier, G., and Peroche, B. 2008. Fast non-linear projections using graphics hardware. In Proc. I3D, 107--114.
[5]
Greene, N., Kass, M., and Miller, G. 1993. Hierarchical z-buffer visibility. In Proc. SIGGRAPH '93, 231--238.
[6]
Hasselgren, J., and Akenine-Möller, T. 2007. PCU: The programmable culling unit. ACM Trans. Graph. 26, 92:1--92:10.
[7]
Lindholm, E., Nickolls, J., Oberman, S., and Montrym, J. 2008. Nvidia Tesla: A unified graphics and computing architecture. IEEE Micro 28, 39--55.
[8]
Liu, F., Huang, M.-C., Liu, X.-H., and Wu, E.-H. 2010. Freepipe: A programmable parallel rendering architecture for efficient multi-fragment effects. In Proc. I3D, 75--82.
[9]
Loop, C., and Eisenacher, C., 2009. Real-time patch-based sort-middle rendering on massively parallel hardware. Microsoft Research tech. rep., MSR-TR-2009-83.
[10]
Molnar, S., Cox, M., Ellsworth, D., and Fuchs, H. 1994. A sorting classification of parallel rendering. IEEE Comput. Graph. Appl. 14, 23--32.
[11]
NVIDIA, 2001. HRAA: High-resolution antialiasing through multisampling. Tech. rep.
[12]
NVIDIA, 2007. Cuda technology; http://www.nvidia.com/cuda.
[13]
Purcell, T. J., Buck, I., Mark, W. R., and Hanrahan, P. 2002. Ray tracing on programmable graphics hardware. ACM Trans. Graph. 21, 3, 703--712.
[14]
Ragan-Kelley, J., Lehtinen, J., Chen, J., Doggett, M., and Durand, F. 2011. Decoupled sampling for graphics pipelines. ACM Trans. Graph. 30, 3, 17:1--17:17.
[15]
Seiler, L., Carmean, D., Sprangle, E., Forsyth, T., Abrash, M., Dubey, P., Junkins, S., Lake, A., Sugerman, J., Cavin, R., Espasa, R., Grochowski, E., Juan, T., and Hanrahan, P. 2008. Larrabee: A many-core x86 architecture for visual computing. ACM Trans. Graph. 27, 18:1--18:15.
[16]
Sugerman, J., Fatahalian, K., Boulos, S., Akeley, K., and Hanrahan, P. 2009. Gramps: A programming model for graphics pipelines. ACM Trans. Graph. 28, 4:1--4:11.

Cited By

View all
  • (2024)Foveated Path Culling: A mixed path tracing and radiance field approach for optimizing rendering in XR DisplaysJournal on Interactive Systems10.5753/jis.2024.435215:1(576-590)Online publication date: 18-Jun-2024
  • (2024)GPU-friendly Stroke ExpansionProceedings of the ACM on Computer Graphics and Interactive Techniques10.1145/36753907:3(1-29)Online publication date: 9-Aug-2024
  • (2024)StopThePop: Sorted Gaussian Splatting for View-Consistent Real-time RenderingACM Transactions on Graphics10.1145/365818743:4(1-17)Online publication date: 19-Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HPG '11: Proceedings of the ACM SIGGRAPH Symposium on High Performance Graphics
August 2011
185 pages
ISBN:9781450308960
DOI:10.1145/2018323
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 August 2011

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

HPG '11
Sponsor:
HPG '11: High Performance Graphics
August 5 - 7, 2011
British Columbia, Vancouver, Canada

Acceptance Rates

Overall Acceptance Rate 15 of 44 submissions, 34%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)79
  • Downloads (Last 6 weeks)7
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Foveated Path Culling: A mixed path tracing and radiance field approach for optimizing rendering in XR DisplaysJournal on Interactive Systems10.5753/jis.2024.435215:1(576-590)Online publication date: 18-Jun-2024
  • (2024)GPU-friendly Stroke ExpansionProceedings of the ACM on Computer Graphics and Interactive Techniques10.1145/36753907:3(1-29)Online publication date: 9-Aug-2024
  • (2024)StopThePop: Sorted Gaussian Splatting for View-Consistent Real-time RenderingACM Transactions on Graphics10.1145/365818743:4(1-17)Online publication date: 19-Jul-2024
  • (2024)Evolutionary image vectorization with variable curve numberProceedings of the Genetic and Evolutionary Computation Conference Companion10.1145/3638530.3664171(2083-2086)Online publication date: 14-Jul-2024
  • (2024)Gaussian Shell Maps for Efficient 3D Human Generation2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.00902(9441-9451)Online publication date: 16-Jun-2024
  • (2024)3D facial modeling, animation, and rendering for digital humans: A surveyNeurocomputing10.1016/j.neucom.2024.128168598(128168)Online publication date: Sep-2024
  • (2024)EvoVec: Evolutionary Image Vectorization with Adaptive Curve Number and Color GradientsParallel Problem Solving from Nature – PPSN XVIII10.1007/978-3-031-70085-9_24(383-397)Online publication date: 7-Sep-2024
  • (2023)3D Gaussian Splatting for Real-Time Radiance Field RenderingACM Transactions on Graphics10.1145/359243342:4(1-14)Online publication date: 26-Jul-2023
  • (2023)ETER: Elastic Tessellation for Real-Time Pixel-Accurate Rendering of Large-Scale NURBS ModelsACM Transactions on Graphics10.1145/359241942:4(1-13)Online publication date: 26-Jul-2023
  • (2023)Skybox: Open-Source Graphic Rendering on Programmable RISC-V GPUsProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3582016.3582024(616-630)Online publication date: 25-Mar-2023
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media