Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3208159.3208189acmotherconferencesArticle/Chapter ViewAbstractPublication PagescgiConference Proceedingsconference-collections
research-article

Further improvements to OIT sort performance

Published: 11 June 2018 Publication History

Abstract

Transparency requires geometry to be blended in depth sorted order. Order independent transparency (OIT) allows geometry to be rendered in any order, with exact OIT capturing all fragment data during rasterization before sorting and blending. The sorting stage is the only super-linear operation thus is more dominant with increasing scene depth complexity, and still remains costly for deep scenes despite many improvements. The current fastest approach for OIT uses an insertion sort network of fast registers, sorting fragment data in blocks before writing them to local memory and performing a k-way merge. We show that the sort network performance is improved by modularising parts of the network and tuning loop unrolling, thus reducing total sort code size for better cache behaviour. This further improves sort performance by up to 1.8x and total frametime by up to 1.2x, compounding with previous sorting improvements.

References

[1]
Louis Bavoil, Steven P Callahan, Aaron Lefohn, João LD Comba, and Cláudio T Silva. 2007. Multi-fragment effects on the GPU using the k-buffer. In Proceedings of the 2007 symposium on Interactive 3D graphics and games. ACM, 97--104.
[2]
Louis Bavoil and Kevin Myers. 2008. Order independent transparency with dual depth peeling. NVIDIA OpenGL SDK (2008), 1--12.
[3]
Loren Carpenter. 1984. The A-buffer, an antialiased hidden surface method. ACM Siggraph Computer Graphics 18, 3 (1984), 103--108.
[4]
Thomas H Cormen, Charles E Leiserson, Ronald L Rivest, and Clifford Stein. 2001. Introduction to algorithms second edition. (2001).
[5]
Eric Enderton, Erik Sintorn, Peter Shirley, and David Luebke. 2011. Stochastic transparency. IEEE transactions on visualization and computer graphics 17, 8 (2011), 1036--1047.
[6]
Cass Everitt. 2001. Interactive order-independent transparency. White paper, nVIDIA 2, 6 (2001), 7. Pyarelal Knowles, Geoff Leach, and Fabio Zambetta. 2012. Efficient layered fragment buffer techniques. (2012).
[7]
P Knowles, G Leach, and F Zambetta. 2013. Backwards memory allocation and improved OIT. In Proceedings of Pacific Graphics. 59--64.
[8]
Pyarelal Knowles, Geoff Leach, and Fabio Zambetta. 2014. Fast sorting for exact OIT of complex scenes. The Visual Computer 30, 6--8 (2014), 603--613.
[9]
Sylvain Lefebvre, Samuel Hornus, and Anass Lasram. 2013. HA-Buffer: Coherent Hashing for single-pass A-buffer. (2013).
[10]
Baoquan Liu, Li-Yi Wei, Ying-Qing Xu, and Enhua Wu. 2009. Multi-layer depth peeling via fragment sort. In Computer-Aided Design and Computer Graphics, 2009. CAD/Graphics' 09. 11th IEEE International Conference on. IEEE, 452--456.
[11]
Abraham Mammen. 1989. Transparency and antialiasing algorithms implemented with the virtual pixel maps technique. Computer Graphics and Applications, IEEE 9, 4 (1989), 43--55.
[12]
Marilena Maule, João Comba, Rafael Torchelsen, and Rui Bastos. 2013. Hybrid transparency. In Proceedings of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games. ACM, 103--118.
[13]
Marilena Maule, Joao LD Comba, Rafael Torchelsen, and Rui Bastos. 2012. Memory-efficient order-independent transparency with dynamic fragment buffer. In 2012 25th SIBGRAPI Conference on Graphics, Patterns and Images. IEEE, 134--141.
[14]
Morgan McGuire. 2017. Computer Graphics Archive. (July 2017). https://casual-effects.com/data
[15]
Thomas Porter and Tom Duff. 1984. Compositing digital images. In ACM Siggraph Computer Graphics, Vol. 18. ACM, 253--259.
[16]
Marco Salvi, Jefferson Montgomery, and Aaron Lefohn. 2011. Adaptive transparency. In Proceedings of the ACM SIGGRAPH Symposium on High Performance Graphics. ACM, 119--126.
[17]
Henry Wong, Misel-Myrto Papadopoulou, Maryam Sadooghi-Alvandi, and Andreas Moshovos. 2010. Demystifying GPU microarchitecture through microbenchmarking. In Performance Analysis of Systems & Software (ISPASS), 2010 IEEE International Symposium on. IEEE, 235--246.
[18]
Chris Wyman. 2016. Stochastic layered alpha blending. In ACM SIGGRAPH 2016 Talks. ACM, 37.

Cited By

View all
  • (2020)A Survey of Multifragment RenderingComputer Graphics Forum10.1111/cgf.1401939:2(623-642)Online publication date: 13-Jul-2020
  • (2018)Fast raycasting using a compound deep image for VPL range determinationSIGGRAPH Asia 2018 Technical Briefs10.1145/3283254.3283285(1-4)Online publication date: 4-Dec-2018

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CGI 2018: Proceedings of Computer Graphics International 2018
June 2018
284 pages
ISBN:9781450364010
DOI:10.1145/3208159
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. OIT
  2. performance
  3. sorting
  4. transparency

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

CGI 2018
CGI 2018: Computer Graphics International 2018
June 11 - 14, 2018
Island, Bintan, Indonesia

Acceptance Rates

CGI 2018 Paper Acceptance Rate 35 of 159 submissions, 22%;
Overall Acceptance Rate 35 of 159 submissions, 22%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)1
Reflects downloads up to 31 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2020)A Survey of Multifragment RenderingComputer Graphics Forum10.1111/cgf.1401939:2(623-642)Online publication date: 13-Jul-2020
  • (2018)Fast raycasting using a compound deep image for VPL range determinationSIGGRAPH Asia 2018 Technical Briefs10.1145/3283254.3283285(1-4)Online publication date: 4-Dec-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media