DOI: 10.1145/2492045.2492060
research-article

Megakernels considered harmful: wavefront path tracing on GPUs

Published: 19 July 2013

Abstract

When programming for GPUs, simply porting a large CPU program into an equally large GPU kernel is generally not a good approach. Due to the SIMT execution model on GPUs, divergence in control flow carries substantial performance penalties, as does high register usage that lessens the latency-hiding capability that is essential for the high-latency, high-bandwidth memory system of a GPU. In this paper, we implement a path tracer on a GPU using a wavefront formulation, avoiding these pitfalls that can be especially prominent when using materials that are expensive to evaluate. We compare our performance against the traditional megakernel approach, and demonstrate that the wavefront formulation is much better suited for real-world use cases where multiple complex materials are present in the scene.
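
The divergence argument in the abstract can be illustrated with a toy cost model. The sketch below is not the paper's implementation. It assumes a warp of 32 lockstep threads, made-up per-material shading costs, and a simplified rule that a warp pays for every distinct material branch present among its threads. It ignores queue management, memory traffic, register pressure, and ray casting. The names `megakernel_cost`, `wavefront_cost`, `paths`, and `cost` are hypothetical.

```python
from collections import Counter

WARP_SIZE = 32  # assumed number of threads executing in lockstep


def megakernel_cost(materials, cost):
    """Toy cost when each warp shades whatever paths it happens to hold.

    Under SIMT, a warp runs every distinct material branch present among
    its threads one after another, so it pays the sum of those costs.
    """
    total = 0
    for start in range(0, len(materials), WARP_SIZE):
        warp = materials[start:start + WARP_SIZE]
        total += sum(cost[m] for m in set(warp))
    return total


def wavefront_cost(materials, cost):
    """Toy cost when paths are first binned into one queue per material.

    Each material gets its own kernel launch over its queue, so every warp
    is homogeneous and pays only for the material it evaluates.
    """
    total = 0
    for m, count in Counter(materials).items():
        warps = -(-count // WARP_SIZE)  # ceiling division
        total += warps * cost[m]
    return total


# Hypothetical workload: 256 paths with four interleaved materials,
# ranging from cheap (1 unit) to expensive (50 units) to evaluate.
paths = [0, 1, 2, 3] * 64
cost = {0: 1, 1: 5, 2: 20, 3: 50}

print(megakernel_cost(paths, cost))  # 608: all 8 warps run all 4 branches
print(wavefront_cost(paths, cost))   # 152: 2 homogeneous warps per material
```

In this contrived worst case the binned version does a quarter of the shading work. When every path hits the same material the two functions return the same value, which matches the abstract's point that the benefit shows up in scenes with multiple complex materials.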



Published In

HPG '13: Proceedings of the 5th High-Performance Graphics Conference
July 2013
149 pages
ISBN:9781450321358
DOI:10.1145/2492045

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. GPU
  2. complex materials
  3. path tracing

Qualifiers

  • Research-article

Conference

HPG '13: High Performance Graphics
July 19-21, 2013
Anaheim, California

Acceptance Rates

HPG '13 Paper Acceptance Rate 15 of 44 submissions, 34%;
Overall Acceptance Rate 15 of 44 submissions, 34%

Article Metrics

  • Downloads (Last 12 months): 59
  • Downloads (Last 6 weeks): 3
Reflects downloads up to 26 Sep 2024

Cited By

  • (2024) Real-Time Wave Simulation of Large-Scale Open Sea Based on Self-Adaptive Filtering and Screen Space Level of Detail. Journal of Marine Science and Engineering 12:4 (572). DOI: 10.3390/jmse12040572. Online publication date: 28-Mar-2024
  • (2024) Spear: Across the Streaming Multiprocessors: Porting a Production Renderer to the GPU. Proceedings of the 2024 Digital Production Symposium (1-9). DOI: 10.1145/3665320.3670988. Online publication date: 24-Jul-2024
  • (2024) Real-time Neural Appearance Models. ACM Transactions on Graphics 43:3 (1-17). DOI: 10.1145/3659577. Online publication date: 20-Apr-2024
  • (2024) Walkin’ Robin: Walk on Stars with Robin Boundary Conditions. ACM Transactions on Graphics 43:4 (1-18). DOI: 10.1145/3658153. Online publication date: 19-Jul-2024
  • (2024) Bounded VNDF Sampling for the Smith-GGX BRDF. Proceedings of the ACM on Computer Graphics and Interactive Techniques 7:1 (1-18). DOI: 10.1145/3651291. Online publication date: 13-May-2024
  • (2024) Online Neural Path Guiding with Normalized Anisotropic Spherical Gaussians. ACM Transactions on Graphics 43:3 (1-18). DOI: 10.1145/3649310. Online publication date: 9-Apr-2024
  • (2024) Interactive Volume Visualization via Multi-Resolution Hash Encoding Based Neural Representation. IEEE Transactions on Visualization and Computer Graphics 30:8 (5404-5418). DOI: 10.1109/TVCG.2023.3293121. Online publication date: Aug-2024
  • (2024) Enabling High-Performance Physical Based Rendering on New Sunway Supercomputer. 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (277-288). DOI: 10.1109/IPDPS57955.2024.00032. Online publication date: 27-May-2024
  • (2024) Path guiding for wavefront path tracing: A memory efficient approach for GPU path tracers. Computers & Graphics 121 (103945). DOI: 10.1016/j.cag.2024.103945. Online publication date: Jun-2024
  • (2024) Learning subsurface scattering solutions of tightly-packed granular media using optimal transport. Computers & Graphics 119 (103895). DOI: 10.1016/j.cag.2024.103895. Online publication date: Apr-2024
