Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Open access

NeRF: representing scenes as neural radiance fields for view synthesis

Published: 17 December 2021 Publication History

Abstract

We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views. Our algorithm represents a scene using a fully connected (nonconvolutional) deep network, whose input is a single continuous 5D coordinate (spatial location (x, y, z) and viewing direction (θ, ϕ)) and whose output is the volume density and view-dependent emitted radiance at that spatial location. We synthesize views by querying 5D coordinates along camera rays and use classic volume rendering techniques to project the output colors and densities into an image. Because volume rendering is naturally differentiable, the only input required to optimize our representation is a set of images with known camera poses. We describe how to effectively optimize neural radiance fields to render photorealistic novel views of scenes with complicated geometry and appearance, and demonstrate results that outperform prior work on neural rendering and view synthesis.

References

[1]
Buehler, C., Bosse, M., McMillan, L., Gortler S., Cohen, M. Unstructured lumigraph rendering. In SIGGRAPH (2001).
[2]
Chang, A.X., Fhnkhouser, T., Guibas, L., Hanrahan, P., Huang, Q., Li, Z., Savarese, S., Savva, M., Song, S., Su, H., et al. ShapeNet: An information-rich 3D model repository. arXiv:1512.03012 (2015).
[3]
Curless, B., Levoy, M. A volumetric method for building complex models from range images. In SIGGRAPH (1996).
[4]
Debevec, P., Taylor, C.J., Malik, J. Modeling and rendering architecture from photographs: A hybrid geometry-and image-based approach. In SIGGRAPH (1996).
[5]
Kajiya, J.T., Herzen, B.P.V. Ray tracing volume densities. Comput. Graph. (SIGGRAPH) (1984).
[6]
Kingma, D.P., Ba, J. Adam: A method for stochastic optimization. In ICLR (2015).
[7]
Li, T.-M., Aittala, M., Durand, F., Lehtinen, J. Differentiable monte carlo ray tracing through edge sampling. ACM Trans. Graph. (SIGGRAPH Asia) (2018).
[8]
Lombardi, S., Simon, T., Saragih, J., Schwartz, G., Lehrmann, A., Sheikh, Y. Neural volumes: Learning dynamic renderable volumes from images. ACM Trans. Graph. (SIGGRAPH) (2019).
[9]
Loper, M.M., Black, M.J. OpenDR: An approximate differentiable renderer. In ECCV (2014).
[10]
Max, N. Optical models for direct volume rendering. IEEE Trans. Visual. Comput. Graph. (1995).
[11]
Mescheder, L., Oechsle, M., Niemeyer, M., Nowozin, S., Geiger, A. Occupancy networks: Learning 3D reconstruction in function space. In CVPR (2019).
[12]
Mildenhall, B., Srinivasan, P.P., Ortiz-Cayon, R., Kalantari, N.K., Ramamoorthi, R., Ng, R., Kar, A. Local light field fusion: Practical view synthesis with prescriptive sampling guidelines. ACM Trans. Graph. (SIGGRAPH) (2019).
[13]
Mildenhall, B., Srinivasan, P.P, Tancik, M., Barron, J.T., Ramamoorthi, R., Ng, R. NeRF: Representing scenes as neural radiance fields for view synthesis. In ECCV (2020).
[14]
Niemeyer, M., Mescheder, L., Oechsle, M., Geiger, A. Differentiable volumetric rendering: Learning implicit 3D representations without 3D supervision. In CVPR (2019).
[15]
Park, J.J., Florence, P., Straub, J., Newcombe, R., Lovegrove, S. DeepSDF: Learning continuous signed distance functions for shape representation. In CVPR (2019).
[16]
Porter, T., Duff, T. Compositing digital images. Comput. Graph. (SIGGRAPH) (1984).
[17]
Rahaman, N., Baratin, A., Arpit, D., Dräxler, F., Lin, M., Hamprecht, F.A., Bengio, Y., Courville, A.C. On the spectral bias of neural networks. In ICML (2018).
[18]
Schönberger, J.L., Frahm, J.-M. Structure-from-motion revisited. In CVPR (2016).
[19]
Seitz, S.M., Dyer, C.R. Photorealistic scene reconstruction by voxel coloring. Int. J. Comput. Vision (1999).
[20]
Sitzmann, V., Thies, J., Heide, F., Nießner, M., Wetzstein, G., Zollhöfer, M. Deepvoxels: Learning persistent 3D feature embeddings. In CVPR (2019).
[21]
Sitzmann, V., Zollhoefer, M., Wetzstein, G. Scene representation networks: Continuous 3D-structure-aware neural scene representations. In NeurIPS (2019).
[22]
Tancik, M., Srinivasan, P.P., Mildenhall, B., Fridovich-Keil, S., Raghavan, N., Singhal, U., Ramamoorthi, R., Barron, J.T., Ng, R. Fourier features let networks learn high frequency functions in low dimensional domains. In NeurIPS (2020).
[23]
Wood, D.N., Azuma, D.I., Aldinger, K., Curless, B., Duchamp, T., Salesin, D.H., Stuetzle, W. Surface light fields for 3D photography. In SIGGRAPH (2000).
[24]
Zhang, R., Isola, P., Efros, A.A., Shechtman, E., Wang, O. The unreasonable effectiveness of deep features as a perceptual metric. In CVPR (2018).
[25]
Zhou, T., Tucker, R., Flynn, J., Fyffe, G., Snavely, N. Stereo magnification: Learning view synthesis using multiplane images. ACM Trans. Graph. (SIGGRAPH) (2018).

Cited By

View all
  • (2025)SRNeRF: Super-Resolution Neural Radiance Fields for Autonomous Driving Scenario Reconstruction from Sparse ViewsWorld Electric Vehicle Journal10.3390/wevj1602006616:2(66)Online publication date: 23-Jan-2025
  • (2025)Novel View Synthesis with Depth Priors Using Neural Radiance Fields and CycleGAN with Attention TransformerSymmetry10.3390/sym1701005917:1(59)Online publication date: 1-Jan-2025
  • (2025)CDKD-w+: A Keyframe Recognition Method for Coronary Digital Subtraction Angiography Video Sequence Based on w+ Space EncodingSensors10.3390/s2503071025:3(710)Online publication date: 24-Jan-2025
  • Show More Cited By

Index Terms

  1. NeRF: representing scenes as neural radiance fields for view synthesis

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Communications of the ACM
    Communications of the ACM  Volume 65, Issue 1
    January 2022
    106 pages
    ISSN:0001-0782
    EISSN:1557-7317
    DOI:10.1145/3507640
    Issue’s Table of Contents
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 December 2021
    Published in CACM Volume 65, Issue 1

    Check for updates

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)19,812
    • Downloads (Last 6 weeks)1,990
    Reflects downloads up to 01 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)SRNeRF: Super-Resolution Neural Radiance Fields for Autonomous Driving Scenario Reconstruction from Sparse ViewsWorld Electric Vehicle Journal10.3390/wevj1602006616:2(66)Online publication date: 23-Jan-2025
    • (2025)Novel View Synthesis with Depth Priors Using Neural Radiance Fields and CycleGAN with Attention TransformerSymmetry10.3390/sym1701005917:1(59)Online publication date: 1-Jan-2025
    • (2025)CDKD-w+: A Keyframe Recognition Method for Coronary Digital Subtraction Angiography Video Sequence Based on w+ Space EncodingSensors10.3390/s2503071025:3(710)Online publication date: 24-Jan-2025
    • (2025)Fusing LiDAR and Photogrammetry for Accurate 3D Data: A Hybrid ApproachRemote Sensing10.3390/rs1703044317:3(443)Online publication date: 28-Jan-2025
    • (2025)ThermalGS: Dynamic 3D Thermal Reconstruction with Gaussian SplattingRemote Sensing10.3390/rs1702033517:2(335)Online publication date: 19-Jan-2025
    • (2025)NeRF-Accelerated Ecological Monitoring in Mixed-Evergreen Redwood ForestForests10.3390/f1601017316:1(173)Online publication date: 17-Jan-2025
    • (2025)High-Fold 3D Gaussian Splatting Model Pruning Method Assisted by OpacityApplied Sciences10.3390/app1503153515:3(1535)Online publication date: 3-Feb-2025
    • (2025)Related Keyframe Optimization Gaussian–Simultaneous Localization and Mapping: A 3D Gaussian Splatting-Based Simultaneous Localization and Mapping with Related Keyframe OptimizationApplied Sciences10.3390/app1503132015:3(1320)Online publication date: 27-Jan-2025
    • (2025)Multi-Level Feature Dynamic Fusion Neural Radiance Fields for Audio-Driven Talking Head GenerationApplied Sciences10.3390/app1501047915:1(479)Online publication date: 6-Jan-2025
    • (2025)An end-to-end implicit neural representation architecture for medical volume dataPLOS ONE10.1371/journal.pone.031494420:1(e0314944)Online publication date: 3-Jan-2025
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Digital Edition

    View this article in digital edition.

    Digital Edition

    Magazine Site

    View this article on the magazine site (external)

    Magazine Site

    Login options

    Full Access

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media