research-article

Open access

NeRF: representing scenes as neural radiance fields for view synthesis

Authors:

Ben Mildenhall,

Pratul P. Srinivasan,

Matthew Tancik,

Jonathan T. Barron,

Ravi Ramamoorthi,

Ren Ng

Communications of the ACM, Volume 65, Issue 1

Pages 99 - 106

https://doi.org/10.1145/3503250

Published: 17 December 2021

Abstract

We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views. Our algorithm represents a scene using a fully connected (nonconvolutional) deep network, whose input is a single continuous 5D coordinate (spatial location (x, y, z) and viewing direction (θ, ϕ)) and whose output is the volume density and view-dependent emitted radiance at that spatial location. We synthesize views by querying 5D coordinates along camera rays and use classic volume rendering techniques to project the output colors and densities into an image. Because volume rendering is naturally differentiable, the only input required to optimize our representation is a set of images with known camera poses. We describe how to effectively optimize neural radiance fields to render photorealistic novel views of scenes with complicated geometry and appearance, and demonstrate results that outperform prior work on neural rendering and view synthesis.
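
The rendering step described in the abstract (querying densities and colors along a camera ray, then projecting them into a pixel) can be illustrated numerically. What follows is a minimal sketch, assuming the standard emission-absorption quadrature from classic volume rendering; the abstract itself does not state the formula. The function name `composite_ray` and its parameters are hypothetical, and in the actual method the densities and colors would be produced by the network rather than passed in by hand.

```python
import numpy as np

def composite_ray(sigmas, colors, t_vals):
    """Alpha-composite samples along one ray (illustrative sketch only).

    sigmas: (N,) non-negative volume densities at the sample points
    colors: (N, 3) RGB radiance emitted at the sample points
    t_vals: (N + 1,) increasing distances bounding the N sample intervals
    """
    sigmas = np.asarray(sigmas, dtype=float)
    colors = np.asarray(colors, dtype=float)
    deltas = np.diff(np.asarray(t_vals, dtype=float))  # interval lengths
    alphas = 1.0 - np.exp(-sigmas * deltas)            # opacity of each interval
    # Transmittance: fraction of light surviving all earlier intervals.
    trans = np.concatenate([[1.0], np.cumprod(1.0 - alphas)[:-1]])
    weights = trans * alphas                           # contribution of each sample
    rgb = (weights[:, None] * colors).sum(axis=0)      # expected pixel color
    return rgb, weights
```

Every operation here (exponential, cumulative product, weighted sum) is differentiable with respect to `sigmas` and `colors`, which is consistent with the abstract's point that images with known camera poses are the only supervision needed. A practical implementation would use an automatic-differentiation framework in place of NumPy.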

Index Terms

NeRF: representing scenes as neural radiance fields for view synthesis
1. Computing methodologies
  1. Computer graphics
    1. Image manipulation
      1. Image-based rendering

Published In

Communications of the ACM Volume 65, Issue 1

January 2022

106 pages

ISSN:0001-0782

EISSN:1557-7317

DOI:10.1145/3507640

Editor:
Andrew A. Chien
Association for Computing Machinery, New York, NY

Copyright © 2021 Owner/Author.

This work is licensed under a Creative Commons Attribution International 4.0 License.

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed

