Open access

CamP: Camera Preconditioning for Neural Radiance Fields

Published: 05 December 2023

Abstract

Neural Radiance Fields (NeRF) can be optimized to obtain high-fidelity 3D scene reconstructions of objects and large-scale scenes. However, NeRFs require accurate camera parameters as input --- inaccurate camera parameters result in blurry renderings. Extrinsic and intrinsic camera parameters are usually estimated using Structure-from-Motion (SfM) methods as a pre-processing step to NeRF, but these techniques rarely yield perfect estimates. Thus, prior works have proposed jointly optimizing camera parameters alongside a NeRF, but these methods are prone to local minima in challenging settings. In this work, we analyze how different camera parameterizations affect this joint optimization problem, and observe that standard parameterizations exhibit large differences in magnitude with respect to small perturbations, which can lead to an ill-conditioned optimization problem. We propose using a proxy problem to compute a whitening transform that eliminates the correlation between camera parameters and normalizes their effects, and we propose to use this transform as a preconditioner for the camera parameters during joint optimization. Our preconditioned camera optimization significantly improves reconstruction quality on scenes from the Mip-NeRF 360 dataset: we reduce error rates (RMSE) by 67% compared to state-of-the-art NeRF approaches that do not optimize for cameras like Zip-NeRF, and by 29% relative to state-of-the-art joint optimization approaches using the camera parameterization of SCNeRF. Our approach is easy to implement, does not significantly increase runtime, can be applied to a wide variety of camera parameterizations, and can straightforwardly be incorporated into other NeRF-like models.
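
To make the preconditioning idea concrete, the short JAX sketch below illustrates one way such a whitening preconditioner could be computed: differentiate a proxy projection of a set of 3D points with respect to the camera parameters, form the Jacobian's Gram matrix, and take its inverse matrix square root as the preconditioner. The pinhole proxy, the four-parameter camera layout, and the eps regularizer are illustrative assumptions made for this sketch, not the paper's exact formulation.

# Minimal sketch of whitening-based camera preconditioning, loosely following the
# abstract: compute the Jacobian of a proxy projection w.r.t. camera parameters,
# whiten it, and optimize in the preconditioned space. The proxy projection,
# parameter layout, and eps regularizer are illustrative assumptions.
import jax
import jax.numpy as jnp

def proxy_projection(cam_params, points):
    # Toy proxy: project 3D points with a pinhole camera.
    # cam_params = [tx, ty, tz, f] -- a translation plus focal length,
    # a hypothetical layout chosen only to keep the example small.
    t, f = cam_params[:3], cam_params[3]
    p = points + t                      # apply (toy) extrinsics
    return f * p[:, :2] / p[:, 2:3]     # perspective divide

def whitening_preconditioner(cam_params, points, eps=1e-6):
    # P = (J^T J + eps*I)^{-1/2}, where J is the Jacobian of the proxy
    # projection w.r.t. the camera parameters, flattened over all points.
    jac = jax.jacfwd(proxy_projection)(cam_params, points)   # (N, 2, D)
    J = jac.reshape(-1, cam_params.shape[0])                 # (2N, D)
    cov = J.T @ J + eps * jnp.eye(cam_params.shape[0])
    # Inverse matrix square root via eigendecomposition (cov is symmetric PSD).
    w, V = jnp.linalg.eigh(cov)
    return V @ jnp.diag(w ** -0.5) @ V.T

# Usage: optimize z instead of the raw camera parameters, mapping back
# via cam_params = cam0 + P @ z before casting rays.
points = jax.random.normal(jax.random.PRNGKey(0), (128, 3)) + jnp.array([0., 0., 4.])
cam0 = jnp.array([0., 0., 0., 500.])
P = whitening_preconditioner(cam0, points)
z = jnp.zeros_like(cam0)                # preconditioned camera residual
cam_params = cam0 + P @ z               # what the NeRF actually uses

In this preconditioned space, a unit step along any coordinate of z perturbs the projected points by a comparable amount, which is the normalization the abstract describes.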

References

[1]
Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, and Peter Hedman. 2022. Mip-NeRF 360: Unbounded anti-aliased neural radiance fields. CVPR (2022).
[2]
Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, and Peter Hedman. 2023. Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields. arXiv:2304.06706 (2023).
[3]
Sai Bi, Zexiang Xu, Pratul Srinivasan, Ben Mildenhall, Kalyan Sunkavalli, Miloš Hašan, Yannick Hold-Geoffroy, David Kriegman, and Ravi Ramamoorthi. 2020. Neural reflectance fields for appearance acquisition. arXiv:2008.03824 (2020).
[4]
Wenjing Bian, Zirui Wang, Kejie Li, Jia-Wang Bian, and Victor Adrian Prisacariu. 2022. NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior. arXiv:2212.07388 (2022).
[5]
Yu Chen and Gim Hee Lee. 2023. DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields. CVPR (2023).
[6]
Shin-Fang Chng, Sameera Ramasinghe, Jamie Sherrah, and Simon Lucey. 2022. GARF: Gaussian activated radiance fields for high fidelity reconstruction and pose estimation. arXiv (2022).
[7]
Alexander Eugen Conrady. 1919. Decentred lens-systems. Monthly Notices of the Royal Astronomical Society 79, 5 (1919), 384--390.
[8]
Christopher De Sa. 2019. Lecture notes in CS4787 Principles of Large-Scale Machine Learning: Lecture 8.
[9]
Michael D Grossberg and Shree K Nayar. 2005. The raxel imaging model and ray-based calibration. IJCV (2005).
[10]
Peter Hedman, Julien Philip, True Price, Jan-Michael Frahm, George Drettakis, and Gabriel Brostow. 2018. Deep blending for free-viewpoint image-based rendering. ACM TOG (2018).
[11]
Hwan Heo, Taekyung Kim, Jiyoung Lee, Jaewon Lee, Soohyun Kim, Hyunwoo J Kim, and Jin-Hwa Kim. 2023. Robust Camera Pose Refinement for Multi-Resolution Hash Encoding. arXiv:2302.01571 (2023).
[12]
Amir Hertz, Or Perel, Raja Giryes, Olga Sorkine-Hornung, and Daniel Cohen-Or. 2021. Sape: Spatially-adaptive progressive encoding for neural optimization. NeurIPS (2021).
[13]
Yoonwoo Jeong, Seokjun Ahn, Christopher Choy, Anima Anandkumar, Minsu Cho, and Jaesik Park. 2021. Self-calibrating neural radiance fields. ICCV (2021).
[14]
Agnan Kessy, Alex Lewin, and Korbinian Strimmer. 2018. Optimal whitening and decorrelation. The American Statistician (2018).
[15]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980 (2014).
[16]
Abhijit Kundu, Yin Li, and James M Rehg. 2018. 3D-RCNN: Instance-level 3d object reconstruction via render-and-compare. CVPR (2018).
[17]
Avanish Kushal and Sameer Agarwal. 2012. Visibility based preconditioning for bundle adjustment. CVPR (2012).
[18]
Yi Li, Gu Wang, Xiangyang Ji, Yu Xiang, and Dieter Fox. 2018. Deepim: Deep iterative matching for 6d pose estimation. ECCV (2018).
[19]
Zhengqi Li, Simon Niklaus, Noah Snavely, and Oliver Wang. 2020. Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes. arXiv:2011.13084 (2020).
[20]
Chen-Hsuan Lin, Wei-Chiu Ma, Antonio Torralba, and Simon Lucey. 2021. BARF: Bundle-adjusting neural radiance fields. ICCV (2021).
[21]
Shichen Liu, Tianye Li, Weikai Chen, and Hao Li. 2019. Soft rasterizer: A differentiable renderer for image-based 3d reasoning. ICCV (2019).
[22]
Matthew M Loper and Michael J Black. 2014. OpenDR: An approximate differentiable renderer. ECCV (2014).
[23]
Kevin M Lynch and Frank C Park. 2017. Modern Robotics. Cambridge University Press.
[24]
Ricardo Martin-Brualla, Noha Radwan, Mehdi S. M. Sajjadi, Jonathan T. Barron, Alexey Dosovitskiy, and Daniel Duckworth. 2021. NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections. CVPR (2021).
[25]
Luke Melas-Kyriazi, Christian Rupprecht, Iro Laina, and Andrea Vedaldi. 2023. RealFusion: 360° Reconstruction of Any Object from a Single Image. arXiv:2302.10663 (2023).
[26]
Quan Meng, Anpei Chen, Haimin Luo, Minye Wu, Hao Su, Lan Xu, Xuming He, and Jingyi Yu. 2021. GNeRF: GAN-based neural radiance field without posed camera. ICCV (2021).
[27]
Moustafa Meshry, Dan B Goldman, Sameh Khamis, Hugues Hoppe, Rohit Pandey, Noah Snavely, and Ricardo Martin-Brualla. 2019. Neural rerendering in the wild. CVPR (2019).
[28]
Andreas Meuleman, Yu-Lun Liu, Chen Gao, Jia-Bin Huang, Changil Kim, Min H Kim, and Johannes Kopf. 2023. Progressively Optimized Local Radiance Fields for Robust View Synthesis. CVPR (2023).
[29]
Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, and Ren Ng. 2020. NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. ECCV (2020).
[30]
Thomas Müller, Alex Evans, Christoph Schied, and Alexander Keller. 2022. Instant neural graphics primitives with a multiresolution hash encoding. ACM TOG (2022).
[31]
Keunhong Park, Utkarsh Sinha, Jonathan T. Barron, Sofien Bouaziz, Dan B Goldman, Steven M Seitz, and Ricardo Martin-Brualla. 2021. Nerfies: Deformable neural radiance fields. ICCV (2021).
[32]
Julien Philip and Valentin Deschaintre. 2023. Radiance Field Gradient Scaling for Unbiased Near-Camera Training. arXiv:2305.02756 (2023).
[33]
Georgy Ponimatkin, Yann Labbé, Bryan Russell, Mathieu Aubry, and Josef Sivic. 2022. Focal length and object pose estimation via render and compare. CVPR (2022).
[34]
Olinde Rodrigues. 1816. De l'attraction des sphéroïdes. Correspondance sur l'École Impériale Polytechnique (1816).
[35]
Antoni Rosinol, John J Leonard, and Luca Carlone. 2022. NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields. arXiv:2210.13641 (2022).
[36]
Johannes Lutz Schönberger and Jan-Michael Frahm. 2016. Structure-from-Motion Revisited. CVPR (2016).
[37]
Thomas Schops, Viktor Larsson, Marc Pollefeys, and Torsten Sattler. 2020. Why having 10,000 parameters in your camera model is better than twelve. CVPR (2020).
[38]
Richard Szeliski. 2022. Computer vision: algorithms and applications. Springer Nature.
[39]
Matthew Tancik, Vincent Casser, Xinchen Yan, Sabeek Pradhan, Ben Mildenhall, Pratul P. Srinivasan, Jonathan T. Barron, and Henrik Kretzschmar. 2022. Block-nerf: Scalable large scene neural view synthesis. CVPR (2022).
[40]
Justus Thies, Michael Zollhöfer, and Matthias Nießner. 2019. Deferred neural rendering: Image synthesis using neural textures. ACM TOG (2019).
[41]
Prune Truong, Marie-Julie Rakotosaona, Fabian Manhardt, and Federico Tombari. 2023. SPARF: Neural Radiance Fields from Sparse and Noisy Poses. CVPR (2023).
[42]
Qianqian Wang, Zhicheng Wang, Kyle Genova, Pratul P. Srinivasan, Howard Zhou, Jonathan T. Barron, Ricardo Martin-Brualla, Noah Snavely, and Thomas Funkhouser. 2021a. IBRNet: Learning multi-view image-based rendering. CVPR (2021).
[43]
Zirui Wang, Shangzhe Wu, Weidi Xie, Min Chen, and Victor Adrian Prisacariu. 2021b. NeRF--: Neural radiance fields without known camera parameters. arXiv:2102.07064 (2021).
[44]
Changchang Wu. 2014. Critical configurations for radial distortion self-calibration. CVPR (2014).
[45]
Yitong Xia, Hao Tang, Radu Timofte, and Luc Van Gool. 2022. Sinerf: Sinusoidal neural radiance fields for joint pose estimation and scene reconstruction. arXiv:2210.04553 (2022).
[46]
Alex Yu, Vickie Ye, Matthew Tancik, and Angjoo Kanazawa. 2021. pixelNeRF: Neural radiance fields from one or few images. CVPR (2021).
[47]
Qian-Yi Zhou and Vladlen Koltun. 2014. Color map optimization for 3d reconstruction with consumer depth cameras. ACM TOG (2014).
[48]
Yi Zhou, Connelly Barnes, Jingwan Lu, Jimei Yang, and Hao Li. 2019. On the continuity of rotation representations in neural networks. CVPR (2019).
[49]
Zihan Zhu, Songyou Peng, Viktor Larsson, Weiwei Xu, Hujun Bao, Zhaopeng Cui, Martin R Oswald, and Marc Pollefeys. 2022. NICE-SLAM: Neural implicit scalable encoding for SLAM. CVPR (2022).

Cited By

  • (2024) A Construct-Optimize Approach to Sparse View Synthesis without Camera Pose. ACM SIGGRAPH 2024 Conference Papers, 1-11. https://doi.org/10.1145/3641519.3657427. Online publication date: 13-Jul-2024.
  • (2024) Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids. ACM SIGGRAPH 2024 Conference Papers, 1-11. https://doi.org/10.1145/3641519.3657402. Online publication date: 13-Jul-2024.
  • (2024) CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses. IEEE Transactions on Multimedia 26, 9304-9315. https://doi.org/10.1109/TMM.2024.3388929. Online publication date: 2024.


Information & Contributors

Published In

ACM Transactions on Graphics  Volume 42, Issue 6
December 2023
1565 pages
ISSN:0730-0301
EISSN:1557-7368
DOI:10.1145/3632123

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 December 2023
Published in TOG Volume 42, Issue 6


Author Tags

  1. 3D synthesis
  2. camera optimization
  3. neural radiance fields
  4. neural rendering
  5. novel view synthesis

Qualifiers

  • Research-article


Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months): 579
  • Downloads (Last 6 weeks): 70
Reflects downloads up to 02 Sep 2024
