
Learning-based view synthesis for light field cameras

Published: 05 December 2016

Abstract

With the introduction of consumer light field cameras, light field imaging has recently become widespread. However, there is an inherent trade-off between angular and spatial resolution, so these cameras often sample sparsely in either the spatial or the angular domain. In this paper, we use machine learning to mitigate this trade-off. Specifically, we propose a novel learning-based approach to synthesize new views from a sparse set of input views. We build upon existing view synthesis techniques and break the process down into disparity and color estimation components. We use two sequential convolutional neural networks to model these components and train both networks simultaneously by minimizing the error between the synthesized and ground truth images. We demonstrate the performance of our approach using only the four corner sub-aperture views from light fields captured by the Lytro Illum camera. Experimental results show that our approach synthesizes high-quality images superior to those of state-of-the-art techniques on a variety of challenging real-world scenes. We believe our method could decrease the required angular resolution of consumer light field cameras, allowing their spatial resolution to increase.
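The abstract outlines a two-stage pipeline: a first CNN estimates disparity at the novel view, the four corner sub-aperture images are warped by that disparity, and a second CNN predicts the final color, with both networks trained jointly against ground-truth views. Below is a minimal sketch of that structure. It is an illustration under stated assumptions, not the authors' implementation: the PyTorch framework, the layer sizes, and the helper names (DisparityNet, ColorNet, warp_fn, disp_features) are all assumptions introduced here.

```python
# A minimal sketch (assumptions: PyTorch, illustrative layer sizes) of the
# two-stage pipeline described in the abstract: a disparity CNN followed by
# a color CNN, trained jointly by minimizing synthesis error.
import torch
import torch.nn as nn

class DisparityNet(nn.Module):
    """Hypothetical disparity estimator: maps per-pixel disparity features
    (computed from the four corner views) to a disparity map."""
    def __init__(self, in_channels: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_channels, 100, 7, padding=3), nn.ReLU(),
            nn.Conv2d(100, 100, 5, padding=2), nn.ReLU(),
            nn.Conv2d(100, 50, 3, padding=1), nn.ReLU(),
            nn.Conv2d(50, 1, 1),  # one disparity value per pixel
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)

class ColorNet(nn.Module):
    """Hypothetical color estimator: fuses the four disparity-warped corner
    views into the final RGB image at the novel view."""
    def __init__(self, in_channels: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_channels, 100, 7, padding=3), nn.ReLU(),
            nn.Conv2d(100, 100, 5, padding=2), nn.ReLU(),
            nn.Conv2d(100, 3, 3, padding=1),  # RGB output
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)

def synthesize(disp_net, color_net, disp_features, corner_views, warp_fn):
    """End-to-end synthesis of one novel view.

    disp_features: (B, C, H, W) features for disparity estimation.
    corner_views:  (B, 4, 3, H, W) the four corner sub-aperture images.
    warp_fn:       differentiable warp of each corner view to the novel
                   view given a disparity map (e.g. via F.grid_sample);
                   its implementation is omitted in this sketch.
    """
    disparity = disp_net(disp_features)            # (B, 1, H, W)
    warped = warp_fn(corner_views, disparity)      # (B, 4, 3, H, W)
    b, n, c, h, w = warped.shape
    return color_net(warped.reshape(b, n * c, h, w))  # (B, 3, H, W)

# Joint training: gradients flow through the differentiable warp into the
# disparity network, so both CNNs are optimized against the same synthesis
# loss (the "error between the synthesized and ground truth images"), e.g.:
#   loss = ((synthesize(...) - ground_truth) ** 2).mean(); loss.backward()
```

Note the design point implied by the abstract's joint training: the warp between the two networks must be differentiable, since the disparity network receives its only supervision through the color loss.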

Supplementary Material

ZIP File (a193-kalantari.zip)
Supplemental file.



Published In

ACM Transactions on Graphics, Volume 35, Issue 6
November 2016
1045 pages
ISSN: 0730-0301
EISSN: 1557-7368
DOI: 10.1145/2980179
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 December 2016
Published in TOG Volume 35, Issue 6


Author Tags

  1. convolutional neural network
  2. disparity estimation
  3. light field
  4. view synthesis

Qualifiers

  • Research-article


Article Metrics

  • Downloads (last 12 months): 509
  • Downloads (last 6 weeks): 101
Reflects downloads up to 25 Dec 2024

Cited By
  • (2025) Disparity Enhancement-Based Light Field Angular Super-Resolution. IEEE Signal Processing Letters, 32, 81-85. DOI: 10.1109/LSP.2024.3496582. Online publication date: 2025.
  • (2025) Optimization of feature association strategies in multi-target tracking based on light field images. Measurement, 242, 116205. DOI: 10.1016/j.measurement.2024.116205. Online publication date: Jan-2025.
  • (2024) A Review of Deep Learning-Based Light Field Image Reconstruction and Enhancement (Invited). Laser & Optoelectronics Progress, 61:16, 1611015. DOI: 10.3788/LOP241404. Online publication date: 2024.
  • (2024) Metasurface Light Field Imaging: Research Status and Prospects (Invited). Laser & Optoelectronics Progress, 61:16, 1611007. DOI: 10.3788/LOP241399. Online publication date: 2024.
  • (2024) A Semi-supervised Angular Super-Resolution Method for Autostereoscopic 3D Surface Measurement. Optics Letters. DOI: 10.1364/OL.516099. Online publication date: 19-Jan-2024.
  • (2024) Efficient light field acquisition for integral imaging with adaptive viewport optimization. Optics Express, 32:18, 31280. DOI: 10.1364/OE.531264. Online publication date: 13-Aug-2024.
  • (2024) Learning-based light field imaging: an overview. Journal on Image and Video Processing, 2024:1. DOI: 10.1186/s13640-024-00628-1. Online publication date: 30-May-2024.
  • (2024) DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays. ACM Transactions on Graphics, 43:6, 1-19. DOI: 10.1145/3687897. Online publication date: 19-Dec-2024.
  • (2024) Learning to Handle Large Obstructions in Video Frame Interpolation. Proceedings of the 32nd ACM International Conference on Multimedia, 5221-5229. DOI: 10.1145/3664647.3681006. Online publication date: 28-Oct-2024.
  • (2024) More Realistic 3D Environment Reconstruction from Scanned Data Based on Multi-process Technologies. Proceedings of the International Conference on Computer Vision and Deep Learning, 1-5. DOI: 10.1145/3653781.3653799. Online publication date: 19-Jan-2024.
  • Show More Cited By
