Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Open access

Real-time Global Illumination Decomposition of Videos

Published: 10 August 2021 Publication History

Abstract

We propose the first approach for the decomposition of a monocular color video into direct and indirect illumination components in real time. We retrieve, in separate layers, the contribution made to the scene appearance by the scene reflectance, the light sources, and the reflections from various coherent scene regions to one another. Existing techniques that invert global light transport require image capture under multiplexed controlled lighting or only enable the decomposition of a single image at slow off-line frame rates. In contrast, our approach works for regular videos and produces temporally coherent decomposition layers at real-time frame rates. At the core of our approach are several sparsity priors that enable the estimation of the per-pixel direct and indirect illumination layers based on a small set of jointly estimated base reflectance colors. The resulting variational decomposition problem uses a new formulation based on sparse and dense sets of non-linear equations that we solve efficiently using a novel alternating data-parallel optimization strategy. We evaluate our approach qualitatively and quantitatively and show improvements over the state-of-the-art in this field, in both quality and runtime. In addition, we demonstrate various real-time appearance editing applications for videos with consistent illumination.

Supplementary Material

meka (meka.zip)
Supplemental movie, appendix, image and software files for, Real-time Global Illumination Decomposition of Videos

References

[1]
Yağız Aksoy, Tunç Ozan Aydın, Marc Pollefeys, and Aljoša Smolić. 2016. Interactive high-quality green-screen keying via color unmixing. ACM Trans. Graph. 35, 5 (Aug. 2016), 152:1–12.
[2]
Yağız Aksoy, Tunç Ozan Aydın, Aljoša Smolić, and Marc Pollefeys. 2017. Unmixing-based soft color segmentation for image manipulation. ACM Trans. Graph. 36, 2 (Mar. 2017), 19:1–19.
[3]
Anna Alperovich and Bastian Goldluecke. 2017. A variational model for intrinsic light field decomposition. In ACCV. 66–82.
[4]
Dejan Azinović, Tzu-Mao Li, Anton Kaplanyan, and Matthias Nießner. 2019. Inverse path tracing for joint material and lighting estimation. In CVPR. 2442–2451.
[5]
Francis Bach, Rodolphe Jenatton, Julien Mairal, and Guillaume Obozinski. 2012. Optimization with sparsity-inducing penalties. Found. Trends Mach. Learn. 4, 1 (2012), 1–106.
[6]
Jonathan T. Barron and Jitendra Malik. 2015. Shape, illumination, and reflectance from shading. IEEE Trans. Pattern Anal. Mach. Intell. 37, 8 (Aug. 2015), 1670–1687.
[7]
Harry G. Barrow and Jay M. Tenenbaum. 1978. Recovering intrinsic scene characteristics from images. In ICVS. 3–26.
[8]
Shida Beigpour and Joost van de Weijer. 2011. Object recoloring based on intrinsic image estimation. In ICCV. 327–334.
[9]
Sean Bell, Kavita Bala, and Noah Snavely. 2014. Intrinsic images in the wild. ACM Trans. Graph. 33, 4 (July 2014), 159:1–12.
[10]
Sai Bi, Xiaoguang Han, and Yizhou Yu. 2015. An image transform for edge-preserving smoothing and scene-level intrinsic decomposition. ACM Trans. Graph. 34, 4 (July 2015), 78:1–12.
[11]
Sai Bi, Nima Khademi Kalantari, and Ravi Ramamoorthi. 2018. Deep hybrid real and synthetic training for intrinsic decomposition. In Eurographics Symposium on Rendering—Experimental Ideas & Implementations.
[12]
Nicolas Bonneel, Balazs Kovacs, Sylvain Paris, and Kavita Bala. 2017. Intrinsic decompositions for image editing. Comput. Graph. Forum 36, 2 (May 2017), 593–609.
[13]
Nicolas Bonneel, Kalyan Sunkavalli, James Tompkin, Deqing Sun, Sylvain Paris, and Hanspeter Pfister. 2014. Interactive intrinsic video editing. ACM Trans. Graph. 33, 6 (Nov. 2014), 197:1–10.
[14]
Nicolas Bonneel, James Tompkin, Kalyan Sunkavalli, Deqing Sun, Sylvain Paris, and Hanspeter Pfister. 2015. Blind video temporal consistency. ACM Trans. Graph. 34, 6 (Nov. 2015), 196:1–9.
[15]
Adrien Bousseau, Sylvain Paris, and Frédo Durand. 2009. User-assisted intrinsic images. ACM Trans. Graph. 28, 5 (Dec. 2009), 130:1–10.
[16]
Robert Carroll, Ravi Ramamoorthi, and Maneesh Agrawala. 2011. Illumination decomposition for material recoloring with consistent interreflections. ACM Trans. Graph. 30, 4 (July 2011), 43:1–10.
[17]
Jason Chang, Randi Cabezas, and John W. Fisher, III. 2014. Bayesian nonparametric intrinsic image decomposition. In ECCV, Vol. 8692. 704–719.
[18]
Zachary DeVito, Michael Mara, Michael Zollhöfer, Gilbert Bernstein, Jonathan Ragan-Kelley, Christian Theobalt, Pat Hanrahan, Matthew Fisher, and Matthias Nießner. 2017. Opt: A domain specific language for non-linear least squares optimization in graphics and imaging. ACM Trans. Graph. 36, 5 (Oct. 2017), 171:1–27.
[19]
Shouhong Ding, Bin Sheng, Xiaonan Hou, Zhifeng Xie, and Lizhuang Ma. 2017. Intrinsic image decomposition using multi-scale measurements and sparsity. Comput. Graph. Forum 36, 6 (2017), 251–261.
[20]
Bo Dong, Yue Dong, Xin Tong, and Pieter Peers. 2015. Measurement-based editing of diffuse albedo with consistent interreflections. ACM Trans. Graph. 34, 4 (July 2015), 112:1–11.
[21]
Yue Dong, Guojun Chen, Pieter Peers, Jiawan Zhang, and Xin Tong. 2014. Appearance-from-motion: Recovering spatially varying surface reflectance under unknown lighting. ACM Trans. Graph. 33, 6 (Nov. 2014), 193:1–12.
[22]
Jean-Dominique Favreau, Florent Lafarge, and Adrien Bousseau. 2017. Photo2ClipArt: Image abstraction and vectorization using layered linear gradients. ACM Trans. Graph. 36, 6 (Nov. 2017), 180:1–11.
[23]
Elena Garces, Jose I. Echevarria, Wen Zhang, Hongzhi Wu, Kun Zhou, and Diego Gutierrez. 2017. Intrinsic light field images. Comput. Graph. Forum 36, 8 (Dec. 2017), 589–599.
[24]
Elena Garces, Adolfo Muñoz, Jorge Lopez-Moreno, and Diego Gutierrez. 2012. Intrinsic images by clustering. Comput. Graph. Forum 31, 4 (2012), 1415–1424.
[25]
Stamatios Georgoulis, Konstantinos Rematas, Tobias Ritschel, Efstratios Gavves, Mario Fritz, Luc Van Gool, and Tinne Tuytelaars. 2018. Reflectance and natural illumination from single-material specular objects using deep learning. IEEE Trans. Pattern Anal. Mach. Intell. 40, 8 (Aug. 2018), 1932–1947.
[26]
Roger Grosse, Micah K. Johnson, Edward H. Adelson, and William T. Freeman. 2009. Ground truth dataset and baseline evaluations for intrinsic image algorithms. In ICCV. 2335–2342.
[27]
Kaiwen Guo, Feng Xu, Tao Yu, Xiaoyang Liu, Qionghai Dai, and Yebin Liu. 2017. Real-time geometry, albedo and motion reconstruction using a single RGBD camera. ACM Trans. Graph. 36, 3 (June 2017), 32:1–13.
[28]
Paul W. Holland and Roy E. Welsch. 1977. Robust regression using iteratively reweighted least-squares. Commun. Statist.-Theor. Meth. 6, 9 (Sept. 1977), 813–827.
[29]
Carlo Innamorati, Tobias Ritschel, Tim Weyrich, and Niloy J. Mitra. 2017. Decomposing single images for layered photo retouching. Comput. Graph. Forum 36, 4 (July 2017), 15–25.
[30]
Michael Janner, Jiajun Wu, Tejas D. Kulkarni, Ilker Yildirim, and Joshua B. Tenenbaum. 2017. Self-supervised intrinsic image decomposition. In NIPS. Retrieved from http://rin.csail.mit.edu/
[31]
James T. Kajiya. 1986. The rendering equation. Comput. Graph. 20, 4 (Aug. 1986), 143–150.
[32]
Seungryong Kim, Kihong Park, Kwanghoon Sohn, and Stephen Lin. 2016. Unified depth prediction and intrinsic image decomposition from a single image via joint convolutional neural fields. In ECCV. 143–159.
[33]
Naejin Kong, Peter V. Gehler, and Michael J. Black. 2014. Intrinsic video. In ECCV. 360–375.
[34]
Balazs Kovacs, Sean Bell, Noah Snavely, and Kavita Bala. 2017. Shading annotations in the wild. In CVPR. 850–859.
[35]
Pierre-Yves Laffont, Adrien Bousseau, Sylvain Paris, Frédo Durand, and George Drettakis. 2012. Coherent intrinsic images from photo collections. ACM Trans. Graph. 31, 6 (Nov. 2012), 202:1–11.
[36]
Xiao Li, Yue Dong, Pieter Peers, and Xin Tong. 2017. Modeling surface appearance from a single photograph using self-augmented convolutional neural networks. ACM Trans. Graph. 36, 4 (July 2017), 45:1–11.
[37]
Zhengqin Li, Zexiang Xu, Ravi Ramamoorthi, Kalyan Sunkavalli, and Manmohan Chandraker. 2018. Learning to reconstruct shape and spatially-varying reflectance from a single image. ACM Trans. Graph. 37, 6 (Nov. 2018), 269:1–11.
[38]
Sharon Lin, Matthew Fisher, Angela Dai, and Pat Hanrahan. 2017. LayerBuilder: Layer Decomposition for Interactive Image and Video Color Editing. https://arxiv.org/abs/1701.03754(2017).
[39]
Guilin Liu, Duygu Ceylan, Ersin Yumer, Jimei Yang, and Jyh-Ming Lien. 2017. Material editing using a physically based rendering network. In ICCV. 2280–2288.
[40]
Stephen Lombardi and Ko Nishino. 2016. Reflectance and illumination recovery in the wild. IEEE Trans. Pattern Anal. Mach. Intell. 38, 1 (Jan. 2016), 129–141.
[41]
Stephen R. Marschner and Donald P. Greenberg. 1997. Inverse lighting for photography. In Proceedings of the IS&T Color Imaging Conference. 262–265.
[42]
Abhimitra Meka, Michael Zollhöfer, Christian Richardt, and Christian Theobalt. 2016. Live intrinsic video. ACM Trans. Graph. 35, 4 (July 2016), 109:1–14.
[43]
Giljoo Nam, Joo Ho Lee, Diego Gutierrez, and Min H. Kim. 2018. Practical SVBRDF acquisition of 3D objects with unstructured flash photography. ACM Trans. Graph. 37, 6 (Nov. 2018), 267:1–12.
[44]
Takuya Narihira, Michael Maire, and Stella X. Yu. 2015. Direct intrinsics: Learning albedo-shading decomposition by convolutional regression. In ICCV.
[45]
Shree K. Nayar, Gurunandan Krishnan, Michael D. Grossberg, and Ramesh Raskar. 2006. Fast separation of direct and global components of a scene using high frequency illumination. ACM Trans. Graph. 25, 3 (July 2006), 935–944.
[46]
Thomas Nestmeyer and Peter V. Gehler. 2017. Reflectance adaptive filtering improves intrinsic image estimation. In CVPR. 1771–1780.
[47]
Matthew O’Toole, John Mather, and Kiriakos N. Kutulakos. 2016. 3D shape and indirect appearance by structured light transport. IEEE Trans. Pattern Anal. Mach. Intell. 38, 7 (July 2016), 1298–1312.
[48]
Gustavo Patow and Xavier Pueyo. 2003. A survey of inverse rendering problems. Comput. Graph. Forum 22, 4 (2003), 663–687.
[49]
Ravi Ramamoorthi and Pat Hanrahan. 2001. A signal-processing framework for inverse rendering. In SIGGRAPH. 117–128.
[50]
Peiran Ren, Yue Dong, Stephen Lin, Xin Tong, and Baining Guo. 2015. Image based relighting using neural networks. ACM Trans. Graph. 34, 4 (July 2015), 111:1–12.
[51]
Christian Richardt, Jorge Lopez-Moreno, Adrien Bousseau, Maneesh Agrawala, and George Drettakis. 2014. Vectorising bitmaps into semi-transparent gradient layers. Comput. Graph. Forum 33, 4 (June 2014), 11–19.
[52]
Christopher Schultz and Thorsten Hermes. 2006. Digital Keying Methods. TZI-Bericht 40. Technologie-Zentrum Informatik, Bremen University. Retrieved from http://www.tzi.de/fileadmin/resources/publikationen/tzi_berichte/TZI-Bericht-Nr._40.pdf
[53]
Steven M. Seitz, Yasuyuki Matsushita, and Kiriakos N. Kutulakos. 2005. A theory of inverse light transport. In ICCV, Vol. 2. 1440–1447.
[54]
Jianbing Shen, Xing Yan, Lin Chen, Hanqiu Sun, and Xuelong Li. 2014. Re-texturing by intrinsic video. Inf. Sci. 281 (Oct. 2014), 726–735.
[55]
Jianbing Shen, Xiaoshan Yang, Yunde Jia, and Xuelong Li. 2011. Intrinsic images using optimization. In CVPR. 3481–3487.
[56]
Jian Shi, Yue Dong, Hao Su, and Stella X. Yu. 2017. Learning non-Lambertian object intrinsics across ShapeNet categories. In CVPR. 5844–5853.
[57]
Jianchao Tan, Jose Echevarria, and Yotam Gingold. 2018. Efficient palette-based decomposition and recoloring of images via RGBXY-space geometry. ACM Trans. Graph. 37, 6 (Nov. 2018), 262:1–10.
[58]
Jianchao Tan, Jyh-Ming Lien, and Yotam Gingold. 2016. Decomposing images into layers via RGB-space geometry. ACM Trans. Graph. 36, 1 (Nov. 2016), 7:1–14.
[59]
Chenglei Wu, Michael Zollhöfer, Matthias Nießner, Marc Stamminger, Shahram Izadi, and Christian Theobalt. 2014. Real-time shading-based refinement for consumer depth cameras. ACM Trans. Graph. 33, 6 (Nov. 2014), 200:1–10.
[60]
Hongzhi Wu, Zhaotian Wang, and Kun Zhou. 2016. Simultaneous localization and appearance estimation with a consumer RGB-D camera. IEEE Trans. Vis. Comput. Graph. 22, 8 (Aug. 2016), 2012–2023.
[61]
Genzhi Ye, Elena Garces, Yebin Liu, Qionghai Dai, and Diego Gutierrez. 2014. Intrinsic video and applications. ACM Trans. Graph. 33, 4 (July 2014), 80:1–11.
[62]
Yizhou Yu, Paul Debevec, Jitendra Malik, and Tim Hawkins. 1999. Inverse global illumination: Recovering reflectance models of real scenes from photographs. In SIGGRAPH. 215–224.
[63]
Tinghui Zhou, Philipp Krähenbühl, and Alyosha Efros. 2015. Learning data-driven reflectance priors for intrinsic image decomposition. In ICCV. 3469–3477.
[64]
Michael Zollhöfer, Matthias Nießner, Shahram Izadi, Christoph Rhemann, Christopher Zach, Matthew Fisher, Chenglei Wu, Andrew Fitzgibbon, Charles Loop, Christian Theobalt, and Marc Stamminger. 2014. Real-time non-rigid reconstruction using an RGB-D camera. ACM Trans. Graph. 33, 4 (July 2014), 156:1–12.
[65]
Daniel Zoran, Phillip Isola, Dilip Krishnan, and William T. Freeman. 2015. Learning ordinal relationships for mid-level vision. In ICCV. 388–396.

Cited By

View all
  • (2024)Generative Portrait Shadow RemovalACM Transactions on Graphics10.1145/368790343:6(1-13)Online publication date: 19-Dec-2024
  • (2024)IntrinsicDiffusion: Joint Intrinsic Layers from Latent Diffusion ModelsACM SIGGRAPH 2024 Conference Papers10.1145/3641519.3657472(1-11)Online publication date: 13-Jul-2024
  • (2024)Intrinsic Omnidirectional Video Decomposition2024 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW)10.1109/VRW62533.2024.00165(737-738)Online publication date: 16-Mar-2024
  • Show More Cited By

Index Terms

  1. Real-time Global Illumination Decomposition of Videos

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Graphics
      ACM Transactions on Graphics  Volume 40, Issue 3
      June 2021
      264 pages
      ISSN:0730-0301
      EISSN:1557-7368
      DOI:10.1145/3463476
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 10 August 2021
      Accepted: 01 February 2021
      Revised: 01 December 2020
      Received: 01 June 2019
      Published in TOG Volume 40, Issue 3

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Illumination decomposition
      2. direct and indirect illumination
      3. real-time sparse-dense optimization

      Qualifiers

      • Research-article
      • Refereed

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)251
      • Downloads (Last 6 weeks)41
      Reflects downloads up to 08 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Generative Portrait Shadow RemovalACM Transactions on Graphics10.1145/368790343:6(1-13)Online publication date: 19-Dec-2024
      • (2024)IntrinsicDiffusion: Joint Intrinsic Layers from Latent Diffusion ModelsACM SIGGRAPH 2024 Conference Papers10.1145/3641519.3657472(1-11)Online publication date: 13-Jul-2024
      • (2024)Intrinsic Omnidirectional Video Decomposition2024 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW)10.1109/VRW62533.2024.00165(737-738)Online publication date: 16-Mar-2024
      • (2024)Intrinsic Omnidirectional Image Decomposition With Illumination Pre-ExtractionIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2024.336634330:7(4416-4428)Online publication date: 1-Jul-2024
      • (2024)CRefNet: Learning Consistent Reflectance Estimation With a Decoder-Sharing TransformerIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2023.333787030:9(6407-6420)Online publication date: Sep-2024
      • (2024)Color Illumination Image Separation from Single Complex Scene Image for Augmented Reality2024 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct)10.1109/ISMAR-Adjunct64951.2024.00111(401-402)Online publication date: 21-Oct-2024
      • (2023)Image vectorization and editing via linear gradient layer decompositionACM Transactions on Graphics10.1145/359212842:4(1-13)Online publication date: 26-Jul-2023
      • (2023)IBL‐NeRF: Image‐Based Lighting Formulation of Neural Radiance FieldsComputer Graphics Forum10.1111/cgf.1492942:7Online publication date: 30-Oct-2023
      • (2023)Neural Reconstruction of Relightable Human Model from Monocular Video2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00043(397-407)Online publication date: 1-Oct-2023
      • (2023)IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00038(339-151)Online publication date: 1-Oct-2023
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Login options

      Full Access

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media