Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Exposure: A White-Box Photo Post-Processing Framework

Published: 12 May 2018 Publication History

Abstract

Retouching can significantly elevate the visual appeal of photos, but many casual photographers lack the expertise to do this well. To address this problem, previous works have proposed automatic retouching systems based on supervised learning from paired training images acquired before and after manual editing. As it is difficult for users to acquire paired images that reflect their retouching preferences, we present in this article a deep learning approach that is instead trained on unpaired data, namely, a set of photographs that exhibits a retouching style the user likes, which is much easier to collect. Our system is formulated using deep convolutional neural networks that learn to apply different retouching operations on an input image. Network training with respect to various types of edits is enabled by modeling these retouching operations in a unified manner as resolution-independent differentiable filters. To apply the filters in a proper sequence and with suitable parameters, we employ a deep reinforcement learning approach that learns to make decisions on what action to take next, given the current state of the image. In contrast to many deep learning systems, ours provides users with an understandable solution in the form of conventional retouching edits rather than just a “black-box” result. Through quantitative comparisons and user studies, we show that this technique generates retouching results consistent with the provided photo set.

Supplementary Material

a26-hu-supp.pdf (hu.zip)
Supplemental movie and image files for, Exposure: A White-Box Photo Post-Processing Framework
MP4 File (tog37-2-a26-hu.mp4)

References

[1]
Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mané, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda Viégas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. Retrieved January 2017 from http://tensorflow.org/.
[2]
Martin Arjovsky, Soumith Chintala, and Léon Bottou. 2017. Wasserstein GAN. arXiv:1701.07875.
[3]
Vladimir Bychkovsky, Sylvain Paris, Eric Chan, and Frédo Durand. 2011. Learning photographic global tonal adjustment with a database of input/output image pairs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’11). 97--104.
[4]
Hsiang-Ting Chen, Li-Yi Wei, and Chun-Fa Chang. 2011. Nonlinear revision control for images. ACM Transactions on Graphics 30, 4, Article 105.
[5]
Hsiang-Ting Chen, Li-Yi Wei, Björn Hartmann, and Maneesh Agrawala. 2016. Data-driven adaptive history for image editing. In Proceedings of the 20th ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games. 103--111.
[6]
Kevin Dale, Micah K. Johnson, Kalyan Sunkavalli, Wojciech Matusik, and Hanspeter Pfister. 2009. Image restoration using online photo collections. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’09). 2217--2224.
[7]
Alexei A. Efros and Thomas K. Leung. 1999. Texture synthesis by non-parametric sampling. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’99), Vol. 2. 1033--1038.
[8]
Chen Fang, Zhe Lin, Radomir Mech, and Xiaohui Shen. 2014. Automatic image cropping using visual composition boundary simplicity and content preservation models. In Proceedings of the 22nd ACM International Conference on Multimedia. 1005--1008.
[9]
Hui Fang and Meng Zhang. 2017. Creatism: A deep-learning photographer capable of creating professional work. arXiv:1707.03491.
[10]
William T. Freeman, Thouis R. Jones, and Egon C. Pasztor. 2002. Example-based super-resolution. IEEE Computer Graphics and Applications 22, 2, 56--65.
[11]
Michaël Gharbi, Jiawen Chen, Jonathan T. Barron, Samuel W. Hasinoff, and Frédo Durand. 2017. Deep bilateral learning for real-time image enhancement. ACM Transactions on Graphics 36, 4, 118.
[12]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Proceedings of Advances in Neural Information Processing Systems (NIPS’14). 2672--2680.
[13]
Floraine Grabler, Maneesh Agrawala, Wilmot Li, Mira Dontcheva, and Takeo Igarashi. 2009. Generating photo manipulation tutorials by demonstration. ACM Transactions on Graphics 28, 66.
[14]
Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, and Aaron C. Courville. 2017. Improved training of Wasserstein GANs. In Proceedings of Advances in Neural Information Processing Systems (NIPS’17). 5769--5779.
[15]
Kan Guo, Dongqing Zou, and Xiaowu Chen. 2015. 3D mesh labeling via deep convolutional neural networks. ACM Transactions on Graphics 35, 1, 3.
[16]
Samuel W. Hasinoff, Dillon Sharlet, Ryan Geiss, Andrew Adams, Jonathan T. Barron, Florian Kainz, Jiawen Chen, and Marc Levoy. 2016. Burst photography for high dynamic range and low-light imaging on mobile cameras. ACM Transactions on Graphics 35, 6, 192.
[17]
Shi-Min Hu, Kun Xu, Li-Qian Ma, Bin Liu, Bi-Ye Jiang, and Jue Wang. 2013. Inverse image editing: Recovering a semantic editing history from a before-and-after image pair. ACM Transactions on Graphics 32, 6, 194:1--194:11.
[18]
Yuanming Hu, Hao He, Chenxi Xu, Baoyuan Wang, and Steve Lin. 2018. Exposure: A white-box photo post-processing framework (supplemental material). arXiv:1709.09602
[19]
Yuanming Hu, Baoyuan Wang, and Stephen Lin. 2017. FC4: Fully convolutional color constancy with confidence-weighted pooling. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17). 4085--4094.
[20]
Sung Ju Hwang, Ashish Kapoor, and Sing Bing Kang. 2012. Context-based automatic local image enhancement. In Proceedings of the European Conference on Computer Vision (ECCV’12). 569--582.
[21]
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. 2017. Image-to-image translation with conditional adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).
[22]
Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, and Jiwon Kim. 2017. Learning to discover cross-domain relations with generative adversarial networks. In Proceedings of the 34th International Conference on Machine Learning (ICML’17), Vol. 70. 1857--1865.
[23]
Diederik Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations (ICLR’15).
[24]
Diederik Kingma and Max Welling. 2014. Auto-encoding variational Bayes. In Proceedings of the International Conference on Learning Representations (ICLR’14).
[25]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Proceedings of Advances in Neural Information Processing Systems (NIPS’12).
[26]
Gustav Larsson, Michael Maire, and Gregory Shakhnarovich. 2016. Learning representations for automatic colorization. In Proceedings of the European Conference on Computer Vision. 577--593.
[27]
Christian Ledig, Lucas Theis, Ferenc Huszar, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, and Wenzhe Shi. 2016. Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of Advances in Neural Information Processing Systems (NIPS’16).
[28]
Joon-Young Lee, Kalyan Sunkavalli, Zhe Lin, Xiaohui Shen, and In So Kweon. 2016. Automatic content-aware color and tone stylization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). 2470--2478.
[29]
Chuan Li and Michael Wand. 2016. Precomputed real-time texture synthesis with Markovian generative adversarial networks. In Proceedings of the European Conference on Computer Vision (ECCV’16).
[30]
Long-Ji Lin. 1993. Reinforcement Learning for Robots Using Neural Networks. Ph.D. Dissertation. Fujitsu Laboratories Ltd.
[31]
Ming-Yu Liu and Oncel Tuzel. 2016. Coupled generative adversarial networks. In Proceedings of Advances in Neural Information Processing Systems (NIPS’16). 469--477.
[32]
Ziwei Liu, Lu Yuan, Xiaoou Tang, Matt Uyttendaele, and Jian Sun. 2014. Fast burst images denoising. ACM Transactions on Graphics 33, 6, 232.
[33]
Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv:1411.1784.
[34]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing Atari with deep reinforcement learning. In Proceedings of the NIPS Deep Learning Workshop.
[35]
Deepak Pathak, Philipp Krahenbuhl, Jeff Donahue, Trevor Darrell, and Alexei A. Efros. 2016. Context encoders: Feature learning by inpainting. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16).
[36]
Xue Bin Peng, Glen Berseth, and Michiel Van de Panne. 2015. Dynamic terrain traversal skills using reinforcement learning. ACM Transactions on Graphics 34, 4, 80.
[37]
Xue Bin Peng, Glen Berseth, and Michiel Van de Panne. 2016. Terrain-adaptive locomotion skills using deep reinforcement learning. ACM Transactions on Graphics 35, 4, 81.
[38]
Xue Bin Peng, Glen Berseth, KangKang Yin, and Michiel Van De Panne. 2017. Deeploco: Dynamic locomotion skills using hierarchical deep reinforcement learning. ACM Transactions on Graphics 36, 4, 41.
[39]
Xue Bin Peng and Michiel van de Panne. 2017. Learning locomotion skills using DeepRL: Does the choice of action space matter? In Proceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation. ACM, New York, NY, 12.
[40]
Alec Radford, Luke Metz, and Soumith Chintala. 2016. Unsupervised representation learning with deep convolutional generative adversarial networks. In Proceedings of the International Conference on Learning Representations.
[41]
Danilo Jimenez Rezende, Shakir Mohamed, and Daan Wierstra. 2014. Stochastic backpropagation and approximate inference in deep generative models. In Proceedings of the International Conference on Machine Learning.
[42]
Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Josh Susskind, Wenda Wang, and Russ Webb. 2017. Learning from simulated and unsupervised images through adversarial training. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17), Vol. 3. 6.
[43]
David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. Nature 529, 7587, 484--489.
[44]
David Silver, Guy Lever, Nicolas Heess, Thomas Degris, Daan Wierstra, and Martin Riedmiller. 2014. Deterministic policy gradient algorithms. In Proceedings of the 31st International Conference on Machine Learning (ICML’14). 387--395.
[45]
Richard S. Sutton, David A. McAllester, Satinder P. Singh, and Yishay Mansour. 2000. Policy gradient methods for reinforcement learning with function approximation. In Proceedings of Advances in Neural Information Processing Systems (NIPS’00). 1057--1063.
[46]
Hsiao-Yu Fish Tung, Adam W. Harley, William Seto, and Katerina Fragkiadaki. 2017. Adversarial inverse graphics networks: Learning 2D-to-3D lifting and image-to-image translation from unpaired supervision. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17), Vol. 2.
[47]
Baoyuan Wang, Yizhou Yu, and Ying-Qing Xu. 2011. Example-based image color and tone style enhancement. ACM Transactions on Graphics 30, 64.
[48]
Xiaolong Wang and Abhinav Gupta. 2016. Generative image modeling using style and structure adversarial networks. In Proceedings of the European Conference on Computer Vision.
[49]
Xian Wu, Kun Xu, and Peter Hall. 2017. A survey of image synthesis and editing with generative adversarial networks. Tsinghua Science and Technology 22, 6, 660--674.
[50]
Jianzhou Yan, Stephen Lin, Sing Bing Kang, and Xiaoou Tang. 2014. A learning-to-rank approach for image color enhancement. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’14). 2987--2994.
[51]
Jianzhou Yan, Stephen Lin, Sing Bing Kang, and Xiaoou Tang. 2015. Change-based image cropping with exclusion and compositional features. International Journal of Computer Vision 114, 1, 74--87.
[52]
Zhicheng Yan, Hao Zhang, Baoyuan Wang, Sylvain Paris, and Yizhou Yu. 2016. Automatic photo adjustment using deep neural networks. ACM Transactions on Graphics 35, 2, 11.
[53]
Lantao Yu, Weinan Zhang, Jun Wang, and Yong Yu. 2017. SeqGAN: Sequence generative adversarial nets with policy gradient. In Proceedings of the AAAI Conference. 2852--2858.
[54]
Richard Zhang, Phillip Isola, and Alexei A. Efros. 2016. Colorful image colorization. In Proceedings of the European Conference on Computer Vision. 649--666.
[55]
Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros. 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17).

Cited By

View all
  • (2024)The art of photography in China: some observations on its aesthetic connotationTrans/Form/Ação10.1590/0101-3173.2024.v47.n5.e0240015347:5Online publication date: 2024
  • (2024)Blasthole Detection on the Tunnel Face under Low-Light EnvironmentsTransportation Research Record: Journal of the Transportation Research Board10.1177/03611981241252765Online publication date: 17-Jun-2024
  • (2024)A Model Based on Universal Filters for Image Color CorrectionPattern Recognition and Image Analysis10.1134/S105466182470073134:3(844-854)Online publication date: 17-Oct-2024
  • Show More Cited By

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics
ACM Transactions on Graphics  Volume 37, Issue 2
April 2018
244 pages
ISSN:0730-0301
EISSN:1557-7368
DOI:10.1145/3191713
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 May 2018
Accepted: 01 February 2018
Revised: 01 February 2018
Received: 01 September 2017
Published in TOG Volume 37, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Reinforcement learning (RL)
  2. generative adversarial networks (GANs)

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)166
  • Downloads (Last 6 weeks)27
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)The art of photography in China: some observations on its aesthetic connotationTrans/Form/Ação10.1590/0101-3173.2024.v47.n5.e0240015347:5Online publication date: 2024
  • (2024)Blasthole Detection on the Tunnel Face under Low-Light EnvironmentsTransportation Research Record: Journal of the Transportation Research Board10.1177/03611981241252765Online publication date: 17-Jun-2024
  • (2024)A Model Based on Universal Filters for Image Color CorrectionPattern Recognition and Image Analysis10.1134/S105466182470073134:3(844-854)Online publication date: 17-Oct-2024
  • (2024)Low-light image enhancement via multistage Laplacian feature fusionJournal of Electronic Imaging10.1117/1.JEI.33.2.02302033:02Online publication date: 1-Mar-2024
  • (2024)Learning Deep Representations for Photo RetouchingIEEE Transactions on Multimedia10.1109/TMM.2023.330790326(3153-3163)Online publication date: 2024
  • (2024)MDD-ShipNet: Math-Data Integrated Defogging for Fog-Occlusion Ship DetectionIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.339457325:10(15040-15052)Online publication date: Oct-2024
  • (2024)Vehicle-Mounted Adaptive Traffic Sign Detector for Small-Sized Signs in Multiple Working ConditionsIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.330964425:1(710-724)Online publication date: Jan-2024
  • (2024)Learning Controllable ISP for Image EnhancementIEEE Transactions on Image Processing10.1109/TIP.2023.330581633(867-880)Online publication date: 1-Jan-2024
  • (2024)Specific Instance and Cross-Prompt-Based Robust 3-D Semi-Supervised Medical Image SegmentationIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2024.343612473(1-14)Online publication date: 2024
  • (2024)A Robust Real-Time Anchor-Free Traffic Sign Detector With One-Level FeatureIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2024.33494648:2(1437-1451)Online publication date: Apr-2024
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media