research-article

Public Access

Transform recipes for efficient cloud photo enhancement

Authors:

Michaël Gharbi,

Gaurav Chaurasia,

Jonathan Ragan-Kelley,

Frédo DurandAuthors Info & Claims

ACM Transactions on Graphics (TOG), Volume 34, Issue 6

Article No.: 228, Pages 1 - 12

https://doi.org/10.1145/2816795.2818127

Published: 02 November 2015 Publication History

Abstract

Cloud image processing is often proposed as a solution to the limited computing power and battery life of mobile devices: it allows complex algorithms to run on powerful servers with virtually unlimited energy supply. Unfortunately, this overlooks the time and energy cost of uploading the input and downloading the output images. When transfer overhead is accounted for, processing images on a remote server becomes less attractive and many applications do not benefit from cloud offloading. We aim to change this in the case of image enhancements that preserve the overall content of an image. Our key insight is that, in this case, the server can compute and transmit a description of the transformation from input to output, which we call a transform recipe. At equivalent quality, our recipes are much more compact than JPEG images: this reduces the client's download. Furthermore, recipes can be computed from highly compressed inputs which significantly reduces the data uploaded to the server. The client reconstructs a high-fidelity approximation of the output by applying the recipe to its local high-quality input. We demonstrate our results on 168 images and 10 image processing applications, showing that our recipes form a compact representation for a diverse set of image filters. With an equivalent transmission budget, they provide higher-quality results than JPEG-compressed input/output images, with a gain of the order of 10 dB in many cases. We demonstrate the utility of recipes on a mobile phone by profiling the energy consumption and latency for both local and cloud computation: a transform recipe-based pipeline runs 2--4x faster and uses 2--7x less energy than local or naive cloud computation.

Supplementary Material

ZIP File (a228-gharbi.zip)

Supplemental files.

Download
223.69 KB

References

[1]

Aubry, M., Paris, S., Hasinoff, S. W., Kautz, J., and Durand, F. 2014. Fast local laplacian filters: Theory and applications. ACM Trans. Graph. 33, 5 (Sept.), 167:1--167:14.

Digital Library

[2]

Barr, K. C., and Asanović, K. 2006. Energy-aware lossless data compression. ACM Trans. Comput. Syst. 24, 3 (Aug.), 250--291.

Digital Library

[3]

Berthouzoz, F., Li, W., Dontcheva, M., and Agrawala, M. 2011. A framework for content-adaptive photo manipulation macros: Application to face, landscape, and global manipulations. ACM Transactions on Graphics 30, 5.

Digital Library

[4]

Burt, P. J., and Adelson, E. H. 1983. The laplacian pyramid as a compact image code. IEEE Transactions on Communications 31, 4, 532--540.

[5]

Bychkovsky, V., Paris, S., Chan, E., and Durand, F. 2011. Learning photographic global tonal adjustment with a database of input / output image pairs. In IEEE Conference on Computer Vision and Pattern Recognition.

Digital Library

[6]

Chen, Q., Li, D., and Tang, C.-K. 2012. Knn matting. In IEEE Conference on Computer Vision and Pattern Recognition, 869--876.

Digital Library

[7]

Deutsch, P., 1996. Deflate compressed data format specification version 1.3.

[8]

Farbman, Z., Fattal, R., Lischinski, D., and Szeliski, R. 2008. Edge-preserving decompositions for multi-scale tone and detail manipulation. In ACM Transaction on Graphics (SIGGRAPH), ACM, New York, NY, USA, SIGGRAPH '08, 67:1--67:10.

Digital Library

[9]

Farbman, Z., Fattal, R., and Lischinski, D. 2011. Convolution pyramids. ACM Transactions on Graphics (Proc. of SIGGRAPH Asia) 30, 6.

Digital Library

[10]

Freeman, W. T., and Torralba, A. 2002. Shape recipes: Scene representations that refer to the image. In Vision Sciences Society Annual Meeting, MIT Press, 25--47.

[11]

Hamilton, E. 1992. Jpeg file interchange format. C-Cube Microsystems.

[12]

Heeger, D. J., and Bergen, J. R. 1995. Pyramid-based texture analysis/synthesis. In Proceedings of the 22Nd Annual Conference on Computer Graphics and Interactive Techniques, ACM, New York, NY, USA, SIGGRAPH '95, 229--238.

Digital Library

[13]

Huang, J., Qian, F., Gerber, A., Mao, Z. M., Sen, S., and Spatscheck, O., 2012. A close examination of performance and power characteristics of 4g lte networks.

[14]

Huffman, D. 1952. A method for the construction of minimum-redundancy codes. Proceedings of the IRE 40, 9 (Sept), 1098--1101.

[15]

Jeong, W.-K., Johnson, M. K., Yu, I., Kautz, J., Pfister, H., and Paris, S. 2011. Display-aware image editing. In International Conference on Computational Photography.

[16]

Kaufman, L., Lischinski, D., and Werman, M. 2012. Content-aware automatic photo enhancement. Computer Graphics Forum 31, 8, 2528--2540.

Digital Library

[17]

Kim, J.-H., Jang, W.-D., Sim, J.-Y., and Kim, C.-S. 2013. Optimized contrast enhancement for real-time image and video dehazing. J. Vis. Comun. Image Represent. 24, 3 (Apr.), 410--425.

Digital Library

[18]

Kumar, K., Liu, J., Lu, Y.-H., and Bhargava, B. 2013. A survey of computation offloading for mobile systems. Mob. Netw. Appl. 18, 1 (Feb.), 129--140.

Digital Library

[19]

Laffont, P.-Y., Ren, Z., Tao, X., Qian, C., and Hays, J. 2014. Transient attributes for high-level understanding and editing of outdoor scenes. ACM Transaction on Graphics (SIGGRAPH) 33, 4 (July), 149:1--149:11.

Digital Library

[20]

Lee, K., Chu, D., Cuervo, E., Kopf, J., Grizan, S., Wolman, A., and Flinn, J. 2014. Outatime: Using speculation to enable low-latency continuous interaction for cloud gaming. Tech. Rep. MSR-TR-2014-115.

[21]

Levin, A., Lischinski, D., and Weiss, Y. 2004. Colorization using optimization. ACM Transaction on Graphics (SIGGRAPH) 23, 3 (Aug.), 689--694.

Digital Library

[22]

Levin, A., Lischinski, D., and Weiss, Y. 2008. A closed-form solution to natural image matting. IEEE Transactions on Pattern Analysis and Machine Intelligence 30, 2 (Feb), 228--242.

Digital Library

[23]

Levoy, M. 1995. Polygon-assisted jpeg and mpeg compression of synthetic images. In Computer Graphics and Interactive Techniques, ACM, New York, NY, USA, SIGGRAPH '95, 21--28.

Digital Library

[24]

LiKamWa, R., Priyantha, B., Philipose, M., Zhong, L., and Bahl, P. 2013. Energy characterization and optimization of image sensing toward continuous mobile vision. In Proc. of International Conference on Mobile Systems, Applications, and Services, ACM, 69--82.

Digital Library

[25]

Mantiuk, R., and Seidel, H.-P. 2008. Modeling a generic tone-mapping operator. Computer Graphics Forum (Proc. of Eurographics) 27, 2.

[26]

Paris, S., Hasinoff, S. W., and Kautz, J. 2011. Local laplacian filters: Edge-aware image processing with a laplacian pyramid. In ACM Transaction on Graphics (SIGGRAPH), ACM, New York, NY, USA, SIGGRAPH '11, 68:1--68:12.

Digital Library

[27]

Rabbani, M., and Jones, P. W. 1991. Digital Image Compression Techniques, 1st ed. Society of Photo-Optical Instrumentation Engineers (SPIE), Bellingham, WA, USA.

Digital Library

[28]

Ragan-Kelley, J., Adams, A., Paris, S., Levoy, M., Amarasinghe, S., and Durand, F. 2012. Decoupling algorithms from schedules for easy optimization of image processing pipelines. ACM Transactions on Graphics 31, 4 (July), 32:1--32:12.

Digital Library

[29]

Ragan-Kelley, J., Barnes, C., Adams, A., Paris, S., Durand, F., and Amarasinghe, S. 2013. Halide: A language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines. In Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation, ACM, New York, NY, USA, PLDI, 519--530.

Digital Library

[30]

Rhemann, C., Rother, C., Wang, J., Gelautz, M., Kohli, P., and Rott, P. 2009. A perceptually motivated online benchmark for image matting. In IEEE Conference on Computer Vision and Pattern Recognition, 1826--1833.

[31]

Shih, Y., Paris, S., Durand, F., and Freeman, W. T. 2013. Data-driven hallucination of different times of day from a single outdoor photo. ACM Transaction on Graphics (SIGGRAPH) 32, 6 (Nov.), 200:1--200:11.

Digital Library

[32]

Shih, Y., Paris, S., Barnes, C., Freeman, W. T., and Durand, F. 2014. Style transfer for headshot portraits. ACM Transaction on Graphics (SIGGRAPH) 33, 4 (July), 148:1--148:14.

Digital Library

[33]

Skodras, A., Christopoulos, C., and Ebrahimi, T. 2001. The jpeg 2000 still image compression standard. IEEE Signal Processing Magazine 18, 5, 36--58.

[34]

Tibshirani, R. 1994. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B 58, 267--288.

[35]

Torralba, A., and Freeman, W. 2003. Properties and applications of shape recipes. In IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, II-383-90 vol.2.

[36]

Wallace, G. 1992. The jpeg still picture compression standard. IEEE Transactions on Consumer Electronics 38, 1 (Feb), xviii--xxxiv.

Digital Library

[37]

Welch, T. 1984. A technique for high-performance data compression. Computer 17, 6 (June), 8--19.

Digital Library

[38]

Witten, I. H., Neal, R. M., and Cleary, J. G. 1987. Arithmetic coding for data compression. Communications of the ACM 30, 6, 520--540.

Digital Library

[39]

Xu, L., Lu, C., Xu, Y., and Jia, J. 2011. Image smoothing via l0 gradient minimization. ACM Transaction on Graphics (SIGGRAPH) 30, 6 (Dec.), 174:1--174:12.

Digital Library

[40]

Ziv, J., and Lempel, A. 1977. A universal algorithm for sequential data compression. IEEE Transactions on Information Theory 23, 3 (May), 337--343.

Digital Library

Cited By

Yang YQiu SZeng LPan Z(2024)Detail-preserving Joint Image UpsamplingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366524620:8(1-23)Online publication date: 13-Jun-2024
https://dl.acm.org/doi/10.1145/3665246
Yang YMa WGao WZeng LPan Z(2024)Hyperbolic tangent penalty function for edge-preserving image filteringDigital Signal Processing10.1016/j.dsp.2024.104591153(104591)Online publication date: Oct-2024
https://doi.org/10.1016/j.dsp.2024.104591
Song SZhong FWang TQin XTu C(2023)Guided Linear UpsamplingACM Transactions on Graphics10.1145/359245342:4(1-12)Online publication date: 26-Jul-2023
https://dl.acm.org/doi/10.1145/3592453
Show More Cited By

Index Terms

Transform recipes for efficient cloud photo enhancement
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations
  2. Computer graphics
    1. Image compression

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics

ACM Transactions on Graphics Volume 34, Issue 6

November 2015

944 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/2816795

Issue’s Table of Contents

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2015

Published in TOG Volume 34, Issue 6

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Qatar Computing Research Institute
Adobe
Stanford Pervasive Parallelism Lab
DARPA

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

25
Total Citations
View Citations
904
Total Downloads

Downloads (Last 12 months)74
Downloads (Last 6 weeks)15

Reflects downloads up to 10 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Yang YQiu SZeng LPan Z(2024)Detail-preserving Joint Image UpsamplingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366524620:8(1-23)Online publication date: 13-Jun-2024
https://dl.acm.org/doi/10.1145/3665246
Yang YMa WGao WZeng LPan Z(2024)Hyperbolic tangent penalty function for edge-preserving image filteringDigital Signal Processing10.1016/j.dsp.2024.104591153(104591)Online publication date: Oct-2024
https://doi.org/10.1016/j.dsp.2024.104591
Song SZhong FWang TQin XTu C(2023)Guided Linear UpsamplingACM Transactions on Graphics10.1145/359245342:4(1-12)Online publication date: 26-Jul-2023
https://dl.acm.org/doi/10.1145/3592453
Wang KGharbi MZhang HXia ZShechtman E(2023)Semi-Supervised Parametric Real-World Image Harmonization2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52729.2023.00574(5927-5936)Online publication date: Jun-2023
https://doi.org/10.1109/CVPR52729.2023.00574
Tsubokawa TTajima HMaeda YFukushima N(2023)Local look-up table upsampling for accelerating image processingMultimedia Tools and Applications10.1007/s11042-023-16405-783:9(26131-26158)Online publication date: 28-Aug-2023
https://doi.org/10.1007/s11042-023-16405-7
Ravikumar ASriraman H(2022)Acceleration of Image Processing and Computer Vision AlgorithmsHandbook of Research on Computer Vision and Image Processing in the Deep Learning Era10.4018/978-1-7998-8892-5.ch001(1-18)Online publication date: 21-Oct-2022
https://doi.org/10.4018/978-1-7998-8892-5.ch001
Zhao KLin CLiao KYang SZhao Y(2022)Revisiting Radial Distortion Rectification in Polar-Coordinates: A New and Efficient Learning PerspectiveIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2021.311929332:6(3552-3560)Online publication date: Jun-2022
https://doi.org/10.1109/TCSVT.2021.3119293
Lyu KPan SLi YZhang Z(2022)JSENetNeurocomputing10.1016/j.neucom.2021.12.071489:C(570-583)Online publication date: 22-Jun-2022
https://dl.acm.org/doi/10.1016/j.neucom.2021.12.071
Xue BRan SChen QJia RZhao BTang X(2022)DCCF: Deep Comprehensible Color Filter Learning Framework for High-Resolution Image HarmonizationComputer Vision – ECCV 202210.1007/978-3-031-20071-7_18(300-316)Online publication date: 23-Oct-2022
https://dl.acm.org/doi/10.1007/978-3-031-20071-7_18
Liu WZhang PLei YHuang XYang JNg M(2021)A Generalized Framework for Edge-preserving and Structure-preserving Image SmoothingIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2021.3097891(1-1)Online publication date: 2021
https://doi.org/10.1109/TPAMI.2021.3097891
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Media

Figures

Other

Tables

View Issue’s Table of Contents