Transform recipes for efficient cloud photo enhancement

Published: 02 November 2015

Abstract

Cloud image processing is often proposed as a solution to the limited computing power and battery life of mobile devices: it allows complex algorithms to run on powerful servers with a virtually unlimited energy supply. Unfortunately, this view overlooks the time and energy cost of uploading the input and downloading the output images. When transfer overhead is accounted for, processing images on a remote server becomes less attractive, and many applications do not benefit from cloud offloading. We aim to change this in the case of image enhancements that preserve the overall content of an image. Our key insight is that, in this case, the server can compute and transmit a description of the transformation from input to output, which we call a transform recipe. At equivalent quality, our recipes are much more compact than JPEG images, which reduces the client's download. Furthermore, recipes can be computed from highly compressed inputs, which significantly reduces the data uploaded to the server. The client reconstructs a high-fidelity approximation of the output by applying the recipe to its local high-quality input. We demonstrate our results on 168 images and 10 image processing applications, showing that our recipes form a compact representation for a diverse set of image filters. With an equivalent transmission budget, they provide higher-quality results than JPEG-compressed input/output images, with a gain on the order of 10 dB in many cases. We demonstrate the utility of recipes on a mobile phone by profiling the energy consumption and latency for both local and cloud computation: a transform recipe-based pipeline runs 2 to 4 times faster and uses 2 to 7 times less energy than local or naive cloud computation.
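To make the recipe idea concrete, the following is a minimal sketch, not the formulation used in the paper. It assumes a grayscale image stored as nested lists with values in [0, 1], and it assumes that a recipe is simply one affine pair (a, b) per block, fitted by least squares so that output ≈ a · input + b inside the block. The names `fit_block_recipe`, `apply_block_recipe`, `enhance`, and `degrade` are hypothetical; `enhance` stands in for an arbitrary server-side filter and `degrade` uses coarse quantization as a stand-in for heavy JPEG compression of the upload.

```python
import math


def fit_block_recipe(inp, out, block=4):
    """Server side: fit out ~= a * inp + b on each block (least squares)."""
    h, w = len(inp), len(inp[0])
    recipe = []
    for by in range(0, h, block):
        row = []
        for bx in range(0, w, block):
            coords = [(y, x)
                      for y in range(by, min(by + block, h))
                      for x in range(bx, min(bx + block, w))]
            xs = [inp[y][x] for y, x in coords]
            ys = [out[y][x] for y, x in coords]
            n = len(xs)
            mx, my = sum(xs) / n, sum(ys) / n
            var = sum((v - mx) ** 2 for v in xs)
            cov = sum((u - mx) * (v - my) for u, v in zip(xs, ys))
            # A flat block carries no slope information: fall back to an offset.
            a = cov / var if var > 1e-12 else 0.0
            row.append((a, my - a * mx))
        recipe.append(row)
    return recipe


def apply_block_recipe(inp, recipe, block=4):
    """Client side: apply the per-block affine pairs to the local input."""
    out = []
    for y, line in enumerate(inp):
        coeffs = recipe[y // block]
        out.append([coeffs[x // block][0] * v + coeffs[x // block][1]
                    for x, v in enumerate(line)])
    return out


def psnr(ref, test, peak=1.0):
    """Peak signal-to-noise ratio in dB between two same-sized images."""
    n = len(ref) * len(ref[0])
    mse = sum((r - t) ** 2
              for ref_row, test_row in zip(ref, test)
              for r, t in zip(ref_row, test_row)) / n
    return math.inf if mse == 0 else 10 * math.log10(peak ** 2 / mse)


def enhance(img):
    """Hypothetical server-side filter: a simple tone curve."""
    return [[v ** 0.8 for v in row] for row in img]


def degrade(img, levels=8):
    """Hypothetical stand-in for a heavily compressed upload: quantization."""
    return [[round(v * (levels - 1)) / (levels - 1) for v in row] for row in img]


if __name__ == "__main__":
    # Synthetic 16x16 test image; real photographs would be used in practice.
    hi = [[((7 * x + 13 * y) % 32) / 31 for x in range(16)] for y in range(16)]
    lo = degrade(hi)                            # what the client uploads
    recipe = fit_block_recipe(lo, enhance(lo))  # what the server sends back
    target = enhance(hi)                        # ideal full-quality result
    print("recipe PSNR (dB):", psnr(target, apply_block_recipe(hi, recipe)))
    print("naive PSNR (dB): ", psnr(target, enhance(lo)))
```

In this toy setting the recipe stores two coefficients per 4×4 block instead of sixteen output pixels, which illustrates why a description of the transformation can be smaller than the output image itself. The PSNR helper reports quality in the same unit (dB) the abstract uses for its comparisons; the numbers this sketch prints say nothing about the paper's measured results.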

Supplementary Material

ZIP File (a228-gharbi.zip)
Supplemental files.

Published In

ACM Transactions on Graphics, Volume 34, Issue 6
November 2015
944 pages
ISSN: 0730-0301
EISSN: 1557-7368
DOI: 10.1145/2816795
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2015
Published in TOG Volume 34, Issue 6

Author Tags

  1. energy-efficient cloud computing
  2. image filter approximation
  3. mobile image processing

Qualifiers

  • Research-article

Funding Sources

  • Qatar Computing Research Institute
  • Adobe
  • Stanford Pervasive Parallelism Lab
  • DARPA

Article Metrics

  • Downloads (last 12 months): 74
  • Downloads (last 6 weeks): 15

Reflects downloads up to 10 Oct 2024.

Cited By

  • (2024) Detail-preserving Joint Image Upsampling. ACM Transactions on Multimedia Computing, Communications, and Applications 20:8 (1-23). DOI: 10.1145/3665246. Online publication date: 13-Jun-2024
  • (2024) Hyperbolic tangent penalty function for edge-preserving image filtering. Digital Signal Processing 153 (104591). DOI: 10.1016/j.dsp.2024.104591. Online publication date: Oct-2024
  • (2023) Guided Linear Upsampling. ACM Transactions on Graphics 42:4 (1-12). DOI: 10.1145/3592453. Online publication date: 26-Jul-2023
  • (2023) Semi-Supervised Parametric Real-World Image Harmonization. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (5927-5936). DOI: 10.1109/CVPR52729.2023.00574. Online publication date: Jun-2023
  • (2023) Local look-up table upsampling for accelerating image processing. Multimedia Tools and Applications 83:9 (26131-26158). DOI: 10.1007/s11042-023-16405-7. Online publication date: 28-Aug-2023
  • (2022) Acceleration of Image Processing and Computer Vision Algorithms. Handbook of Research on Computer Vision and Image Processing in the Deep Learning Era (1-18). DOI: 10.4018/978-1-7998-8892-5.ch001. Online publication date: 21-Oct-2022
  • (2022) Revisiting Radial Distortion Rectification in Polar-Coordinates: A New and Efficient Learning Perspective. IEEE Transactions on Circuits and Systems for Video Technology 32:6 (3552-3560). DOI: 10.1109/TCSVT.2021.3119293. Online publication date: Jun-2022
  • (2022) JSENet. Neurocomputing 489:C (570-583). DOI: 10.1016/j.neucom.2021.12.071. Online publication date: 22-Jun-2022
  • (2022) DCCF: Deep Comprehensible Color Filter Learning Framework for High-Resolution Image Harmonization. Computer Vision – ECCV 2022 (300-316). DOI: 10.1007/978-3-031-20071-7_18. Online publication date: 23-Oct-2022
  • (2021) A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing. IEEE Transactions on Pattern Analysis and Machine Intelligence (1-1). DOI: 10.1109/TPAMI.2021.3097891. Online publication date: 2021
