Abstract
In recent years, image enhancement based on deep network plays a vital role and has become the mainstream research. However, current approaches are generally limited to the manual embedding of auxiliary components (e.g., hyper-parameters, appended modules) to train the network; thus, they can often lack flexibility, adaptability, or even fail to achieve the optimal settings. Moreover, the straightforward learning-based architectures cannot adequately handle the complex latent image distributions in real-world scenarios. To partially address the above issues, in this work, a generic adaptive hyper-training scheme based on bilevel optimization is established. Specifically, we propose a completely new bilevel deep-unfolded strategy to collaboratively optimize the inner-level task-related hyper-guidance and the outer-level image reconstruction. The process can embed the differentiable proxy-based network with parameters to automatically learn the appended control mechanism. Instead of constructing the empirically manual interventions, our strategy can proactively learn to learn self-adaptive auxiliary modules. Extensive experiments demonstrate the superiority of our strategy to address different image enhancement tasks (i.e., image restoration, image rain removal and image haze removal) in terms of flexibility and effectiveness.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
It is known that the simple bi-level optimization is just a specific bilevel optimization problem with only one variable [35] ?,.
References
Simoes, M., Almeida, L.B., Bioucas-Dias, J., Chanussot, J.: A framework for fast image deconvolution with incomplete observations. IEEE Trans. Image Process. 25(11), 5266–5280 (2016)
Wang, Y., Yang, J., Yin, W., Zhang, Y.: A new alternating minimization algorithm for total variation image reconstruction. SIAM J. Imaging Sci. 1(3), 248–272 (2008)
Cheng, J., Gao, Y., Guo, B., Zuo, W.: Image restoration using spatially variant hyper-laplacian prior. Signal Image Video Process. 13(1), 155–162 (2019)
Liu, D., Wen, B., Fan, Y., Loy, C. C., Huang, T. S.: Non-local recurrent network for image restoration. In NeurIPS (2018)
Zhang, K., Zuo, W., Gu, S., Zhang, L.: Learning deep cnn denoiser prior for image restoration. In CVPR, (2017)
Tirer, T., Giryes, R.: Image restoration by iterative denoising and backward projections. IEEE Trans. Image Process. 28(3), 1220–1234 (2019)
Kruse, J., Rother, C., Schmidt, U.: Learning to push the limits of efficient fft-based image deconvolution. In ICCV, (2017)
Zhang, K., Zuo, W., Zhang, L.: Ffdnet: toward a fast and flexible solution for cnn-based image denoising. IEEE Trans. Image Process. 27(9), 4608–4622 (2018)
Liu, R., Jiang, Z., Fan, X., Luo, Z.: Knowledge-driven deep unrolling for robust image layer separation. IEEE Trans. Neural Netw. Learn. Syst. 31(5), 1653–1666 (2019)
Zhang, K., Gool, L. V., Timofte, R.: Deep unfolding network for image super-resolution. In CVPR (2020)
Liu, R., Cheng, S., Ma, L., Fan, X., Luo, Z.: Deep proximal unrolling: algorithmic framework, convergence analysis and applications. IEEE Trans. Image Process. 28(10), 5013–5026 (2019)
Liu, R., Lin, Z., Zhang, W., Su, Z.: Learning pdes for image restoration via optimal control. In ECCV, (2010)
Tai, X. C., Lie, K. A., Chan, T. F., Osher, S.: Image processing based on partial differential equations. In Proceedings of the International Conference on PDE-Based Image Processing and Related Inverse Problems. Springer Science and Business Media, Berlin (2006)
Liu, R., Cheng, S., He, Y., Fan, X., Lin, Z., Luo, Z.: On the convergence of learning-based iterative methods for nonconvex inverse problems. IEEE Trans. Pattern Anal. Mach. Intell. 42(12), 3027–3039 (2019)
Tao, X., Gao, H., Shen, X., Wang, J., Jia, J.: Scale-recurrent network for deep image deblurring. In CVPR, (2018)
Ren, W., Zhang, J., Ma, L., Pan, J., Cao, X., Zuo, W., Liu, W., Yang, M.H.: Deep non-blind econvolution via generalized low-rank approximation. In NeurIPS (2018)
Cai, J., Zuo, W., Zhang, L.: Extreme channel prior embedded network for dynamic scene deblurring. arXiv:1903.00763 (2019)
Zamir, S. W., Arora, A., Khan, S., Hayat, M., Khan, F. S., Yang, M. H., Shao, L.: Cycleisp: Real image restoration via improved data synthesis. arXiv:2003.07761, (2020)
Guo, Y., Chen, J., Wang, J., Chen, Q., Cao, J., Deng, Z., Xu, Y., Tan, M.: Closed-loop matters: Dual regression networks for single image super-resolution. arXiv:2003.07018 (2020)
Liu, R., Fan, X., Hou, M., Jiang, Z., Luo, Z., Zhang, L.: Learning aggregated transmission propagation networks for haze removal and beyond. IEEE Trans. Neural Netw. Learn. Syst. 30(10), 2973–2986 (2018)
Liu, R., Ma, L., Wang, Y., Zhang, L.: Learning converged propagations with deep prior ensemble for image enhancement. IEEE Trans. Image Process. 28(3), 1528–1543 (2018)
Liu, R., Cheng, S., Liu, X., Ma, L., Fan, X., Luo, Z.: A bridging framework for model optimization and deep propagation. In NeurIPS (2018)
Li, Y., Tan, R. T., Guo, X., Lu, J., Brown, M. S.: Rain streak removal using layer priors. In CVPR (2016)
Xiaojie Guo, Yu., Li, J.M., Ling, H.: Mutually guided image filtering. IEEE Trans. Neural Netw. Learn. Syst. 42(3), 694–707 (2020)
Guo, S., Yan, Z., Zhang, K., Zuo, W., Zhang, L.: Toward convolutional blind denoising of real photographs. In CVPR (2019)
Zhang, H., Patel, V. M.: Density-aware single image de-raining using a multi-stream dense network. In CVPR (2018)
Du, Y., Xu, J., Qiu, Q., Zhen, X., Zhang, L.: Variational image deraining. In WACV (2020)
Ren, D., Zuo, W., Hu, Q., Zhu, P., Meng, D.: Progressive image deraining networks: a better and simpler baseline. In CVPR (2019)
Liu, X., Ma, Y., Shi, Z., Chen, J.: Griddehazenet: attention-based multi-scale network for image dehazing. In ICCV (2019)
MacKay, M., Vicol, P., Lorraine, J., Duvenaud, D., Grosse, R.: Self-tuning networks: Bilevel optimization of hyperparameters using structured best-response functions. arXiv:1903.03088 (2019)
Dempe, S.: Foundations of Bilevel Programming. Springer Science and Business Media, Berlin (2002)
Dempe, S., Dutta, J., Mordukhovich, B.S.: New necessary optimality conditions in optimistic bilevel programming. Optimization 56(5–6), 577–604 (2007)
Kohli, B.: Optimality conditions for optimistic bilevel programming problem using convexifactors. J. Optim. Theory Appl. 152(3), 632–651 (2012)
Lampariello, L., Sagratella, S.: Numerically tractable optimistic bilevel problems. Comput. Optim. Appl. 76(2), 277–303 (2020)
Liu, R., Mu, P., Yuan, X., Zeng, S., Zhang, J.: A generic first-order algorithmic framework for bi-level programming beyond lower-level singleton. In ICML (2020)
Liu, R., Mu, P., Yuan, X., Zeng, S., Zhang, J.: A generic descent aggregation framework for gradient-based bi-level optimization. In ICML (2021)
Liu, R., Liu, X., Yuan, X., Zeng, S., Zhang, J.: A value-function-based interior-point method for non-convex bi-level optimization (2021)
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. JMLR 13(2), 281–305 (2012)
Swersky, K., Snoek, J., Adams, R. P.: Freeze-thaw bayesian optimization. arXiv:1406.3896 (2014)
Bottou, L.: Large-scale machine learning with stochastic gradient descent. In COMPSTAT (2010)
Kingma, D. P., Ba, J.: A method for stochastic optimization. arXiv:1412.6980, (2014)
Lorraine, J., Duvenaud, D.: Stochastic hyperparameter optimization through hypernetworks arXiv:1802.09419 (2018)
Franceschi, L., Frasconi, P., Salzo, S., Grazzi, R., Pontil, M.: Bilevel programming for hyperparameter optimization and meta-learning. In ICML (2018)
Fu, X., Huang, J., Zeng, D., Huang, Y., Ding, X., Paisley, J.: Removing rain from single images via a deep detail network. In CVPR (2017)
Fan, Z., Wu, H., Fu, X., Hunag, Y., Ding, X.: Residual-guide feature fusion network for single image deraining. arXiv:1804.07493 (2018)
Yang, W., Tan, R. T., Feng, J., Liu, J., Guo, Z., Yan, S.: Deep joint rain detection and removal from a single image. In CVPR (2017)
Li, X., Wu, J., Lin, Z., Liu, H., Zha, H.: Recurrent squeeze-and-excitation context aggregation net for single image deraining. In ECCV (2018)
Narasimhan, S.G., Nayar, S.K: Chromatic framework for vision in bad weather. In CVPR (2000)
He, K., Sun, J., Tang, X.: Single image haze removal using dark channel prior. IEEE Trans. Neural Netw. Learn. Syst. 33(12), 2341–2353 (2010)
Cai, B., Xiangmin, X., Jia, K., Qing, C., Tao, D.: Dehazenet: an end-to-end system for single image haze removal. IEEE Trans. Image Process. 25(11), 5187–5198 (2016)
Ren, W., Liu, S., Zhang, H., Pan, J., Cao, X., Yang, M. H.: Single image dehazing via multi-scale convolutional neural networks. In ECCV (2016)
Li, B., Peng, X., Wang, Z., Xu, J., Feng, D.: Aod-net: All-in-one dehazing network. In ICCV (2017)
Qu, Y., Chen, Y., Huang, J., Xie, Y.: Enhanced pix2pix dehazing network. In CVPR, (2019)
Arbelaez, P., Maire, M., Fowlkes, C., Malik, J.: Contour detection and hierarchical image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 33(5), 898–916 (2010)
Schmidt, U., Jancsary, J., Nowozin, S., Roth, S., Rother, C.: Cascades of regression tree fields for image restoration. IEEE Trans. Pattern Anal. Mach. Intell. 38(4), 677–689 (2015)
Levin, A., Weiss, Y., Durand, F., Freeman, W.T.: Understanding and evaluating blind deconvolution algorithms. In CVPR (2009)
Sun, L., Cho, S., Wang, J., Hays, J.: Edge-based blur kernel estimation using patch priors. In ICCP (2013)
Krishnan, D., Fergus, R.: Fast image deconvolution using hyper-laplacian priors. In NeurIPS (2009)
Schuler, C. J., Christopher Burger, H., Harmeling, S., Scholkopf, B.: A machine learning approach for non-blind image deconvolution. In CVPR (2013)
Pan, J., Lin, Z., Su, Z., Yang, M. H. Robust kernel estimation with outliers handling for image deblurring. In CVPR (2016)
Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P., et al.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)
Wang, T., Yang, X., Xu, K., Chen, S., Zhang, Q., Lau, R. W.: Spatial attentive single-image deraining with a high quality real rain dataset. In CVPR (2019)
Li, B., Ren, W., Dengpan, F., Tao, D., Feng, D., Zeng, W., Wang, Z.: Benchmarking single-image dehazing and beyond. IEEE Trans. Image Process. 28(1), 492–505 (2018)
Silberman, N., Hoiem, D., Kohli, P., Fergus, R.: Indoor segmentation and support inference from rgbd images. In ECCV, (2012)
Scharstein, D., Szeliski, R.: High-accuracy stereo depth maps using structured light. In CVPR (2003)
Liu, F., Shen, C., Lin, G., Reid, I.: Learning depth from single monocular images using deep convolutional neural fields. IEEE Trans. Pattern Anal. Mach. Intell. 38(10), 2024–2039 (2015)
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
We declare that we have no financial and personal relationships with other people or organizations that can inappropriately influence our work, and there is no professional or other personal interest of any nature or kind in any product, service and/or company that could be construed as influencing the position presented in, or the review of, the manuscript entitled, “Learning Adaptive Hyper-guidance via Proxy-based Bilevel Optimization for Image Restoration”.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Gao, J., Liu, X., Liu, R. et al. Learning adaptive hyper-guidance via proxy-based bilevel optimization for image enhancement. Vis Comput 39, 1471–1484 (2023). https://doi.org/10.1007/s00371-022-02423-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-022-02423-3