research-article

Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images

Authors:

Chang Wen ChenAuthors Info & Claims

MM '18: Proceedings of the 26th ACM international conference on Multimedia

Pages 984 - 992

https://doi.org/10.1145/3240508.3240653

Published: 15 October 2018 Publication History

Abstract

Modeling statistical regularity plays an essential role in ill-posed image processing problems. Recently, deep learning based methods have been presented to implicitly learn statistical representation of pixel distributions in natural images and leverage it as a constraint to facilitate subsequent tasks, such as color constancy and image dehazing. However, the existing CNN architecture is prone to variability and diversity of pixel intensity within and between local regions, which may result in inaccurate statistical representation. To address this problem, this paper presents a novel fully point-wise CNN architecture for modeling statistical regularities in natural images. Specifically, we propose to randomly shuffle the pixels in the origin images and leverage the shuffled image as input to make CNN more concerned with the statistical properties. Moreover, since the pixels in the shuffled image are independent identically distributed, we can replace all the large convolution kernels in CNN with point-wise (1*1) convolution kernels while maintaining the representation ability. Experimental results on two applications: color constancy and image dehazing, demonstrate the superiority of our proposed network over the existing architectures, i.e., using 1/10~1/100 network parameters and computational cost while achieving comparable performance.

References

[1]

Kobus Barnard. 2000. Improvements to gamut mapping colour constancy algorithms. Computer Vision-ECCV 2000 (2000), 390--403.

Digital Library

[2]

Kobus Barnard, Vlad Cardei, and Brian Funt. 2002. A comparison of computational color constancy algorithms. I: Methodology and experiments with synthesized data. IEEE transactions on Image Processing, Vol. 11, 9 (2002), 972--984.

Digital Library

[3]

Jonathan T Barron. 2015. Convolutional color constancy. In Proceedings of the IEEE International Conference on Computer Vision. 379--387.

Digital Library

[4]

Jonathan T Barron and Yun-Ta Tsai. 2017. Fast Fourier Color Constancy. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017).

[5]

Dana Berman, Shai Avidan, et almbox. 2016. Non-local image dehazing. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1674--1682.

[6]

Simone Bianco, Claudio Cusano, and Raimondo Schettini. 2015. Color constancy using CNNs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 81--89.

[7]

Simone Bianco, Claudio Cusano, and Raimondo Schettini. 2017. Single and Multiple Illuminant Estimation Using Convolutional Neural Networks. IEEE Transactions on Image Processing (2017).

[8]

David H Brainard and Brian A Wandell. 1986. Analysis of the retinex theory of color vision. JOSA A, Vol. 3, 10 (1986), 1651--1661.

[9]

Gershon Buchsbaum. 1980. A spatial processor model for object colour perception. Journal of the Franklin institute, Vol. 310, 1 (1980), 1--26.

[10]

Harold C Burger, Christian J Schuler, and Stefan Harmeling. 2012. Image denoising: Can plain neural networks compete with BM3D?. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2392--2399.

Digital Library

[11]

Bolun Cai, Xiangmin Xu, Kui Jia, Chunmei Qing, and Dacheng Tao. 2016. Dehazenet: An end-to-end system for single image haze removal. IEEE Transactions on Image Processing, Vol. 25, 11 (2016), 5187--5198.

Digital Library

[12]

Yang Cao, Shuai Fang, and Zengfu Wang. 2013. Digital multi-focusing from a single photograph taken with an uncalibrated conventional camera. IEEE Transactions on image processing, Vol. 22, 9 (2013), 3703--3714.

Digital Library

[13]

Dongliang Cheng, Dilip K Prasad, and Michael S Brown. 2014. Illuminant estimation for color constancy: why spatial-domain methods work and the role of the color distribution. JOSA A, Vol. 31, 5 (2014), 1049--1058.

[14]

Dongliang Cheng, Brian Price, Scott Cohen, and Michael S Brown. 2015. Effective learning-based illuminant estimation using simple features. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . 1000--1008.

[15]

Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. 2016. Image super-resolution using deep convolutional networks. IEEE transactions on pattern analysis and machine intelligence, Vol. 38, 2 (2016), 295--307.

Digital Library

[16]

Raanan Fattal, Dani Lischinski, and Michael Werman. 2002. Gradient domain high dynamic range compression. In ACM Transactions on Graphics (TOG), Vol. 21. ACM, 249--256.

Digital Library

[17]

Graham D. Finlayson, Steven D. Hordley, and Paul M. Hubel. 2001. Color by correlation: A simple, unifying framework for color constancy. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 23, 11 (2001), 1209--1221.

Digital Library

[18]

Graham D Finlayson and Elisabetta Trezzi. 2004. Shades of gray and colour constancy. In Color and Imaging Conference, Vol. 2004. Society for Imaging Science and Technology, 37--41.

[19]

Peter Vincent Gehler, Carsten Rother, Andrew Blake, Tom Minka, and Toby Sharp. 2008. Bayesian color constancy revisited. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, 1--8.

[20]

Arjan Gijsenij and Theo Gevers. 2011. Color constancy using natural image statistics and scene semantics. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 33, 4 (2011), 687--698.

Digital Library

[21]

Xiaojie Guo, Yu Li, and Haibin Ling. 2017. LIME: Low-Light Image Enhancement via Illumination Map Estimation. IEEE Transactions on Image Processing, Vol. 26, 2 (2017), 982--993.

Digital Library

[22]

Kaiming He, Jian Sun, and Xiaoou Tang. 2009. Single image haze removal using dark channel prior. In Proceedings of the IEEE conference on computer vision and pattern recognition .

[23]

Kaiming He, Jian Sun, and Xiaoou Tang. 2011. Single image haze removal using dark channel prior. IEEE transactions on pattern analysis and machine intelligence, Vol. 33, 12 (2011), 2341--2353.

Digital Library

[24]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.

[25]

Heiko Hirschmuller and Daniel Scharstein. 2007. Evaluation of cost functions for stereo matching. In Computer Vision and Pattern Recognition, 2007. CVPR'07. IEEE Conference on. IEEE, 1--8.

[26]

Andrew G Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017).

[27]

Yuanming Hu, Baoyuan Wang, and Stephen Lin. 2017. FC4: Fully Convolutional Color Constancy with Confidence-weighted Pooling. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . 4085--4094.

[28]

Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the 22nd ACM international conference on Multimedia. ACM, 675--678.

Digital Library

[29]

Hamid Reza Vaezi Joze, Mark S Drew, Graham D Finlayson, and Perla Aurora Troncoso Rey. 2012. The role of bright pixels in illumination estimation. In Color and Imaging Conference, Vol. 2012. Society for Imaging Science and Technology, 41--46.

[30]

Yan Karklin and Michael S Lewicki. 2005. A hierarchical Bayesian model for learning nonlinear statistical regularities in nonstationary natural signals. Neural computation, Vol. 17, 2 (2005), 397--423.

Digital Library

[31]

Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, and Dan Feng. 2017. Aod-net: All-in-one dehazing network. In Proceedings of the IEEE International Conference on Computer Vision, Vol. 1. 7.

[32]

Yan-Tsung Peng and Pamela C Cosman. 2017. Underwater Image Restoration Based on Image Blurriness and Light Absorption. IEEE Transactions on Image Processing, Vol. 26, 4 (2017), 1579--1594.

Digital Library

[33]

Wenqi Ren, Si Liu, Hua Zhang, Jinshan Pan, Xiaochun Cao, and Ming-Hsuan Yang. 2016. Single image dehazing via multi-scale convolutional neural networks. In European Conference on Computer Vision. Springer, 154--169.

[34]

Lilong Shi. 2000. Re-processed version of the gehler color constancy dataset of 568 images. http://www.cs.sfu.ca/%7Ecolour/data/, (2000).

[35]

Wu Shi, Chen Change Loy, and Xiaoou Tang. 2016. Deep specialized network for illuminant estimation. In European Conference on Computer Vision. Springer, 371--387.

[36]

Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1--9.

[37]

Ketan Tang, Jianchao Yang, and Jue Wang. 2014. Investigating haze-relevant features in a learning framework for image dehazing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . 2995--3000.

Digital Library

[38]

Joost Van De Weijer, Theo Gevers, and Arjan Gijsenij. 2007. Edge-based color constancy. IEEE Transactions on image processing, Vol. 16, 9 (2007), 2207--2214.

Digital Library

[39]

Junyuan Xie, Linli Xu, and Enhong Chen. 2012. Image denoising and inpainting with deep neural networks. In Advances in Neural Information Processing Systems. 341--349.

Digital Library

[40]

Jing Zhang, Yang Cao, Shuai Fang, Yu Kang, and Chang Wen Chen. 2017a. Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . 7418--7426.

[41]

Kai Zhang, Wangmeng Zuo, Yunjin Chen, Deyu Meng, and Lei Zhang. 2017c. Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising. IEEE Transactions on Image Processing (2017).

Digital Library

[42]

Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, and Jian Sun. 2017b. Shufflenet: An extremely efficient convolutional neural network for mobile devices. arXiv preprint arXiv:1707.01083 (2017).

[43]

Qingsong Zhu, Jiaming Mai, and Ling Shao. 2014. Single Image Dehazing Using Color Attenuation Prior. In BMVC .

Cited By

Sahu GSeal AYazidi AKrejcar O(2024)A Dual-Channel Dehaze-Net for Single Image Dehazing in Visual Internet of Things Using PYNQ-Z2 BoardIEEE Transactions on Automation Science and Engineering10.1109/TASE.2022.321780121:1(305-319)Online publication date: Jan-2024
https://doi.org/10.1109/TASE.2022.3217801
Zhang YDe Smedt J(2024)Index tracking using shapley additive explanations and one-dimensional pointwise convolutional autoencodersInternational Review of Financial Analysis10.1016/j.irfa.2024.10348795(103487)Online publication date: Oct-2024
https://doi.org/10.1016/j.irfa.2024.103487
Wang XGuo JWang YHe W(2024)Jdlmask: joint defogging learning with boundary refinement for foggy scene instance segmentationThe Visual Computer10.1007/s00371-023-03230-0Online publication date: 30-Jan-2024
https://doi.org/10.1007/s00371-023-03230-0
Show More Cited By

Index Terms

Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images
1. Computing methodologies
  1. Computer graphics
    1. Image manipulation
      1. Image processing
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks

Recommendations

Single Image Dehazing via Image Generating
Image and Video Technology
Abstract
Outdoor images taken in bad weather conditions often suffer from poor visibility. However, single image haze removal is an ill-posed problem, because the number of the equations is smaller than the number of unknowns. In this paper, a deep ...
Learning intensity and detail mapping parameters for dehazing

State-of-the-art methods for removing haze from a single image rely on a haze image formation model and the inversion problem is solved by estimating medium transmission and global atmospheric light. In this paper, we propose a hazy image enhancement ...
Binarization of Color Historical Document Images Using Local Image Equalization and XDoG
ICDAR '13: Proceedings of the 2013 12th International Conference on Document Analysis and Recognition

This paper presents a new binarization method for color images of degraded historical document. The proposed method makes use of local image equalization based on color constancy, and an extension to the standard difference of Gaussians edge detection ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '18: Proceedings of the 26th ACM international conference on Multimedia

October 2018

2167 pages

ISBN:9781450356657

DOI:10.1145/3240508

General Chairs:
Susanne Boll
University of Oldenburg, Germany
,
Kyoung Mu Lee
Seoul National University, Korea
,
Jiebo Luo
University of Rochester, USA
,
Wenwu Zhu
Tsinghua University, China
,
Program Chairs:
Hyeran Byun
Yonsei University, Korea
,
Chang Wen Chen
State Univ. Of New York at Buffalo, USA
,
Rainer Lienhart
University of Augsburg, Germany
,
Tao Mei
JD AI, China

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China

Conference

MM '18

Sponsor:

SIGMM

MM '18: ACM Multimedia Conference

October 22 - 26, 2018

Seoul, Republic of Korea

Acceptance Rates

MM '18 Paper Acceptance Rate 209 of 757 submissions, 28%;

Overall Acceptance Rate 995 of 4,171 submissions, 24%

Upcoming Conference

MM '24

Sponsor:
sigmm

The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

28
Total Citations
View Citations
239
Total Downloads

Downloads (Last 12 months)13
Downloads (Last 6 weeks)5

Reflects downloads up to 02 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Sahu GSeal AYazidi AKrejcar O(2024)A Dual-Channel Dehaze-Net for Single Image Dehazing in Visual Internet of Things Using PYNQ-Z2 BoardIEEE Transactions on Automation Science and Engineering10.1109/TASE.2022.321780121:1(305-319)Online publication date: Jan-2024
https://doi.org/10.1109/TASE.2022.3217801
Zhang YDe Smedt J(2024)Index tracking using shapley additive explanations and one-dimensional pointwise convolutional autoencodersInternational Review of Financial Analysis10.1016/j.irfa.2024.10348795(103487)Online publication date: Oct-2024
https://doi.org/10.1016/j.irfa.2024.103487
Wang XGuo JWang YHe W(2024)Jdlmask: joint defogging learning with boundary refinement for foggy scene instance segmentationThe Visual Computer10.1007/s00371-023-03230-0Online publication date: 30-Jan-2024
https://doi.org/10.1007/s00371-023-03230-0
Chen SLiu SChen XDan JWu B(2024)Improved AODNet for Fast Image DehazingMobile Networks and Management10.1007/978-3-031-55471-1_12(154-165)Online publication date: 17-Mar-2024
https://doi.org/10.1007/978-3-031-55471-1_12
Lin SLin MKang BLin YLiu YYin CLin PLin C(2023)Artificial Neural Network-Assisted Classification of Hearing Prognosis of Sudden Sensorineural Hearing Loss With VertigoIEEE Journal of Translational Engineering in Health and Medicine10.1109/JTEHM.2023.324233911(170-181)Online publication date: 2023
https://doi.org/10.1109/JTEHM.2023.3242339
Meng XLu XYe HYang BCao F(2023)A new self-augment CNN for 3D point cloud classification and segmentationInternational Journal of Machine Learning and Cybernetics10.1007/s13042-023-01940-415:3(807-818)Online publication date: 17-Aug-2023
https://doi.org/10.1007/s13042-023-01940-4
Ayoub ANaeem EEl-Shafai WEl-Samie FHamad EEl-Rabaie E(2023)Video quality enhancement using different enhancement and dehazing techniquesJournal of Ambient Intelligence and Humanized Computing10.1007/s12652-023-04642-514:12(16607-16635)Online publication date: 26-Aug-2023
https://doi.org/10.1007/s12652-023-04642-5
Zhang QXu YZhang JTao D(2023)ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and BeyondInternational Journal of Computer Vision10.1007/s11263-022-01739-w131:5(1141-1162)Online publication date: 12-Jan-2023
https://doi.org/10.1007/s11263-022-01739-w
Xiang RZhu XWu FXu QZhang L(2023)Deep-Aware Network for Removing Single HazeProceedings of Eighth International Congress on Information and Communication Technology10.1007/978-981-99-3236-8_14(181-191)Online publication date: 15-Sep-2023
https://doi.org/10.1007/978-981-99-3236-8_14
Wang WZhang JZhai WCao YTao D(2022)Robust Object Detection via Adversarial Novel Style ExplorationIEEE Transactions on Image Processing10.1109/TIP.2022.314601731(1949-1962)Online publication date: 2022
https://doi.org/10.1109/TIP.2022.3146017
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents