Learning a Virtual Codec Based on Deep Convolutional Neural Network to Compress Image

Zhao, Lijun; Bai, Huihui; Wang, Anhong; Zhao, Yao


arXiv:1712.05969 (cs)

[Submitted on 16 Dec 2017 (v1), last revised 16 Jan 2018 (this version, v7)]



Abstract: Although deep convolutional neural networks have been shown to efficiently remove the coding artifacts caused by the coarse quantization of traditional codecs, it is difficult to train a neural network placed in front of the encoder, because gradients cannot be back-propagated through the codec. In this paper, we propose an end-to-end image compression framework based on convolutional neural networks that resolves the non-differentiability of the quantization function in a standard codec. First, a feature description neural network produces a valid description of the ground-truth image in a low-dimensional space, so that the amount of image data to be stored or transmitted is greatly reduced. A standard image codec such as JPEG then compresses this description further, which introduces severe distortion and compression artifacts, in particular blocking, loss of detail, blurring, and ringing. Next, a post-processing neural network removes these artifacts. Because it is challenging to directly learn the non-linear function of a standard codec with a convolutional neural network, we propose to learn a virtual codec neural network that approximates the projection from the valid description image to the post-processed compressed image, so that gradients can be efficiently back-propagated from the post-processing neural network to the feature description neural network during training. We also propose an advanced learning algorithm to train our deep neural networks for compression. An advantage of the proposed method is that it is compatible with existing standard codecs, and our learning strategy can easily be extended to these codecs based on convolutional neural networks. Experimental results demonstrate the advantages of the proposed method over several state-of-the-art approaches, especially at very low bit-rates.
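The gradient problem described in the abstract can be illustrated with a toy one-dimensional sketch. This is not the authors' method or code: a uniform quantizer stands in for the standard codec, a least-squares linear fit stands in for the virtual codec neural network, and a single scalar weight stands in for the feature description network. All function and variable names (`quantize`, `fit_virtual_codec`, `numeric_grad`, `w0`) are hypothetical and chosen only for illustration.

```python
# Toy sketch (assumption: scalars and a linear fit replace the paper's CNNs).

def quantize(y, step=0.5):
    """Stand-in for a standard codec: uniform quantization, piecewise constant."""
    return step * round(y / step)

def fit_virtual_codec(samples, step=0.5):
    """Least-squares fit of a smooth surrogate v(y) = a*y + b to the quantizer."""
    n = len(samples)
    targets = [quantize(y, step) for y in samples]
    mean_y = sum(samples) / n
    mean_t = sum(targets) / n
    cov = sum((y - mean_y) * (t - mean_t) for y, t in zip(samples, targets))
    var = sum((y - mean_y) ** 2 for y in samples)
    a = cov / var
    b = mean_t - a * mean_y
    return a, b

def numeric_grad(f, w, eps=1e-4):
    """Central finite difference of f at w."""
    return (f(w + eps) - f(w - eps)) / (2 * eps)

x, target = 1.0, 1.0  # toy input "image" and reconstruction target

def loss_real(w):
    # Loss measured through the real (non-differentiable) quantizer.
    return (quantize(w * x) - target) ** 2

# Fit the surrogate on inputs covering the range the quantizer will see.
samples = [i / 100 for i in range(-300, 301)]
a, b = fit_virtual_codec(samples)

def loss_virtual(w):
    # Same loss, but measured through the smooth surrogate.
    return ((a * (w * x) + b) - target) ** 2

w0 = 0.6
grad_real = numeric_grad(loss_real, w0)        # zero: the quantizer is flat here
grad_virtual = numeric_grad(loss_virtual, w0)  # non-zero: usable training signal
print(grad_real, grad_virtual)
```

In this sketch the loss through the real quantizer gives a zero gradient at `w0`, so the upstream weight receives no update, while the loss through the fitted surrogate gives a non-zero gradient that points toward the target. The paper applies the same idea with convolutional networks and a real codec such as JPEG; the details of that training procedure are in the paper itself and are not reproduced here.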

Comments:	11 pages, 7 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1712.05969 [cs.CV]
	(or arXiv:1712.05969v7 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1712.05969

Submission history

From: Lijun Zhao
[v1] Sat, 16 Dec 2017 14:55:13 UTC (3,031 KB)
[v2] Tue, 19 Dec 2017 16:50:36 UTC (3,031 KB)
[v3] Wed, 20 Dec 2017 08:33:09 UTC (3,031 KB)
[v4] Tue, 2 Jan 2018 12:26:16 UTC (3,031 KB)
[v5] Mon, 8 Jan 2018 03:52:47 UTC (2,981 KB)
[v6] Sun, 14 Jan 2018 03:44:22 UTC (2,981 KB)
[v7] Tue, 16 Jan 2018 10:02:55 UTC (3,006 KB)
