Widening and Squeezing: Towards Accurate and Efficient QNNs

Liu, Chuanjian; Han, Kai; Wang, Yunhe; Chen, Hanting; Tian, Qi; Xu, Chunjing

arXiv:2002.00555v2 (cs)

[Submitted on 3 Feb 2020 (v1), last revised 12 Feb 2020 (this version, v2)]

Abstract: Quantization neural networks (QNNs) are very attractive to industry because of their extremely cheap calculation and storage overhead, but their performance is still worse than that of networks with full-precision parameters. Most existing methods aim to enhance the performance of QNNs, especially binary neural networks, by exploiting more effective training techniques. However, we find by experiments that the representation capability of quantization features is far weaker than that of full-precision features. We address this problem by projecting features in original full-precision networks to high-dimensional quantization features. Simultaneously, redundant quantization features are eliminated in order to avoid unrestricted growth of dimensions for some datasets. A compact quantization neural network with sufficient representation ability is then established. Experimental results on benchmark datasets demonstrate that the proposed method is able to establish QNNs with far fewer parameters and calculations but almost the same performance as that of full-precision baseline models, e.g. $29.9\%$ top-1 error of binary ResNet-18 on the ImageNet ILSVRC 2012 dataset.
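
The widen-then-squeeze idea described in the abstract can be pictured with a toy sketch. This is not the authors' implementation: the function names `binarize`, `widen`, and `squeeze`, the random projection, and the redundancy criterion (constant or duplicated channels) are all assumptions chosen only for illustration.

```python
import random


def binarize(x):
    """Sign quantization: map each real value to +1 or -1 (zero maps to +1)."""
    return [1 if v >= 0 else -1 for v in x]


def widen(x, weights):
    """Project a d-dim full-precision vector to k dims (one per weight row), then binarize."""
    projected = [sum(w * v for w, v in zip(row, x)) for row in weights]
    return binarize(projected)


def squeeze(batch):
    """Drop redundant binary channels (illustrative criterion, an assumption):
    channels that are constant over the batch, or that duplicate an earlier
    kept channel exactly or up to a sign flip."""
    keep, seen = [], set()
    for c in range(len(batch[0])):
        column = tuple(sample[c] for sample in batch)
        if len(set(column)) == 1:
            continue  # constant channel carries no information on this batch
        negated = tuple(-v for v in column)
        if column in seen or negated in seen:
            continue  # same information as an earlier kept channel
        seen.add(column)
        keep.append(c)
    return keep, [[sample[c] for c in keep] for sample in batch]


# Toy run: widen 4-dim features to 16 binary channels, then squeeze.
random.seed(0)
d, k = 4, 16
weights = [[random.gauss(0, 1) for _ in range(d)] for _ in range(k)]
batch = [[random.gauss(0, 1) for _ in range(d)] for _ in range(8)]
wide = [widen(x, weights) for x in batch]
kept, compact = squeeze(wide)
print(len(wide[0]), "->", len(kept), "binary channels kept")
```

In this sketch the widening step compensates for the information lost by sign quantization by using more binary channels than the original feature dimension, and the squeezing step keeps that growth in check by discarding channels that add nothing on the given batch.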

Comments:	Tech report
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2002.00555 [cs.CV]
	(or arXiv:2002.00555v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2002.00555

Submission history

From: Kai Han
[v1] Mon, 3 Feb 2020 04:11:13 UTC (932 KB)
[v2] Wed, 12 Feb 2020 09:44:24 UTC (932 KB)
