One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

Sun, Qigong; Li, Xiufang; Ren, Yan; Huang, Zhongjian; Liu, Xu; Jiao, Licheng; Liu, Fang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2105.01353 (cs)

[Submitted on 4 May 2021]

Title:One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

Authors:Qigong Sun, Xiufang Li, Yan Ren, Zhongjian Huang, Xu Liu, Licheng Jiao, Fang Liu

View PDF

Abstract:As an effective technique to achieve the implementation of deep neural networks in edge devices, model quantization has been successfully applied in many practical applications. No matter the methods of quantization aware training (QAT) or post-training quantization (PTQ), they all depend on the target bit-widths. When the precision of quantization is adjusted, it is necessary to fine-tune the quantized model or minimize the quantization noise, which brings inconvenience in practical applications. In this work, we propose a method to train a model for all quantization that supports diverse bit-widths (e.g., form 8-bit to 1-bit) to satisfy the online quantization bit-width adjustment. It is hot-swappable that can provide specific quantization strategies for different candidates through multiscale quantization. We use wavelet decomposition and reconstruction to increase the diversity of weights, thus significantly improving the performance of each quantization candidate, especially at ultra-low bit-widths (e.g., 3-bit, 2-bit, and 1-bit). Experimental results on ImageNet and COCO show that our method can achieve accuracy comparable performance to dedicated models trained at the same precision.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2105.01353 [cs.CV]
	(or arXiv:2105.01353v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2105.01353

Submission history

From: Qigong Sun [view email]
[v1] Tue, 4 May 2021 08:10:50 UTC (1,114 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-05

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Qigong Sun
Xiufang Li
Yan Ren
Xu Liu
Licheng Jiao

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators