Fisher-aware Quantization for DETR Detectors with Critical-category Objectives

Yang, Huanrui; Huang, Yafeng; Dong, Zhen; Gudovskiy, Denis A; Okuno, Tomoyuki; Nakata, Yohei; Du, Yuan; Keutzer, Kurt; Zhang, Shanghang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.03442 (cs)

[Submitted on 3 Jul 2024]

Title:Fisher-aware Quantization for DETR Detectors with Critical-category Objectives

Authors:Huanrui Yang, Yafeng Huang, Zhen Dong, Denis A Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Yuan Du, Kurt Keutzer, Shanghang Zhang

View PDF HTML (experimental)

Abstract:The impact of quantization on the overall performance of deep learning models is a well-studied problem. However, understanding and mitigating its effects on a more fine-grained level is still lacking, especially for harder tasks such as object detection with both classification and regression objectives. This work defines the performance for a subset of task-critical categories, i.e. the critical-category performance, as a crucial yet largely overlooked fine-grained objective for detection tasks. We analyze the impact of quantization at the category-level granularity, and propose methods to improve performance for the critical categories. Specifically, we find that certain critical categories have a higher sensitivity to quantization, and are prone to overfitting after quantization-aware training (QAT). To explain this, we provide theoretical and empirical links between their performance gaps and the corresponding loss landscapes with the Fisher information framework. Using this evidence, we apply a Fisher-aware mixed-precision quantization scheme, and a Fisher-trace regularization for the QAT on the critical-category loss landscape. The proposed methods improve critical-category metrics of the quantized transformer-based DETR detectors. They are even more significant in case of larger models and higher number of classes where the overfitting becomes more severe. For example, our methods lead to 10.4% and 14.5% mAP gains for, correspondingly, 4-bit DETR-R50 and Deformable DETR on the most impacted critical classes in the COCO Panoptic dataset.

Comments:	Poster presentation at the 2nd Workshop on Advancing Neural Network Training: Computational Efficiency, Scalability, and Resource Optimization (WANT@ICML 2024)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.03442 [cs.CV]
	(or arXiv:2407.03442v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.03442

Submission history

From: Huanrui Yang [view email]
[v1] Wed, 3 Jul 2024 18:35:53 UTC (16,545 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Fisher-aware Quantization for DETR Detectors with Critical-category Objectives

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Fisher-aware Quantization for DETR Detectors with Critical-category Objectives

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators