RTMDet: An Empirical Study of Designing Real-Time Object Detectors

Lyu, Chengqi; Zhang, Wenwei; Huang, Haian; Zhou, Yue; Wang, Yudong; Liu, Yanyi; Zhang, Shilong; Chen, Kai

arXiv:2212.07784 (cs)

[Submitted on 14 Dec 2022 (v1), last revised 16 Dec 2022 (this version, v2)]

Abstract: In this paper, we aim to design an efficient real-time object detector that exceeds the YOLO series and is easily extensible for many object recognition tasks such as instance segmentation and rotated object detection. To obtain a more efficient model architecture, we explore an architecture that has compatible capacities in the backbone and neck, constructed by a basic building block that consists of large-kernel depth-wise convolutions. We further introduce soft labels when calculating matching costs in the dynamic label assignment to improve accuracy. Together with better training techniques, the resulting object detector, named RTMDet, achieves 52.8% AP on COCO with 300+ FPS on an NVIDIA 3090 GPU, outperforming the current mainstream industrial detectors. RTMDet achieves the best parameter-accuracy trade-off with tiny/small/medium/large/extra-large model sizes for various application scenarios, and obtains new state-of-the-art performance on real-time instance segmentation and rotated object detection. We hope the experimental results can provide new insights into designing versatile real-time object detectors for many object recognition tasks. Code and models are released at this https URL.

Comments:	15 pages, 4 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2212.07784 [cs.CV]
	(or arXiv:2212.07784v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.07784

Submission history

From: Chengqi Lyu
[v1] Wed, 14 Dec 2022 18:50:20 UTC (179 KB)
[v2] Fri, 16 Dec 2022 09:47:56 UTC (180 KB)
