MDS-Net: A Multi-scale Depth Stratification Based Monocular 3D Object Detection Algorithm

Xie, Zhouzhen; Song, Yuying; Wu, Jingxuan; Li, Zecheng; Song, Chunyi; Xu, Zhiwei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2201.04341 (cs)

[Submitted on 12 Jan 2022 (v1), last revised 28 Apr 2022 (this version, v2)]

Title:MDS-Net: A Multi-scale Depth Stratification Based Monocular 3D Object Detection Algorithm

Authors:Zhouzhen Xie, Yuying Song, Jingxuan Wu, Zecheng Li, Chunyi Song, Zhiwei Xu

View PDF

Abstract:Monocular 3D object detection is very challenging in autonomous driving due to the lack of depth information. This paper proposes a one-stage monocular 3D object detection algorithm based on multi-scale depth stratification, which uses the anchor-free method to detect 3D objects in a per-pixel prediction. In the proposed MDS-Net, a novel depth-based stratification structure is developed to improve the network's ability of depth prediction by establishing mathematical models between depth and image size of objects. A new angle loss function is then developed to further improve the accuracy of the angle prediction and increase the convergence speed of training. An optimized soft-NMS is finally applied in the post-processing stage to adjust the confidence of candidate boxes. Experiments on the KITTI benchmark show that the MDS-Net outperforms the existing monocular 3D detection methods in 3D detection and BEV detection tasks while fulfilling real-time requirements.

Comments:	9 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2201.04341 [cs.CV]
	(or arXiv:2201.04341v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2201.04341

Submission history

From: Zhouzhen Xie [view email]
[v1] Wed, 12 Jan 2022 07:11:18 UTC (9,148 KB)
[v2] Thu, 28 Apr 2022 14:31:39 UTC (8,997 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MDS-Net: A Multi-scale Depth Stratification Based Monocular 3D Object Detection Algorithm

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MDS-Net: A Multi-scale Depth Stratification Based Monocular 3D Object Detection Algorithm

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators