3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian Localization

Qiu, Rui; Xu, Ming; Yan, Yuyao; Smith, Jeremy S.; Yang, Xi

arXiv:2207.10895 (cs)

[Submitted on 22 Jul 2022 (v1), last revised 25 Jul 2022 (this version, v2)]

Abstract: Although deep-learning-based methods for monocular pedestrian detection have made great progress, they remain vulnerable to heavy occlusion. Multi-view information fusion is a potential solution, but its application is limited by the lack of annotated training samples in existing multi-view datasets, which increases the risk of overfitting. To address this problem, a data augmentation method is proposed that randomly generates 3D cylinder occlusions on the ground plane; the cylinders are of average pedestrian size and are projected to multiple views to reduce overfitting during training. Moreover, the feature map of each view is projected by homographies onto multiple parallel planes at different heights, which allows the CNNs to use features across the full height of each pedestrian when inferring pedestrian locations on the ground plane. The proposed 3DROM method greatly improves performance compared with state-of-the-art deep-learning-based methods for multi-view pedestrian detection.
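
The two geometric ideas in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes each camera is described by a known 3x4 projection matrix `P` with the world ground plane at z = 0, and all function names, the cylinder radius (0.3 m) and the cylinder height (1.8 m) are hypothetical choices made for illustration; the abstract only says the cylinders are "of the average size of pedestrians". The sketch shows (1) the homography induced by a plane parallel to the ground at height z, and (2) sampling a randomly placed cylinder whose outline can then be projected into every view.

```python
import numpy as np


def plane_homography(P, z):
    """Homography from plane coordinates (X, Y, 1) at height z to image pixels.

    For P = [p1 p2 p3 p4], the point (X, Y, z, 1) projects to
    X*p1 + Y*p2 + (z*p3 + p4), hence H_z = [p1, p2, z*p3 + p4].
    """
    P = np.asarray(P, dtype=float)
    return np.column_stack([P[:, 0], P[:, 1], z * P[:, 2] + P[:, 3]])


def project_points(P, pts_world):
    """Project an (N, 3) array of world points to (N, 2) pixel coordinates."""
    pts = np.asarray(pts_world, dtype=float)
    homog = np.hstack([pts, np.ones((len(pts), 1))])
    uvw = homog @ np.asarray(P, dtype=float).T
    return uvw[:, :2] / uvw[:, 2:3]


def random_cylinder_outline(rng, x_range, y_range, radius=0.3, height=1.8, n=16):
    """Sample the bottom and top rims of a cylinder at a random ground position.

    radius and height are assumed values (in metres), not taken from the paper.
    Returns a (2n, 3) array: n bottom-rim points followed by n top-rim points.
    """
    cx = rng.uniform(*x_range)
    cy = rng.uniform(*y_range)
    theta = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
    ring = np.stack([cx + radius * np.cos(theta),
                     cy + radius * np.sin(theta)], axis=1)
    bottom = np.hstack([ring, np.zeros((n, 1))])
    top = np.hstack([ring, np.full((n, 1), height)])
    return np.vstack([bottom, top])
```

Under these assumptions, calling `project_points(P_k, random_cylinder_outline(rng, ...))` with the same cylinder for each camera `P_k` gives the occluder's outline in every view, so that the occlusion is geometrically consistent across cameras; filling the region spanned by those points would be one way to paint the occluder into each image. Warping image features onto a plane at height z would use the inverse of `plane_homography(P, z)`, repeated for several values of z to obtain the multiple layers.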

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2207.10895 [cs.CV]
	(or arXiv:2207.10895v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2207.10895
Journal reference:	European Conference on Computer Vision 2022

Submission history

From: Rui Qiu
[v1] Fri, 22 Jul 2022 06:15:20 UTC (5,871 KB)
[v2] Mon, 25 Jul 2022 17:27:35 UTC (5,264 KB)
