ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes

Qi, Charles R.; Chen, Xinlei; Litany, Or; Guibas, Leonidas J.

arXiv:2001.10692 (cs)

[Submitted on 29 Jan 2020]

Abstract: 3D object detection has progressed quickly thanks to advances in deep learning on point clouds. A few recent works have even shown state-of-the-art performance with point cloud input alone (e.g. VoteNet). However, point cloud data have inherent limitations: they are sparse, lack color information, and often suffer from sensor noise. Images, on the other hand, have high resolution and rich texture, so they can complement the 3D geometry provided by point clouds. Yet how to use image information effectively to assist point-cloud-based detection is still an open question. In this work, we build on VoteNet and propose a 3D detection architecture called ImVoteNet, specialized for RGB-D scenes. ImVoteNet is based on fusing 2D votes in images and 3D votes in point clouds. Compared to prior work on multi-modal detection, we explicitly extract both geometric and semantic features from the 2D images, and we leverage camera parameters to lift these features to 3D. To improve the synergy of 2D-3D feature fusion, we also propose a multi-tower training scheme. We validate our model on the challenging SUN RGB-D dataset, advancing state-of-the-art results by 5.7 mAP. We also provide rich ablation studies to analyze the contribution of each design choice.
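The abstract mentions using camera parameters to lift 2D image features to 3D. As a rough illustration of the underlying geometry only, the sketch below back-projects a pixel with known depth into the camera frame under a standard pinhole model. It is not the authors' implementation: the function names `backproject_pixel` and `project_point` and the intrinsics used in the example (`fx`, `fy`, `cx`, `cy`) are hypothetical, and the paper's actual lifting of image votes may differ.

```python
from typing import Tuple


def backproject_pixel(u: float, v: float, depth: float,
                      fx: float, fy: float,
                      cx: float, cy: float) -> Tuple[float, float, float]:
    """Lift pixel (u, v) with metric depth to a 3D point in the camera frame.

    Assumes an ideal pinhole camera with focal lengths (fx, fy) and
    principal point (cx, cy), and no lens distortion.
    """
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, depth)


def project_point(x: float, y: float, z: float,
                  fx: float, fy: float,
                  cx: float, cy: float) -> Tuple[float, float]:
    """Project a camera-frame 3D point back to pixel coordinates (inverse of the above)."""
    u = fx * x / z + cx
    v = fy * y / z + cy
    return (u, v)


# Hypothetical intrinsics, chosen only for illustration.
FX, FY, CX, CY = 500.0, 500.0, 320.0, 240.0

point = backproject_pixel(420.0, 140.0, 2.0, FX, FY, CX, CY)
pixel = project_point(*point, FX, FY, CX, CY)
```

Projecting the lifted point again should recover the original pixel, which is a quick sanity check that the intrinsics are being applied consistently in both directions.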

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2001.10692 [cs.CV]
	(or arXiv:2001.10692v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2001.10692

Submission history

From: Charles Ruizhongtai Qi
[v1] Wed, 29 Jan 2020 05:09:28 UTC (8,091 KB)
