HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds

Zhang, Gang; Chen, Junnan; Gao, Guohuan; Li, Jianmin; Hu, Xiaolin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2310.20234v1 (cs)

[Submitted on 31 Oct 2023]

Title:HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds

Authors:Gang Zhang, Junnan Chen, Guohuan Gao, Jianmin Li, Xiaolin Hu

View PDF

Abstract:3D object detection in point clouds is important for autonomous driving systems. A primary challenge in 3D object detection stems from the sparse distribution of points within the 3D scene. Existing high-performance methods typically employ 3D sparse convolutional neural networks with small kernels to extract features. To reduce computational costs, these methods resort to submanifold sparse convolutions, which prevent the information exchange among spatially disconnected features. Some recent approaches have attempted to address this problem by introducing large-kernel convolutions or self-attention mechanisms, but they either achieve limited accuracy improvements or incur excessive computational costs. We propose HEDNet, a hierarchical encoder-decoder network for 3D object detection, which leverages encoder-decoder blocks to capture long-range dependencies among features in the spatial space, particularly for large and distant objects. We conducted extensive experiments on the Waymo Open and nuScenes datasets. HEDNet achieved superior detection accuracy on both datasets than previous state-of-the-art methods with competitive efficiency. The code is available at this https URL.

Comments:	Accepted by NeurIPS 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2310.20234 [cs.CV]
	(or arXiv:2310.20234v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.20234

Submission history

From: Gang Zhang [view email]
[v1] Tue, 31 Oct 2023 07:32:08 UTC (755 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators