Sparse Cross-scale Attention Network for Efficient LiDAR Panoptic Segmentation

Xu, Shuangjie; Wan, Rui; Ye, Maosheng; Zou, Xiaoyi; Cao, Tongyi


arXiv:2201.05972 (cs)

[Submitted on 16 Jan 2022]

Abstract: Two major challenges of 3D LiDAR Panoptic Segmentation (PS) are that the points of an object are aggregated on its surface, which makes long-range dependencies hard to model, especially for large instances, and that objects are often too close to one another to separate. Recent literature addresses these problems with time-consuming grouping processes such as dual-clustering and mean-shift offsets, or with a bird's-eye-view (BEV) dense centroid representation that downplays geometry. However, the local feature learning in these methods does not sufficiently model long-range geometric relationships. To this end, we present SCAN, a novel sparse cross-scale attention network that first aligns multi-scale sparse features with global voxel-encoded attention to capture the long-range relationships of instance context, which boosts the regression accuracy for large objects that tend to be over-segmented. For the surface-aggregated points, SCAN adopts a novel sparse class-agnostic representation of instance centroids, which not only maintains the sparsity of the aligned features to resolve under-segmentation of small objects, but also reduces the network's computation through sparse convolution. Our method outperforms previous methods by a large margin on the SemanticKITTI dataset for the challenging 3D PS task, achieving 1st place with real-time inference speed.
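
To make the idea of a sparse, class-agnostic centroid representation more concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the function name `sparse_centroid_targets`, the Gaussian confidence, the `voxel_size` and `sigma` values, and the convention that instance id 0 means "no instance" are all assumptions made for illustration. It only shows the general pattern of storing centroid confidence on occupied voxels instead of on a dense grid.

```python
import numpy as np

def sparse_centroid_targets(points, instance_ids, voxel_size=0.5, sigma=1.0):
    """Hypothetical helper: centroid confidence on occupied voxels only.

    points:       (N, 3) float array of LiDAR points.
    instance_ids: (N,) int array; 0 is assumed to mean "no instance".
    Returns (coords, heat): integer voxel indices and one score per voxel.
    """
    # Integer voxel index of every point.
    vox = np.floor(points / voxel_size).astype(np.int64)
    # One entry per occupied voxel; empty space is never materialized.
    coords = np.unique(vox, axis=0)
    centers = (coords + 0.5) * voxel_size

    # One centroid per instance, regardless of semantic class.
    ids = np.unique(instance_ids[instance_ids > 0])
    if ids.size == 0:
        return coords, np.zeros(len(coords))
    centroids = np.stack([points[instance_ids == i].mean(axis=0) for i in ids])

    # Distance from each occupied voxel center to its nearest centroid.
    diff = centers[:, None, :] - centroids[None, :, :]
    dist = np.linalg.norm(diff, axis=-1).min(axis=1)
    # Gaussian confidence (an assumed choice, not taken from the paper).
    heat = np.exp(-(dist ** 2) / (2.0 * sigma ** 2))
    return coords, heat

# Two small, well-separated toy instances.
pts = np.array([[0.1, 0.1, 0.1], [0.3, 0.1, 0.1],
                [5.1, 5.1, 0.1], [5.3, 5.1, 0.1]])
ids = np.array([1, 1, 2, 2])
coords, heat = sparse_centroid_targets(pts, ids)
```

In this sketch the output size scales with the number of occupied voxels and not with the volume of the scene, which is the property a sparse representation is meant to preserve.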

Comments:	Accepted by the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2201.05972 [cs.CV]
	(or arXiv:2201.05972v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2201.05972

Submission history

From: Shuangjie Xu
[v1] Sun, 16 Jan 2022 05:34:54 UTC (9,798 KB)
