Transformers in 3D Point Clouds: A Survey

Lu, Dening; Xie, Qian; Wei, Mingqiang; Gao, Kyle; Xu, Linlin; Li, Jonathan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2205.07417 (cs)

[Submitted on 16 May 2022 (v1), last revised 21 Sep 2022 (this version, v2)]

Title:Transformers in 3D Point Clouds: A Survey

Authors:Dening Lu, Qian Xie, Mingqiang Wei, Kyle Gao, Linlin Xu, Jonathan Li

View PDF

Abstract:Transformers have been at the heart of the Natural Language Processing (NLP) and Computer Vision (CV) revolutions. The significant success in NLP and CV inspired exploring the use of Transformers in point cloud processing. However, how do Transformers cope with the irregularity and unordered nature of point clouds? How suitable are Transformers for different 3D representations (e.g., point- or voxel-based)? How competent are Transformers for various 3D processing tasks? As of now, there is still no systematic survey of the research on these issues. For the first time, we provided a comprehensive overview of increasingly popular Transformers for 3D point cloud analysis. We start by introducing the theory of the Transformer architecture and reviewing its applications in 2D/3D fields. Then, we present three different taxonomies (i.e., implementation-, data representation-, and task-based), which can classify current Transformer-based methods from multiple perspectives. Furthermore, we present the results of an investigation of the variants and improvements of the self-attention mechanism in 3D. To demonstrate the superiority of Transformers in point cloud analysis, we present comprehensive comparisons of various Transformer-based methods for classification, segmentation, and object detection. Finally, we suggest three potential research directions, providing benefit references for the development of 3D Transformers.

Comments:	20 pages, 5 figures, 4 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.07417 [cs.CV]
	(or arXiv:2205.07417v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2205.07417

Submission history

From: Dening Lu [view email]
[v1] Mon, 16 May 2022 01:32:18 UTC (718 KB)
[v2] Wed, 21 Sep 2022 15:10:21 UTC (23,522 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Transformers in 3D Point Clouds: A Survey

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Transformers in 3D Point Clouds: A Survey

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators