ExpPoint-MAE: Better interpretability and performance for self-supervised point cloud transformers

Romanelis, Ioannis; Fotis, Vlassis; Moustakas, Konstantinos; Munteanu, Adrian

doi:10.1109/ACCESS.2024.3388155

arXiv:2306.10798 (cs)

[Submitted on 19 Jun 2023 (v1), last revised 10 Apr 2024 (this version, v3)]

Abstract: In this paper, we delve into the properties of transformers, attained through self-supervision, in the point cloud domain. Specifically, we evaluate the effectiveness of Masked Autoencoding as a pretraining scheme and explore Momentum Contrast as an alternative. We investigate the impact of data quantity on the learned features and uncover similarities in the transformer's behavior across domains. Through comprehensive visualizations, we observe that the transformer learns to attend to semantically meaningful regions, indicating that pretraining leads to a better understanding of the underlying geometry. Moreover, we examine the finetuning process and its effect on the learned representations. Based on these findings, we devise an unfreezing strategy that consistently outperforms our baseline without introducing any other modifications to the model or the training pipeline, and we achieve state-of-the-art results in the classification task among transformer models.
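
For readers unfamiliar with the two pretraining schemes named in the abstract, the following is a minimal, generic sketch of their core mechanics: random masking of patch tokens (Masked Autoencoding) and the exponential-moving-average update of a key encoder (Momentum Contrast). It is not the authors' implementation. The function names, the patch count, the mask ratio, and the momentum value are all hypothetical choices made for illustration, and parameters are represented as plain lists of floats rather than network weights.

```python
import random


def random_patch_mask(num_patches, mask_ratio, seed=None):
    """Split patch indices into (masked, visible) lists.

    Illustrative only: the masked patches would be hidden from the
    encoder and reconstructed by a decoder during pretraining.
    """
    rng = random.Random(seed)
    num_masked = int(round(num_patches * mask_ratio))
    masked = set(rng.sample(range(num_patches), num_masked))
    visible = [i for i in range(num_patches) if i not in masked]
    return sorted(masked), visible


def momentum_update(key_params, query_params, momentum=0.999):
    """MoCo-style update: key <- m * key + (1 - m) * query, elementwise."""
    return [
        momentum * k + (1.0 - momentum) * q
        for k, q in zip(key_params, query_params)
    ]


# Hypothetical usage: 64 patches with 60% of them masked.
masked, visible = random_patch_mask(num_patches=64, mask_ratio=0.6, seed=0)

# Hypothetical usage: the key encoder drifts slowly toward the query encoder.
new_key = momentum_update([1.0, 0.0], [0.0, 1.0], momentum=0.9)
```

In a real pipeline these operations would act on patch embeddings and on the weight tensors of two encoders; the list-based version above only shows the bookkeeping.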

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2306.10798 [cs.CV]
	(or arXiv:2306.10798v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.10798
Related DOI:	https://doi.org/10.1109/ACCESS.2024.3388155

Submission history

From: Ioannis Romanelis
[v1] Mon, 19 Jun 2023 09:38:21 UTC (12,186 KB)
[v2] Fri, 23 Jun 2023 17:09:10 UTC (11,947 KB)
[v3] Wed, 10 Apr 2024 11:42:22 UTC (15,765 KB)
