MinkUNeXt: Point Cloud-based Large-scale Place Recognition using 3D Sparse Convolutions

Cabrera, J. J.; Santo, A.; Gil, A.; Viegas, C.; Payá, L.

arXiv:2403.07593 (cs)

[Submitted on 12 Mar 2024 (v1), last revised 13 Mar 2024 (this version, v2)]

Abstract: This paper presents MinkUNeXt, an effective and efficient architecture for place recognition from point clouds. It is built entirely on the new 3D MinkNeXt Block, a residual block composed of 3D sparse convolutions that follows the design philosophy of recent Transformers while using only simple 3D convolutions. Features are extracted at different scales by a U-Net encoder-decoder network and aggregated into a single descriptor by Generalized Mean (GeM) pooling. The proposed architecture demonstrates that it is possible to surpass the current state of the art by relying only on conventional 3D sparse convolutions, without resorting to more complex proposals such as Transformers, attention layers, or deformable convolutions. The proposal has been thoroughly evaluated on the Oxford RobotCar and In-house datasets. In these evaluations, MinkUNeXt outperforms other state-of-the-art methods.
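
As an illustration of the aggregation step named in the abstract, the following is a minimal sketch of Generalized Mean (GeM) pooling in plain Python. The abstract does not give the paper's configuration, so the function name `gem_pool`, the exponent `p = 3`, the clamping constant `eps`, and the toy feature values are all assumptions made for this example, not the authors' implementation. Each descriptor channel is computed as the p-th root of the mean of the p-th powers of that channel over all points.

```python
from typing import List, Sequence


def gem_pool(features: Sequence[Sequence[float]],
             p: float = 3.0,
             eps: float = 1e-6) -> List[float]:
    """Pool N per-point feature vectors into a single descriptor.

    features: N rows, one per point, each with the same number of channels.
    p:        pooling exponent (assumed value; p = 1 gives average pooling,
              large p approaches max pooling).
    eps:      lower clamp so the fractional power is well defined
              (assumed value).
    """
    if not features:
        raise ValueError("need at least one feature vector")
    n = len(features)
    dim = len(features[0])
    descriptor = []
    for k in range(dim):
        # Mean of the p-th powers of channel k over all points.
        powered_mean = sum(max(row[k], eps) ** p for row in features) / n
        # The p-th root brings the value back to the feature scale.
        descriptor.append(powered_mean ** (1.0 / p))
    return descriptor


# Toy input: 3 points with 2 feature channels each (hypothetical values).
point_features = [[1.0, 0.0], [2.0, 4.0], [3.0, 2.0]]

avg_like = gem_pool(point_features, p=1.0)    # behaves like average pooling
gem_desc = gem_pool(point_features, p=3.0)    # between average and max
max_like = gem_pool(point_features, p=50.0)   # approaches max pooling
```

In this sketch the exponent is a fixed argument. In learned pipelines it is often a trainable parameter, but the abstract does not say which variant MinkUNeXt uses.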

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2403.07593 [cs.CV] (or arXiv:2403.07593v2 [cs.CV] for this version)
DOI: https://doi.org/10.48550/arXiv.2403.07593

Submission history

From: Juan José Cabrera Mora
[v1] Tue, 12 Mar 2024 12:25:54 UTC (2,333 KB)
[v2] Wed, 13 Mar 2024 09:39:14 UTC (2,333 KB)
