MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery

Khaliq, Ahmad; Milford, Michael; Garg, Sourav

doi:10.1109/LRA.2022.3147257

arXiv:2202.09146 (cs)

[Submitted on 18 Feb 2022]

Abstract: Visual Place Recognition (VPR) is a crucial component of 6-DoF localization, visual SLAM and structure-from-motion pipelines, tasked with generating an initial list of place match hypotheses by matching global place descriptors. However, commonly used CNN-based methods either process multiple image resolutions only after training, or use a single resolution and limit multi-scale feature extraction to the last convolutional layer during training. In this paper, we augment NetVLAD representation learning with low-resolution image pyramid encoding, which leads to richer place representations. The resultant multi-resolution feature pyramid can be conveniently aggregated through VLAD into a single compact representation, avoiding the concatenation or summation of multiple patches required by recent multi-scale approaches. Furthermore, we show that the underlying learnt feature tensor can be combined with existing multi-scale approaches to improve their baseline performance. Evaluation on 15 viewpoint-varying and viewpoint-consistent benchmarking datasets confirms that the proposed MultiRes-NetVLAD achieves state-of-the-art Recall@N performance for global-descriptor-based retrieval, compared against 11 existing techniques. Source code is publicly available at this https URL.
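
The aggregation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: average pooling stands in for image resizing, flattened patches stand in for CNN feature maps, random centroids stand in for learned cluster centres, and hard-assignment VLAD stands in for NetVLAD's learned soft assignment. All function names and the pyramid factors (1, 2, 4) are hypothetical choices made for illustration. The point it shows is that local features from every resolution are pooled into one VLAD descriptor whose size does not grow with the number of resolutions.

```python
import numpy as np

def downsample(image, factor):
    """Average-pool an (H, W, C) image by an integer factor (assumed stand-in for resizing)."""
    h, w, c = image.shape
    h2, w2 = h // factor, w // factor
    cropped = image[:h2 * factor, :w2 * factor]
    return cropped.reshape(h2, factor, w2, factor, c).mean(axis=(1, 3))

def local_features(image, patch=4):
    """Toy local descriptors: flattened non-overlapping patches (assumed stand-in for a CNN)."""
    h, w, c = image.shape
    h2, w2 = h // patch, w // patch
    cropped = image[:h2 * patch, :w2 * patch]
    patches = cropped.reshape(h2, patch, w2, patch, c).transpose(0, 2, 1, 3, 4)
    return patches.reshape(h2 * w2, patch * patch * c)

def vlad(features, centroids):
    """Hard-assignment VLAD: per-cluster residual sums, intra-normalised, then L2-normalised."""
    dists = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    assign = dists.argmin(axis=1)
    k, d = centroids.shape
    desc = np.zeros((k, d))
    for j in range(k):
        members = features[assign == j]
        if len(members):
            desc[j] = (members - centroids[j]).sum(axis=0)
    norms = np.linalg.norm(desc, axis=1, keepdims=True)
    desc = desc / np.maximum(norms, 1e-12)
    flat = desc.ravel()
    return flat / max(np.linalg.norm(flat), 1e-12)

def multires_descriptor(image, centroids, factors=(1, 2, 4)):
    """Pool local features from every pyramid level into a single VLAD descriptor."""
    feats = [local_features(downsample(image, f)) for f in factors]
    return vlad(np.concatenate(feats, axis=0), centroids)

rng = np.random.default_rng(0)
image = rng.random((64, 64, 3))          # synthetic image, not real data
centroids = rng.random((8, 4 * 4 * 3))   # 8 hypothetical cluster centres
descriptor = multires_descriptor(image, centroids)
single_res = multires_descriptor(image, centroids, factors=(1,))
```

Here `descriptor` and `single_res` have the same length (clusters times feature dimension), which is why no concatenation or summation across scales is needed in this kind of scheme.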

Comments:	12 pages, 6 figures, accepted for publication in IEEE RA-L 2022 and ICRA 2022, includes supplementary material
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Robotics (cs.RO)
Cite as:	arXiv:2202.09146 [cs.CV]
	(or arXiv:2202.09146v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2202.09146
Journal reference:	IEEE Robotics and Automation Letters vol. 7 no. 2 (April 2022) pp. 3882-3889
Related DOI:	https://doi.org/10.1109/LRA.2022.3147257

Submission history

From: Ahmad Khaliq
[v1] Fri, 18 Feb 2022 11:53:01 UTC (1,738 KB)
