Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition

Y Cai, J Zhao, J Cui, F Zhang, T Feng… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Y Cai, J Zhao, J Cui, F Zhang, T Feng, C Ye
2022 IEEE International Conference on Multisensor Fusion and …, 2022ieeexplore.ieee.org
Visual Place Recognition (VPR) in areas with similar scenes such as urban or indoor
scenarios is a major challenge. Existing VPR methods using global descriptors have
difficulty capturing local specific region (LSR) in the scene and are therefore prone to
localization confusion in such scenarios. As a result, finding the LSRs that are critical for
location recognition becomes key. To address this challenge, we introduced Patch-
NetVLAD+, which was inspired by patch-based VPR researches. Our method proposed a …
Visual Place Recognition (VPR) in areas with similar scenes such as urban or indoor scenarios is a major challenge. Existing VPR methods using global descriptors have difficulty capturing local specific region (LSR) in the scene and are therefore prone to localization confusion in such scenarios. As a result, finding the LSRs that are critical for location recognition becomes key. To address this challenge, we introduced Patch-NetVLAD+, which was inspired by patch-based VPR researches. Our method proposed a fine-tuning strategy with triplet loss to make NetVLAD suitable for extracting patch-level descriptors. Moreover, unlike existing methods that treat all patches in an image equally, our method extracts patches of LSR, which present less frequently throughout the dataset, and makes them play an important role in VPR by assigning proper weights to them. Experiments on Pittsburgh30k and Tokyo247 datasets show that our approach achieved up to 9.3% performance improvement than existing patch-based methods.
ieeexplore.ieee.org