HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval

He, Chao; Wei, Hongxi

Computer Science > Computer Vision and Pattern Recognition

arXiv:2405.07524v2 (cs)

[Submitted on 13 May 2024 (v1), last revised 14 May 2024 (this version, v2)]

Title:HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval

Authors:Chao He, Hongxi Wei

View PDF HTML (experimental)

Abstract:Deep image hashing aims to map input images into simple binary hash codes via deep neural networks and thus enable effective large-scale image retrieval. Recently, hybrid networks that combine convolution and Transformer have achieved superior performance on various computer tasks and have attracted extensive attention from researchers. Nevertheless, the potential benefits of such hybrid networks in image retrieval still need to be verified. To this end, we propose a hybrid convolutional and self-attention deep hashing method known as HybridHash. Specifically, we propose a backbone network with stage-wise architecture in which the block aggregation function is introduced to achieve the effect of local self-attention and reduce the computational complexity. The interaction module has been elaborately designed to promote the communication of information between image blocks and to enhance the visual representations. We have conducted comprehensive experiments on three widely used datasets: CIFAR-10, NUS-WIDE and IMAGENET. The experimental results demonstrate that the method proposed in this paper has superior performance with respect to state-of-the-art deep hashing methods. Source code is available this https URL.

Comments:	Accepted by ICMR 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2405.07524 [cs.CV]
	(or arXiv:2405.07524v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2405.07524

Submission history

From: Chao He Hc [view email]
[v1] Mon, 13 May 2024 07:45:20 UTC (1,093 KB)
[v2] Tue, 14 May 2024 09:09:47 UTC (1,093 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators