research-article

Global Meets Local: Dual Activation Hashing Network for Large-Scale Fine-Grained Image Retrieval

Authors:

Xin Jiang,

Hao Tang,

Zechao LiAuthors Info & Claims

IEEE Transactions on Knowledge and Data Engineering, Volume 36, Issue 11

Pages 6266 - 6279

https://doi.org/10.1109/TKDE.2024.3393512

Published: 01 November 2024 Publication History

Abstract

In the Internet era, the exponential growth of fine-grained image databases poses a considerable challenge for efficient information retrieval. Hashing-based approaches gained traction for their computational and storage efficiency, yet fine-grained hashing retrieval presents unique challenges due to small inter-class and large intra-class variations inherent to fine-grained entities. Thus, traditional hashing algorithms falter in discerning these subtle, yet critical, visual differences and fail to generate compact yet semantically rich hash codes. To address this, we introduce a Dual Activation Hashing Network (<sc>DAHNet</sc>) designed to convert high-dimensional image data into optimized binary codes via an innovative feature activation paradigm. The architecture consists of dual branches specifically tailored for global and local semantic activation, thereby establishing direct correspondences between hash codes and distinguishable object parts through a hierarchical activation pipeline. Specifically, our spatial-oriented semantic activation module modulates dominant visual regions while amplifying the activations of subtle yet semantically rich areas in a controlled manner. Building on these activated visual representations, the proposed inter-region semantic enrichment module further enriches them by unearthing semantically complementary cues. Concurrently, <sc>DAHNet</sc> integrates a channel-oriented semantic activation module that exploits channel-specific correlations to distill contextual cues from spatially-activated visual features, thereby reinforcing robust learning to hash. To maintain the similarity of the original entities, we amalgamate final hash codes from both activation branches, capturing both local textural details and global structural information. Comprehensive evaluations on five fine-grained image retrieval benchmarks demonstrate <sc>DAHNet</sc>'s superior performance over existing state-of-the-art hashing solutions, especially on 12-bit, improving performance by 4%–15% compared to the current best results on the five benchmarks. Moreover, generalization studies validate the efficacy of our dual-activation framework in the domain of content-based fine-grained image retrieval.

Index Terms

Global Meets Local: Dual Activation Hashing Network for Large-Scale Fine-Grained Image Retrieval
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations
      2. Computer vision tasks
        Visual content-based indexing and retrieval
2. Information systems
  1. Information retrieval

Index terms have been assigned to the content through auto-classification.

Recommendations

Reward modulation of hippocampal subfield activation during successful associative encoding and retrieval

Emerging evidence suggests that motivation enhances episodic memory formation through interactions between medial-temporal lobe (MTL) structures and dopaminergic midbrain. In addition, recent theories propose that motivation specifically facilitates ...
Activation of inhibition: Diminishing impulsive behavior by direct current stimulation over the inferior frontal gyrus

A common feature of human existence is the ability to reverse decisions after they are made but before they are implemented. This cognitive control process, termed response inhibition, refers to the ability to inhibit an action once initiated and has ...
Executive semantic processing is underpinned by a large-scale neural network: Revealing the contribution of left prefrontal, posterior temporal, and parietal cortex to controlled retrieval and selection using tms

To understand the meanings of words and objects, we need to have knowledge about these items themselves plus executive mechanisms that compute and manipulate semantic information in a task-appropriate way. The neural basis for semantic control remains ...

Comments

Information & Contributors

Information

Published In

cover image IEEE Transactions on Knowledge and Data Engineering

IEEE Transactions on Knowledge and Data Engineering Volume 36, Issue 11

Nov. 2024

1887 pages

Issue’s Table of Contents

Publisher

IEEE Educational Activities Department

United States

Publication History

Published: 01 November 2024

Qualifiers

Research-article

Index Terms

Recommendations

Reward modulation of hippocampal subfield activation during successful associative encoding and retrieval

Activation of inhibition: Diminishing impulsive behavior by direct current stimulation over the inferior frontal gyrus

Executive semantic processing is underpinned by a large-scale neural network: Revealing the contribution of left prefrontal, posterior temporal, and parietal cortex to controlled retrieval and selection using tms

Comments

Published In

Publisher

Publication History

Qualifiers

Other Metrics

Article Metrics

Other Metrics

Abstract

Index Terms

Recommendations

Reward modulation of hippocampal subfield activation during successful associative encoding and retrieval

Activation of inhibition: Diminishing impulsive behavior by direct current stimulation over the inferior frontal gyrus

Executive semantic processing is underpinned by a large-scale neural network: Revealing the contribution of left prefrontal, posterior temporal, and parietal cortex to controlled retrieval and selection using tms

Comments

Information

Published In

Publisher

Publication History

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

View options

Share

Share this Publication link

Share on social media

Affiliations