Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3474085.3475605acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Two-pronged Strategy: Lightweight Augmented Graph Network Hashing for Scalable Image Retrieval

Published: 17 October 2021 Publication History

Abstract

Hashing learns compact binary codes to store and retrieve massive data efficiently. Particularly, unsupervised deep hashing is supported by powerful deep neural networks and has the desirable advantage of label independence. It is a promising technique for scalable image retrieval. However, deep models introduce a large number of parameters, which is hard to optimize due to the lack of explicit semantic labels and brings considerable training cost. As a result, the retrieval accuracy and training efficiency of existing unsupervised deep hashing are still limited. To tackle the problems, in this paper, we propose a simple and efficient Lightweight Augmented Graph Network Hashing (LAGNH) method with a two-pronged strategy. For one thing, we extract the inner structure of the image as the auxiliary semantics to enhance the semantic supervision of the unsupervised hash learning process. For another, we design a lightweight network structure with the assistance of the auxiliary semantics, which greatly reduces the number of network parameters that needs to be optimized and thus greatly accelerates the training process. Specifically, we design a cross-modal attention module based on the auxiliary semantic information to adaptively mitigate the adverse effects in the deep image features. Besides, the hash codes are learned by multi-layer message passing within an adversarial regularized graph convolutional network. Simultaneously, the semantic representation capability of hash codes is further enhanced by reconstructing the similarity graph. Experimental results show that our method achieves significant performance improvement compared with the state-of-the-art unsupervised deep hashing methods in terms of both retrieval accuracy and efficiency. Notably, on MS-COCO dataset, our method achieves more than 10% improvement on retrieval precision and 2.7x speedup on training time compared with the second best result.

References

[1]
Yue Cao, Mingsheng Long, Bin Liu, and Jianmin Wang. 2018. Deep Cauchy Hashing for Hamming Space Retrieval. In CVPR. 1229--1237.
[2]
TatSeng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: A Real-world Web Image Database from National University of Singapore. In CIVR. 48:1--48:9.
[3]
Hui Cui, Lei Zhu, Jingjing Li, Yang Yang, and Liqiang Nie. 2020. Scalable Deep Hashing for Large-Scale Social Image Retrieval. TIP, Vol. 29 (2020), 1271--1284.
[4]
Jia Deng, Wei Dong, Richard Socher, LiJia Li, Kai Li, and FeiFei Li. 2009. Imagenet: A large-scale hierarchical image database. In CVPR. 248--255.
[5]
ThanhToan Do, AnhDzung Doan, and NgaiMan Cheung. 2016. Learning to hash with binary deep neural network. In ECCV. 219--234.
[6]
Aristides Gionis, Piotr Indyk, and Rajeev Motwani. 1999. Similarity Search in High Dimensions via Hashing. In VLDB. 518--529.
[7]
Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2013. Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval. TPAMI, Vol. 35, 12 (2013), 2916--2929.
[8]
Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron C. Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In NeurIPS. 2672--2680.
[9]
Qinghao Hu, Jiaxiang Wu, Jian Cheng, Lifang Wu, and Hanqing Lu. 2017. Pseudo Label based Unsupervised Deep Discriminative Hashing for Image Retrieval. In MM . 1584--1590.
[10]
Qingyuan Jiang and Wujun Li. 2015. Scalable Graph Hashing with Feature Transformation. In IJCAI. 2248--2254.
[11]
Qing-Yuan Jiang and Wu-Jun Li. 2017. Deep Cross-Modal Hashing. In CVPR. 3270--3278.
[12]
Qing-Yuan Jiang and Wu-Jun Li. 2018. Asymmetric Deep Supervised Hashing. In AAAI. 3342--3349.
[13]
Lu Jin, Zechao Li, Yonghua Pan, and Jinhui Tang. 2020. Weakly-Supervised Image Hashing through Masked Visual-Semantic Graph-based Reasoning. In MM. 916--924.
[14]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In ICLR.
[15]
Thomas N. Kipf and Max Welling. 2017a. Semi-Supervised Classification with Graph Convolutional Networks. In ICLR.
[16]
Thomas N. Kipf and Max Welling. 2017b. Variational Graph Auto-Encoders. In NeurIPS Bayesian Deep Learning Workshop.
[17]
Hanjiang Lai, Yan Pan, Ye Liu, and Shuicheng Yan. 2015. Simultaneous feature learning and hash coding with deep neural networks. In Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit. (CVPR). 3270--3278.
[18]
WuJun Li, Sheng Wang, and WangCheng Kang. 2016. Feature learning based deep supervised hashing with pairwise labels. In IJCAI. 1711--1717.
[19]
Guosheng Lin, Chunhua Shen, and Anton van den Hengel. 2015. Supervised Hashing Using Graph Cuts and Boosted Decision Trees. TPAMI, Vol. 37, 11 (2015), 2317--2331.
[20]
Kevin Lin, Jiwen Lu, ChuSong Chen, and Jie Zhou. 2016. Learning compact binary descriptors with unsupervised deep neural networks. In CVPR. 1183--1192.
[21]
Tsung-Yi Lin, Michael Maire, Serge J. Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollá r, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. In ECCV, Vol. 8693. 740--755.
[22]
Fan Liu, Zhiyong Cheng, Lei Zhu, Zan Gao, and Liqiang Nie. 2021 b. Interest-aware Message-Passing GCN for Recommendation. In WWW. 1296--1305.
[23]
Wei Liu, Jun Wang, Rongrong Ji, YuGang Jiang, and ShihFu Chang. 2012. Supervised hashing with kernels. In CVPR. 2074--2081.
[24]
Wei Liu, Jun Wang, Sanjiv Kumar, and ShihFu Chang. 2011. Hashing with graphs. In ICML. 1--8.
[25]
Zhenguang Liu, Haoming Chen, Runyang Feng, Shuang Wu, Shouling Ji, Bailin Yang, and Xun Wang. 2021 a. Deep Dual Consecutive Network for Human Pose Estimation. In CVPR. 525--534.
[26]
Zhenguang Liu, Peng Qian, Xiaoyang Wang, Yuan Zhuang, Lin Qiu, and Xun Wang. 2021 c. Combining Graph Neural Networks with Expert Knowledge for Smart Contract Vulnerability Detection. TKDE (2021). https://doi.org/10.1109/TKDE.2021.3095196
[27]
Zhenguang Liu, Shuang Wu, Shuyuan Jin, Qi Liu, Shijian Lu, Roger Zimmermann, and Li Cheng. 2019. Towards Natural and Accurate Future Motion Prediction of Humans and Animals. In CVPR. 10004--10012.
[28]
Jiwen Lu, Venice Erin Liong, and Jie Zhou. 2017a. Deep Hashing for Scalable Image Search. TIP, Vol. 26, 5 (2017), 2352--2367.
[29]
Xiaoqiang Lu, Xiangtao Zheng, and Xuelong Li. 2017b. Latent Semantic Minimal Hashing for Image Retrieval. TIP, Vol. 26, 1 (2017), 355--368.
[30]
Xu Lu, Lei Zhu, Zhiyong Cheng, Jingjing Li, Xiushan Nie, and Huaxiang Zhang. 2019. Flexible Online Multi-modal Hashing for Large-scale Multimedia Retrieval. In MM. 1129--1137.
[31]
Shirui Pan, Ruiqi Hu, Guodong Long, Jing Jiang, Lina Yao, and Chengqi Zhang. 2018. Adversarially Regularized Graph Autoencoder for Graph Embedding. In IJCAI. 2609--2615.
[32]
Maxim Raginsky and Svetlana Lazebnik. 2009. Locality-sensitive binary codes from shift-invariant kernels. In NeurIPS. 1509--1517.
[33]
Shaoqing Ren, Kaiming He, Ross B. Girshick, and Jian Sun. 2017. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. TPAMI, Vol. 39, 6 (2017), 1137--1149.
[34]
Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015. Supervised Discrete Hashing. In CVPR. 37--45.
[35]
Fumin Shen, Yan Xu, Li Liu, Yang Yang, Zi Huang, and Heng Tao Shen. 2018. Unsupervised deep hashing with similarity-adaptive and discrete optimization. TPAMI, Vol. 40, 12 (2018), 3034--3044.
[36]
Yuming Shen, Jie Qin, Jiaxin Chen, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, and Ling Shao. 2020. Auto-Encoding Twin-Bottleneck Hashing. In CVPR. 2815--2824.
[37]
Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In ICLR.
[38]
Jingkuan Song, Tao He, Lianli Gao, Xing Xu, Alan Hanjalic, and Heng Tao Shen. 2018. Binary Generative Adversarial Networks for Image Retrieval. In AAAI. 394--401.
[39]
Shupeng Su, Chao Zhang, Kai Han, and Yonghong Tian. 2018. Greedy Hash: Towards Fast Optimization for Accurate Hash Coding in CNN. In Proc. Adv. Neural Inf. Process. Syst. (NeurIPS). 806--815.
[40]
Rong-Cheng Tu, Xianling Mao, and Wei Wei. 2020. MLS3RDUH: Deep Unsupervised Hashing via Manifold based Local Semantic Similarity Structure Reconstructing. In IJCAI. 3466--3472.
[41]
Jun Wang, Sanjiv Kumar, and ShihFu Chang. 2012. Semi-Supervised Hashing for Large-Scale Search. TPAMI, Vol. 34, 12 (2012), 2393--2406.
[42]
Jun Wang, Wei Liu, Sanjiv Kumar, and ShihFu Chang. 2016. Learning to Hash for Indexing Big Data - A Survey. PIEEE, Vol. 104, 1 (2016), 34--57.
[43]
Jingdong Wang, Ting Zhang, Jingkuan Song, Nicu Sebe, and Heng Tao Shen. 2018. A Survey on Learning to Hash. TPAMI, Vol. 40, 4 (2018), 769--790.
[44]
Yair Weiss, Antonio Torralba, and Robert Fergus. 2008. Spectral Hashing. In NeurIPS. 1753--1760.
[45]
Rongkai Xia, Yan Pan, Hanjiang Lai, Cong Liu, and Shuicheng Yan. 2014. Supervised hashing for image retrieval via image representation learning. In AAAI. 2156--2162.
[46]
Erkun Yang, Cheng Deng, Tongliang Liu, Wei Liu, and Dacheng Tao. 2018. Semantic Structure-based Unsupervised Deep Hashing. In IJCAI. 1064--1070.
[47]
Erkun Yang, Tongliang Liu, Cheng Deng, Wei Liu, and Dacheng Tao. 2019. DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs. In CVPR. 2946--2955.
[48]
Xun Yang, Jianfeng Dong, Yixin Cao, Xun Wang, Meng Wang, and Tat-Seng Chua. 2020 a. Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval. In SIGIR. 1339--1348.
[49]
Xun Yang, Fuli Feng, Wei Ji, Meng Wang, and Tat-Seng Chua. 2021. Deconfounded Video Moment Retrieval with Causal Intervention. In SIGIR. 1--10.
[50]
Xun Yang, Xueliang Liu, Meng Jian, Xinjian Gao, and Meng Wang. 2020 b. Weakly-Supervised Video Object Grounding by Exploring Spatio-Temporal Contexts. In MM . 1939--1947.
[51]
Peichao Zhang, Wei Zhang, WuJun Li, and Minyi Guo. 2014. Supervised hashing with latent factor models. In SIGIR. 173--182.
[52]
Wanqian Zhang, Dayan Wu, Yu Zhou, Bo Li, Weiping Wang, and Dan Meng. 2020 b. Deep Unsupervised Hybrid-similarity Hadamard Hashing. In MM. 3274--3282.
[53]
Zheng Zhang, Zhihui Lai, Zi Huang, Wai Keung Wong, Guo-Sen Xie, Li Liu, and Ling Shao. 2019 a. Scalable Supervised Asymmetric Hashing With Semantic and Latent Factor Embedding. TIP, Vol. 28, 10 (2019), 4803--4818.
[54]
Zheng Zhang, Luyao Liu, Yadan Luo, Zi Huang, Fumin Shen, Heng Tao Shen, and Guangming Lu. 2020 a. Inductive Structure Consistent Hashing via Flexible Semantic Calibration. TNNLS (2020). https://doi.org/10.1109/TNNLS.2020.3018790
[55]
Zheng Zhang, Li Liu, Fumin Shen, Heng Tao Shen, and Ling Shao. 2019 b. Binary Multi-View Clustering. TPAMI, Vol. 41, 7 (2019), 1774--1782.
[56]
Chaoqun Zheng, Lei Zhu, Xu Lu, Jingjing Li, Zhiyong Cheng, and Hanwang Zhang. 2020. Fast Discrete Collaborative Multi-Modal Hashing for Large-Scale Multimedia Retrieval. TKDE, Vol. 32, 11 (2020), 2171--2184.
[57]
Lei Zhu, Hui Cui, Zhiyong Cheng, Jingjing Li, and Zheng Zhang. 2021. Dual-level Semantic Transfer Deep Hashing for Efficient Social Image Retrieval. TCSVT, Vol. 31, 4 (2021), 1478--1489.
[58]
Lei Zhu, Xu Lu, Zhiyong Cheng, Jingjing Li, and Huaxiang Zhang. 2020 a. Deep Collaborative Multi-View Hashing for Large-Scale Image Search. TIP, Vol. 29 (2020), 4643--4655.
[59]
Lei Zhu, Xu Lu, Zhiyong Cheng, Jingjing Li, and Huaxiang Zhang. 2020 b. Flexible Multi-modal Hashing for Scalable Multimedia Retrieval. TIST, Vol. 11, 2 (2020), 14:1--14:20.

Cited By

View all
  • (2024)Online Cross-modal Hashing With Dynamic PrototypeACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366524920:8(1-18)Online publication date: 13-Jun-2024
  • (2024)Semantic-Enhanced Proxy-Guided Hashing for Long-Tailed Image RetrievalIEEE Transactions on Multimedia10.1109/TMM.2024.339468426(9499-9514)Online publication date: 2024
  • (2024)Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with Multi-granularity Aligned TransformersInternational Journal of Computer Vision10.1007/s11263-024-02009-7132:8(2765-2797)Online publication date: 18-Feb-2024
  • Show More Cited By

Index Terms

  1. Two-pronged Strategy: Lightweight Augmented Graph Network Hashing for Scalable Image Retrieval

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '21: Proceedings of the 29th ACM International Conference on Multimedia
    October 2021
    5796 pages
    ISBN:9781450386517
    DOI:10.1145/3474085
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. attention mechanism
    2. graph neural networks
    3. image retrieval
    4. similarity preservation
    5. unsupervised deep hashing

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    MM '21
    Sponsor:
    MM '21: ACM Multimedia Conference
    October 20 - 24, 2021
    Virtual Event, China

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)35
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 22 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Online Cross-modal Hashing With Dynamic PrototypeACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366524920:8(1-18)Online publication date: 13-Jun-2024
    • (2024)Semantic-Enhanced Proxy-Guided Hashing for Long-Tailed Image RetrievalIEEE Transactions on Multimedia10.1109/TMM.2024.339468426(9499-9514)Online publication date: 2024
    • (2024)Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with Multi-granularity Aligned TransformersInternational Journal of Computer Vision10.1007/s11263-024-02009-7132:8(2765-2797)Online publication date: 18-Feb-2024
    • (2023)CLIP-Based Adaptive Graph Attention Network for Large-Scale Unsupervised Multi-Modal Hashing RetrievalSensors10.3390/s2307343923:7(3439)Online publication date: 24-Mar-2023
    • (2023)Neural Image Popularity Assessment with Retrieval-augmented TransformerProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3611918(2427-2436)Online publication date: 26-Oct-2023
    • (2023)Stepwise Refinement Short Hashing for Image RetrievalProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3611864(6501-6509)Online publication date: 26-Oct-2023
    • (2023)HHF: Hashing-Guided Hinge Function for Deep Hashing RetrievalIEEE Transactions on Multimedia10.1109/TMM.2022.322259825(7428-7440)Online publication date: 1-Jan-2023
    • (2023)Multi-Modal Hashing for Efficient Multimedia Retrieval: A SurveyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.328292136:1(239-260)Online publication date: 5-Jun-2023
    • (2022)BiasHash: A Bayesian Hashing Framework for Image Retrieval2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP)10.1109/IVMSP54334.2022.9816233(1-5)Online publication date: 26-Jun-2022
    • (2022)HAPGNNInformation Sciences: an International Journal10.1016/j.ins.2022.09.041613:C(435-452)Online publication date: 1-Oct-2022

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media