Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3240508.3240683acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels

Published: 15 October 2018 Publication History
  • Get Citation Alerts
  • Abstract

    Hashing has recently received great attention in cross-modal retrieval. Cross-modal retrieval aims at retrieving information across heterogeneous modalities (e.g., texts vs. images). Cross-modal hashing compresses heterogeneous high-dimensional data into compact binary codes with similarity preserving, which provides efficiency and facility in both retrieval and storage. In this study, we propose a novel fast discrete cross-modal hashing (FDCH) method with regressing from semantic labels to take advantage of supervised labels to improve retrieval performance. In contrast to existing methods that learn the projection from hash codes to semantic labels, the proposed FDCH regresses the semantic labels of training examples to the corresponding hash codes with a drift. It not only accelerates the hash learning process, but also helps generate stable hash codes. Furthermore, the drift can adjust the regression and enhance the discriminative capability of hash codes. Especially in the case of training efficiency, FDCH is much faster than existing methods. Comparisons with several state-of-the-art techniques on three benchmark datasets have demonstrated the superiority of FDCH under various cross-modal retrieval scenarios.

    References

    [1]
    Olivier Bousquet and André Elisseeff. 2002. Stability and generalization. Journal of machine learning research, Vol. 2, Mar (2002), 499--526.
    [2]
    Michael M Bronstein, Alexander M Bronstein, Fabrice Michel, and Nikos Paragios. 2010. Data fusion through cross-modality metric learning using similarity-sensitive hashing. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 3594--3601.
    [3]
    Yue Cao, Mingsheng Long, Jianmin Wang, and Han Zhu. 2016. Correlation Autoencoder Hashing for Supervised Cross-Modal Search. In ACM SIGMM International Conference on Multimedia Retrieval. 197--204.
    [4]
    Zhiyong Cheng and Jialie Shen. 2016. On Effective Location-Aware Music Recommendation. Acm Transactions on Information Systems, Vol. 34, 2 (2016), 1--32.
    [5]
    Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from National University of Singapore. In Proceedings of the ACM international conference on image and video retrieval. ACM, 48.
    [6]
    Guiguang Ding, Yuchen Guo, and Jile Zhou. 2014. Collective matrix factorization hashing for multimodal data. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . 2075--2082.
    [7]
    Howard C. Elman. 1986. A Stability Analysis of Incomplete LU Factorizations. Math. Comp., Vol. 47, 175 (1986), 191--217. http://www.jstor.org/stable/2008089
    [8]
    Jie Gui, Tongliang Liu, Zhenan Sun., Dacheng Tao, and Tieniu Tan. 2017. Fast supervised discrete hashing. IEEE Transactions on Pattern Analysis & Machine Intelligence, Vol. PP, 99 (2017), 1--1.
    [9]
    Arthur E. Hoerl and Robert W. Kennard. 1970. Ridge Regression: Applications to Nonorthogonal Problems. Technometrics (1970), 69--82.
    [10]
    Qing-Yuan Jiang and Wu-Jun Li. 2017. Deep cross-modal hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition .
    [11]
    Weizhen Jing, Xiushan Nie, Chaoran Cui, Xiaoming Xi, Gongping Yang, and Yilong Yin. 2018. Global-view hashing: harnessing global relations in near-duplicate video retrieval. World Wide Web-internet & Web Information Systems 3 (2018), 1--19.
    [12]
    Weihao Kong, Wu Jun Li, and Minyi Guo. 2012. Manhattan hashing for large-scale image retrieval. In ACM International Conference on Research and Development in Information Retrieval. 45--54.
    [13]
    Zijia Lin, Guiguang Ding, Mingqing Hu, and Jianmin Wang. 2015. Semantics-preserving hashing for cross-view retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3864--3872.
    [14]
    Sean Moran, Victor Lavrenko, and Miles Osborne. 2013. Variable Bit Quantisation for LSH. In Meeting of the Association for Computational Linguistics. 753--758.
    [15]
    Xiushan Nie, Xiaoyu Li, Yane Chai, Chaoran Cui, Xiaoming Xi, and Yilong Yin. 2018. Robust Image Fingerprinting Based on Feature Point Relationship Mining. IEEE Transactions on Information Forensics & Security, Vol. PP, 99 (2018), 1--1.
    [16]
    Xiushan Nie, Yilong Yin, Jiande Sun, Ju Liu, and Chaoran Cui. 2017. Comprehensive Feature-Based Robust Video Fingerprinting Using Tensor Model. IEEE Transactions on Multimedia, Vol. 19, 4 (2017), 785--796.
    [17]
    Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Coviello, Gabriel Doyle, Gert RG Lanckriet, Roger Levy, and Nuno Vasconcelos. 2010. A new approach to cross-modal multimedia retrieval. In Proceedings of the 18th ACM international conference on Multimedia. ACM, 251--260.
    [18]
    Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015. Supervised discrete hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 37--45.
    [19]
    Fumin Shen, Xiang Zhou, Yang Yang, Jingkuan Song, Heng Tao Shen, and Dacheng Tao. 2016. A fast optimization method for general binary code learning. IEEE Transactions on Image Processing, Vol. 25, 12 (2016), 5610--5621.
    [20]
    Jingkuan Song, Yang Yang, Yi Yang, Zi Huang, and Heng Tao Shen. 2013. Inter-media hashing for large-scale retrieval from heterogeneous data sources. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. ACM, 785--796.
    [21]
    Jun Tang, Ke Wang, and Ling Shao. 2016. Supervised matrix factorization hashing for cross-modal retrieval. IEEE Transactions on Image Processing, Vol. 25, 7 (2016), 3157--3166.
    [22]
    P. Tseng. 2001. Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization. Journal of Optimization Theory & Applications, Vol. 109, 3 (2001), 475--494.
    [23]
    Bokun Wang, Yang Yang, Xing Xu, Alan Hanjalic, and Heng Tao Shen. 2017. Adversarial Cross-Modal Retrieval. In Proceedings of the 2017 ACM on Multimedia Conference. ACM, 154--162.
    [24]
    Zhe Wang, Ling Yu Duan, Jie Lin, Xiaofang Wang, Tiejun Huang, and Wen Gao. 2015. Hamming compatible quantization for hashing. In International Conference on Artificial Intelligence. 2298--2304.
    [25]
    G. A. Watson. 1992. Characterization of the subdifferential of some matrix norms. Linear Algebra & Its Applications, Vol. 170, 6 (1992), 33--45.
    [26]
    Chaoran Cui Haoling Sun Yilong Yin Xingbo Liu, Xiushan Nie. 2018. Modality-specific Structure Preserving Hashing For Cross-modal Retrieval. In IEEE International Conference on Acoustics, Speech and Signal Processing . 1678--1682.
    [27]
    Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen, and Xuelong Li. 2017. Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval. IEEE Transactions on Image Processing (2017), 2494--2507.
    [28]
    Peter Young, Alice Lai, Micah Hodosh, and Julia Hockenmaier. 2014. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Transactions of the Association for Computational Linguistics (2014), 67--78.
    [29]
    Dongqing Zhang and Wu-Jun Li. 2014. Large-Scale Supervised Multimodal Hashing with Semantic Correlation Maximization. In the Association for the Advance of Artificial Intelligence. 7.
    [30]
    Yi Zhen and Dit-Yan Yeung. 2012. Co-regularized hashing for multimodal data. In Advances in neural information processing systems. 1376--1384.
    [31]
    Jile Zhou, Guiguang Ding, and Yuchen Guo. 2014. Latent semantic sparse hashing for cross-modal similarity search. In Proceedings of the 37th international ACM SIGIR conference on Research development in information retrieval. ACM, 415--424.

    Cited By

    View all
    • (2024)Hierarchical Consensus Hashing for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2023.327216926(824-836)Online publication date: 1-Jan-2024
    • (2024)Structures Aware Fine-Grained Contrastive Adversarial Hashing for Cross-Media RetrievalIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.335625836:7(3514-3528)Online publication date: Jul-2024
    • (2024)Online Discriminative Cross-Modal HashingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.334241834:7(5242-5254)Online publication date: Jul-2024
    • Show More Cited By

    Index Terms

    1. Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '18: Proceedings of the 26th ACM international conference on Multimedia
      October 2018
      2167 pages
      ISBN:9781450356657
      DOI:10.1145/3240508
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 15 October 2018

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. cross-modal retrieval
      2. learning-based hashing
      3. supervised hashing

      Qualifiers

      • Research-article

      Funding Sources

      • National Natural Science Foundation of China

      Conference

      MM '18
      Sponsor:
      MM '18: ACM Multimedia Conference
      October 22 - 26, 2018
      Seoul, Republic of Korea

      Acceptance Rates

      MM '18 Paper Acceptance Rate 209 of 757 submissions, 28%;
      Overall Acceptance Rate 995 of 4,171 submissions, 24%

      Upcoming Conference

      MM '24
      The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)27
      • Downloads (Last 6 weeks)4

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Hierarchical Consensus Hashing for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2023.327216926(824-836)Online publication date: 1-Jan-2024
      • (2024)Structures Aware Fine-Grained Contrastive Adversarial Hashing for Cross-Media RetrievalIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.335625836:7(3514-3528)Online publication date: Jul-2024
      • (2024)Online Discriminative Cross-Modal HashingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.334241834:7(5242-5254)Online publication date: Jul-2024
      • (2024)Semi-Supervised Semi-Paired Cross-Modal HashingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.331238534:7(6517-6529)Online publication date: Jul-2024
      • (2024)Two-stage zero-shot sparse hashing with missing labels for cross-modal retrievalPattern Recognition10.1016/j.patcog.2024.110717155(110717)Online publication date: Nov-2024
      • (2024)Zero-shot discrete hashing with adaptive class correlation for cross-modal retrievalKnowledge-Based Systems10.1016/j.knosys.2024.111820295(111820)Online publication date: Jul-2024
      • (2024)Weighted cross-modal hashing with label enhancementKnowledge-Based Systems10.1016/j.knosys.2024.111657293(111657)Online publication date: Jun-2024
      • (2024)Unpaired robust hashing with noisy labels for zero-shot cross-modal retrievalEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.108197133(108197)Online publication date: Jul-2024
      • (2024)Robust zero-shot discrete hashing with noisy labels for cross-modal retrievalInternational Journal of Machine Learning and Cybernetics10.1007/s13042-024-02131-5Online publication date: 13-Apr-2024
      • (2024)Ordinal-Preserving Latent Graph HashingBinary Representation Learning on Visual Images10.1007/978-981-97-2112-2_5(111-141)Online publication date: 7-Mar-2024
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media