research-article

Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels

Authors:

Yilong YinAuthors Info & Claims

MM '18: Proceedings of the 26th ACM international conference on Multimedia

October 2018

Pages 1662 - 1669

https://doi.org/10.1145/3240508.3240683

Published: 15 October 2018 Publication History

Abstract

Hashing has recently received great attention in cross-modal retrieval. Cross-modal retrieval aims at retrieving information across heterogeneous modalities (e.g., texts vs. images). Cross-modal hashing compresses heterogeneous high-dimensional data into compact binary codes with similarity preserving, which provides efficiency and facility in both retrieval and storage. In this study, we propose a novel fast discrete cross-modal hashing (FDCH) method with regressing from semantic labels to take advantage of supervised labels to improve retrieval performance. In contrast to existing methods that learn the projection from hash codes to semantic labels, the proposed FDCH regresses the semantic labels of training examples to the corresponding hash codes with a drift. It not only accelerates the hash learning process, but also helps generate stable hash codes. Furthermore, the drift can adjust the regression and enhance the discriminative capability of hash codes. Especially in the case of training efficiency, FDCH is much faster than existing methods. Comparisons with several state-of-the-art techniques on three benchmark datasets have demonstrated the superiority of FDCH under various cross-modal retrieval scenarios.

References

[1]

Olivier Bousquet and André Elisseeff. 2002. Stability and generalization. Journal of machine learning research, Vol. 2, Mar (2002), 499--526.

Digital Library

[2]

Michael M Bronstein, Alexander M Bronstein, Fabrice Michel, and Nikos Paragios. 2010. Data fusion through cross-modality metric learning using similarity-sensitive hashing. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 3594--3601.

[3]

Yue Cao, Mingsheng Long, Jianmin Wang, and Han Zhu. 2016. Correlation Autoencoder Hashing for Supervised Cross-Modal Search. In ACM SIGMM International Conference on Multimedia Retrieval. 197--204.

Digital Library

[4]

Zhiyong Cheng and Jialie Shen. 2016. On Effective Location-Aware Music Recommendation. Acm Transactions on Information Systems, Vol. 34, 2 (2016), 1--32.

Digital Library

[5]

Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from National University of Singapore. In Proceedings of the ACM international conference on image and video retrieval. ACM, 48.

Digital Library

[6]

Guiguang Ding, Yuchen Guo, and Jile Zhou. 2014. Collective matrix factorization hashing for multimodal data. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . 2075--2082.

Digital Library

[7]

Howard C. Elman. 1986. A Stability Analysis of Incomplete LU Factorizations. Math. Comp., Vol. 47, 175 (1986), 191--217. http://www.jstor.org/stable/2008089

Digital Library

[8]

Jie Gui, Tongliang Liu, Zhenan Sun., Dacheng Tao, and Tieniu Tan. 2017. Fast supervised discrete hashing. IEEE Transactions on Pattern Analysis & Machine Intelligence, Vol. PP, 99 (2017), 1--1.

Digital Library

[9]

Arthur E. Hoerl and Robert W. Kennard. 1970. Ridge Regression: Applications to Nonorthogonal Problems. Technometrics (1970), 69--82.

[10]

Qing-Yuan Jiang and Wu-Jun Li. 2017. Deep cross-modal hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition .

[11]

Weizhen Jing, Xiushan Nie, Chaoran Cui, Xiaoming Xi, Gongping Yang, and Yilong Yin. 2018. Global-view hashing: harnessing global relations in near-duplicate video retrieval. World Wide Web-internet & Web Information Systems 3 (2018), 1--19.

[12]

Weihao Kong, Wu Jun Li, and Minyi Guo. 2012. Manhattan hashing for large-scale image retrieval. In ACM International Conference on Research and Development in Information Retrieval. 45--54.

Digital Library

[13]

Zijia Lin, Guiguang Ding, Mingqing Hu, and Jianmin Wang. 2015. Semantics-preserving hashing for cross-view retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3864--3872.

[14]

Sean Moran, Victor Lavrenko, and Miles Osborne. 2013. Variable Bit Quantisation for LSH. In Meeting of the Association for Computational Linguistics. 753--758.

[15]

Xiushan Nie, Xiaoyu Li, Yane Chai, Chaoran Cui, Xiaoming Xi, and Yilong Yin. 2018. Robust Image Fingerprinting Based on Feature Point Relationship Mining. IEEE Transactions on Information Forensics & Security, Vol. PP, 99 (2018), 1--1.

Digital Library

[16]

Xiushan Nie, Yilong Yin, Jiande Sun, Ju Liu, and Chaoran Cui. 2017. Comprehensive Feature-Based Robust Video Fingerprinting Using Tensor Model. IEEE Transactions on Multimedia, Vol. 19, 4 (2017), 785--796.

Digital Library

[17]

Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Coviello, Gabriel Doyle, Gert RG Lanckriet, Roger Levy, and Nuno Vasconcelos. 2010. A new approach to cross-modal multimedia retrieval. In Proceedings of the 18th ACM international conference on Multimedia. ACM, 251--260.

Digital Library

[18]

Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015. Supervised discrete hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 37--45.

[19]

Fumin Shen, Xiang Zhou, Yang Yang, Jingkuan Song, Heng Tao Shen, and Dacheng Tao. 2016. A fast optimization method for general binary code learning. IEEE Transactions on Image Processing, Vol. 25, 12 (2016), 5610--5621.

Digital Library

[20]

Jingkuan Song, Yang Yang, Yi Yang, Zi Huang, and Heng Tao Shen. 2013. Inter-media hashing for large-scale retrieval from heterogeneous data sources. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. ACM, 785--796.

Digital Library

[21]

Jun Tang, Ke Wang, and Ling Shao. 2016. Supervised matrix factorization hashing for cross-modal retrieval. IEEE Transactions on Image Processing, Vol. 25, 7 (2016), 3157--3166.

Digital Library

[22]

P. Tseng. 2001. Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization. Journal of Optimization Theory & Applications, Vol. 109, 3 (2001), 475--494.

Digital Library

[23]

Bokun Wang, Yang Yang, Xing Xu, Alan Hanjalic, and Heng Tao Shen. 2017. Adversarial Cross-Modal Retrieval. In Proceedings of the 2017 ACM on Multimedia Conference. ACM, 154--162.

Digital Library

[24]

Zhe Wang, Ling Yu Duan, Jie Lin, Xiaofang Wang, Tiejun Huang, and Wen Gao. 2015. Hamming compatible quantization for hashing. In International Conference on Artificial Intelligence. 2298--2304.

Digital Library

[25]

G. A. Watson. 1992. Characterization of the subdifferential of some matrix norms. Linear Algebra & Its Applications, Vol. 170, 6 (1992), 33--45.

[26]

Chaoran Cui Haoling Sun Yilong Yin Xingbo Liu, Xiushan Nie. 2018. Modality-specific Structure Preserving Hashing For Cross-modal Retrieval. In IEEE International Conference on Acoustics, Speech and Signal Processing . 1678--1682.

[27]

Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen, and Xuelong Li. 2017. Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval. IEEE Transactions on Image Processing (2017), 2494--2507.

Digital Library

[28]

Peter Young, Alice Lai, Micah Hodosh, and Julia Hockenmaier. 2014. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Transactions of the Association for Computational Linguistics (2014), 67--78.

[29]

Dongqing Zhang and Wu-Jun Li. 2014. Large-Scale Supervised Multimodal Hashing with Semantic Correlation Maximization. In the Association for the Advance of Artificial Intelligence. 7.

Digital Library

[30]

Yi Zhen and Dit-Yan Yeung. 2012. Co-regularized hashing for multimodal data. In Advances in neural information processing systems. 1376--1384.

Digital Library

[31]

Jile Zhou, Guiguang Ding, and Yuchen Guo. 2014. Latent semantic sparse hashing for cross-modal similarity search. In Proceedings of the 37th international ACM SIGIR conference on Research development in information retrieval. ACM, 415--424.

Digital Library

Cited By

Sun YRen ZHu PPeng DWang X(2024)Hierarchical Consensus Hashing for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2023.327216926(824-836)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TMM.2023.3272169
Liang MLi YYu YCao XXue ZLi ALu K(2024)Structures Aware Fine-Grained Contrastive Adversarial Hashing for Cross-Media RetrievalIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.335625836:7(3514-3528)Online publication date: Jul-2024
https://doi.org/10.1109/TKDE.2024.3356258
Kang XLiu XZhang XNie XYin Y(2024)Online Discriminative Cross-Modal HashingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.334241834:7(5242-5254)Online publication date: Jul-2024
https://doi.org/10.1109/TCSVT.2023.3342418
Show More Cited By

Index Terms

Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels
1. Information systems
  1. Information systems applications
    1. Multimedia information systems

Recommendations

Supervised Discriminative Discrete Hashing for Cross-Modal Retrieval
Advanced Data Mining and Applications
Abstract
With the growing interest in cross-modal retrieval technology, cross-modal hashing has become a mainstream trend for comparing and searching between different modalities. However, when faced with multi-label information, existing research has ... $^{}$ $^{}$
Read More
Cross-modal hashing with missing labels
Abstract
Hashing-based cross-modal retrieval methods have become increasingly popular due to their advantages in storage and speed. While current methods have demonstrated impressive results, there are still several issues that have not been addressed. ...
Read More
Semi-supervised discrete hashing for efficient cross-modal retrieval
Abstract
Cross-modal hashing has recently gained significant popularity to facilitate multimedia retrieval across different modalities. Since the acquisition of large-scale labeled training data are very labor intensive, most supervised cross-modal hashing ...
Read More

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '18: Proceedings of the 26th ACM international conference on Multimedia

October 2018

2167 pages

ISBN:9781450356657

DOI:10.1145/3240508

General Chairs:
Susanne Boll
University of Oldenburg, Germany
,
Kyoung Mu Lee
Seoul National University, Korea
,
Jiebo Luo
University of Rochester, USA
,
Wenwu Zhu
Tsinghua University, China
,
Program Chairs:
Hyeran Byun
Yonsei University, Korea
,
Chang Wen Chen
State Univ. Of New York at Buffalo, USA
,
Rainer Lienhart
University of Augsburg, Germany
,
Tao Mei
JD AI, China

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China

Conference

MM '18

Sponsor:

SIGMM

MM '18: ACM Multimedia Conference

October 22 - 26, 2018

Seoul, Republic of Korea

Acceptance Rates

MM '18 Paper Acceptance Rate 209 of 757 submissions, 28%;

Overall Acceptance Rate 995 of 4,171 submissions, 24%

Upcoming Conference

MM '24

Sponsor:
sigmm

The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

46
Total Citations
View Citations
551
Total Downloads

Downloads (Last 12 months)27
Downloads (Last 6 weeks)4

Other Metrics

View Author Metrics

Citations

Cited By

Sun YRen ZHu PPeng DWang X(2024)Hierarchical Consensus Hashing for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2023.327216926(824-836)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TMM.2023.3272169
Liang MLi YYu YCao XXue ZLi ALu K(2024)Structures Aware Fine-Grained Contrastive Adversarial Hashing for Cross-Media RetrievalIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.335625836:7(3514-3528)Online publication date: Jul-2024
https://doi.org/10.1109/TKDE.2024.3356258
Kang XLiu XZhang XNie XYin Y(2024)Online Discriminative Cross-Modal HashingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.334241834:7(5242-5254)Online publication date: Jul-2024
https://doi.org/10.1109/TCSVT.2023.3342418
Zhang XLiu XNie XKang XYin Y(2024)Semi-Supervised Semi-Paired Cross-Modal HashingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.331238534:7(6517-6529)Online publication date: Jul-2024
https://doi.org/10.1109/TCSVT.2023.3312385
Yong KShu ZWang HYu Z(2024)Two-stage zero-shot sparse hashing with missing labels for cross-modal retrievalPattern Recognition10.1016/j.patcog.2024.110717155(110717)Online publication date: Nov-2024
https://doi.org/10.1016/j.patcog.2024.110717
Yong KShu ZYu JYu Z(2024)Zero-shot discrete hashing with adaptive class correlation for cross-modal retrievalKnowledge-Based Systems10.1016/j.knosys.2024.111820295(111820)Online publication date: Jul-2024
https://doi.org/10.1016/j.knosys.2024.111820
Wang YDong FWang KNie XChen Z(2024)Weighted cross-modal hashing with label enhancementKnowledge-Based Systems10.1016/j.knosys.2024.111657293(111657)Online publication date: Jun-2024
https://doi.org/10.1016/j.knosys.2024.111657
Yong KShu ZYu Z(2024)Unpaired robust hashing with noisy labels for zero-shot cross-modal retrievalEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.108197133(108197)Online publication date: Jul-2024
https://doi.org/10.1016/j.engappai.2024.108197
Yong KShu ZWang HYu Z(2024)Robust zero-shot discrete hashing with noisy labels for cross-modal retrievalInternational Journal of Machine Learning and Cybernetics10.1007/s13042-024-02131-5Online publication date: 13-Apr-2024
https://doi.org/10.1007/s13042-024-02131-5
Zhang ZZhang Z(2024)Ordinal-Preserving Latent Graph HashingBinary Representation Learning on Visual Images10.1007/978-981-97-2112-2_5(111-141)Online publication date: 7-Mar-2024
https://doi.org/10.1007/978-981-97-2112-2_5
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents