DOI: 10.5555/2832747.2832791

Semantic topic multimodal hashing for cross-media retrieval

Published: 25 July 2015

Abstract

Multimodal hashing is essential to cross-media similarity search because of its low storage cost and fast query speed. Most existing multimodal hashing methods embed heterogeneous data into a common low-dimensional Hamming space and then round the continuous embeddings to obtain binary codes. However, by relaxing the discrete constraints they neglect the inherently discrete nature of hashing, which degrades retrieval performance, especially for long codes. To address this, a novel method, Semantic Topic Multimodal Hashing (STMH), is developed that takes latent semantic information into account in the coding procedure. It first discovers clustering patterns in the texts and robustly factorizes the image matrix to obtain multiple semantic topics of texts and concepts of images. The learned multimodal semantic features are then transformed into a common subspace according to their correlations. Finally, each bit of the unified hash code is generated directly by determining whether a topic or concept is contained in a text or an image. The model obtained by STMH is therefore better suited to hashing, as it learns discrete hash codes directly in the coding process. Experimental results demonstrate that the proposed method outperforms several state-of-the-art methods.
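
The contrast drawn in the abstract, between rounding continuous embeddings and setting each bit according to whether a topic or concept is present, can be illustrated with a small NumPy sketch. This is not the STMH algorithm: the topic weights, the 0.15 threshold, and the function names are hypothetical, and the sketch assumes the text topics and image concepts have already been aligned in a common subspace. It only shows what a "one bit per topic" code looks like and how such codes are compared by Hamming distance.

```python
import numpy as np

def round_to_codes(embeddings):
    """Relax-then-round baseline: binarize continuous embeddings by their sign."""
    return (np.asarray(embeddings) > 0).astype(np.uint8)

def topic_presence_codes(weights, threshold):
    """One bit per topic/concept: 1 if the item's weight exceeds the threshold.

    The thresholding rule is an illustrative assumption, not the paper's rule.
    """
    return (np.asarray(weights) > threshold).astype(np.uint8)

def hamming_distance(query, database):
    """Hamming distance between one code of shape (k,) and each row of an (n, k) array."""
    return np.count_nonzero(database != query, axis=1)

# Toy, made-up weights: one text query and two images over four aligned topics.
text_topic_weights = np.array([[0.7, 0.1, 0.2, 0.0]])
image_concept_weights = np.array([
    [0.6, 0.0, 0.3, 0.1],
    [0.0, 0.8, 0.0, 0.2],
])

text_codes = topic_presence_codes(text_topic_weights, threshold=0.15)
image_codes = topic_presence_codes(image_concept_weights, threshold=0.15)

# Cross-media retrieval: rank images by Hamming distance to the text query.
distances = hamming_distance(text_codes[0], image_codes)
ranking = np.argsort(distances, kind="stable")
print(text_codes, image_codes, distances, ranking)
```

In this toy setting the first image shares both of the query's active topics and ranks first. Because the codes are binary from the start, no rounding step is needed, which is the property the abstract attributes to learning discrete codes directly.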

Published In

IJCAI'15: Proceedings of the 24th International Conference on Artificial Intelligence
July 2015
4429 pages
ISBN:9781577357384

Sponsors

  • The International Joint Conferences on Artificial Intelligence, Inc. (IJCAI)

Publisher

AAAI Press
