Scalable Supervised Discrete Hashing for Large-Scale Search

Authors:

Xin-Shun Xu

WWW '18: Proceedings of the 2018 World Wide Web Conference

Pages 1603 - 1612

https://doi.org/10.1145/3178876.3186072

Published: 10 April 2018


Abstract

Supervised hashing methods have attracted much attention in recent years. However, most existing supervised hashing algorithms suffer from one or both of the following problems. First, most of them use the pairwise similarity matrix, whose size is quadratic in the number of training samples, to supervise the learning of hash codes; thus, they do not scale to large datasets. Second, most of them relax the discrete constraints for easier optimization and then quantize the learned real-valued solution into binary hash codes; the quantization error introduced by this relaxation can degrade retrieval performance. To address these issues and make supervised hashing scalable to large datasets, we present a novel hashing method named Scalable Supervised Discrete Hashing (SSDH). Specifically, based on a new loss function, SSDH bypasses direct optimization on the n-by-n pairwise similarity matrix. In addition, SSDH uses no relaxation in the learning procedure and thus avoids the large quantization error problem. Moreover, during learning it leverages both the pairwise similarity matrix and the label matrix, so more semantic information can be embedded into the learned hash codes. Extensive experiments are conducted on six benchmark datasets, including two large-scale datasets, NUS-WIDE and ImageNet. The results show that SSDH outperforms state-of-the-art baselines on these datasets, demonstrating its effectiveness and efficiency.
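The scalability concern raised in the abstract can be illustrated with a small sketch. This is not the SSDH algorithm or its loss function, neither of which the abstract specifies. It only assumes, for illustration, that the pairwise similarity is defined from a label matrix as S = Y Yᵀ (with Y of size n by c), so that any product S B with an n by r code matrix B can be computed as Y (Yᵀ B) without ever materializing the n-by-n matrix S. All function names, sizes, and the one-hot label setup below are hypothetical.

```python
import random


def matmul(a, b):
    """Multiply two dense matrices stored as lists of rows."""
    cols_b = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b] for row in a]


def transpose(a):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]


def one_hot_labels(n, c, rng):
    """Hypothetical label matrix Y (n x c): exactly one class per sample."""
    labels = []
    for _ in range(n):
        row = [0] * c
        row[rng.randrange(c)] = 1
        labels.append(row)
    return labels


def random_codes(n, r, rng):
    """Hypothetical binary code matrix B (n x r) with entries in {-1, +1}."""
    return [[rng.choice((-1, 1)) for _ in range(r)] for _ in range(n)]


def similarity_times_codes_naive(Y, B):
    """Form S = Y Y^T explicitly (n x n), then multiply by B."""
    S = matmul(Y, transpose(Y))
    return matmul(S, B)


def similarity_times_codes_factored(Y, B):
    """Compute the same product as Y (Y^T B); the largest intermediate is c x r."""
    YtB = matmul(transpose(Y), B)
    return matmul(Y, YtB)


rng = random.Random(0)
n, c, r = 200, 5, 8  # samples, classes, code length (illustrative sizes)
Y = one_hot_labels(n, c, rng)
B = random_codes(n, r, rng)

naive = similarity_times_codes_naive(Y, B)
fast = similarity_times_codes_factored(Y, B)

# Integer arithmetic is exact, so the two results match entry for entry.
print(naive == fast)
```

Under this assumption the naive route stores n² similarity entries, while the factored route stores only the c by r intermediate, which is the kind of saving that makes avoiding the explicit similarity matrix attractive when n is large.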


Published In

WWW '18: Proceedings of the 2018 World Wide Web Conference

April 2018

2000 pages

ISBN: 9781450356398

General Chairs: Pierre-Antoine Champin (Université Claude Bernard Lyon 1, France), Fabien Gandon (Inria, Université Côte d'Azur, CNRS, I3S, France), Lionel Médini (Université Claude Bernard Lyon 1, France)

Program Chairs: Mounia Lalmas (Spotify, UK), Panagiotis G. Ipeirotis (New York University, USA)

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

IW3C2: International World Wide Web Conference Committee

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Qualifiers

Research-article

Funding Sources

Natural Science Foundation of Shandong Province
National Natural Science Foundation of China

Conference

WWW '18

Sponsor:

IW3C2

WWW '18: The Web Conference 2018

April 23 - 27, 2018

Lyon, France

Acceptance Rates

WWW '18 Paper Acceptance Rate: 170 of 1,155 submissions, 15%

Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%

