Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3178876.3186072acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
research-article
Free access

Scalable Supervised Discrete Hashing for Large-Scale Search

Published: 10 April 2018 Publication History
  • Get Citation Alerts
  • Abstract

    Supervised hashing methods have attracted much attention in these years. However, most existing supervised hashing algorithms have some of the following problems. First, most of them leverage the pairwise similarity matrix, whose size is quadratic to the number of training samples, to supervise the learning of hash codes. Thus, they are not scalable when dealing with large data. Second, most of them relax the discrete constraints for easy optimization and then quantize the learnt real-valued solution to binary hash codes. Therefore, the quantization error caused by the relaxation may lead to a decline of retrieval performance. To address these issues and make the supervised method scalable to large datasets, we present a novel hashing method, named Scalable Supervised Discrete Hashing (SSDH). Specifically, based on a new loss function, SSDH bypasses the direct optimization on the n by n pairwise similarity matrix. In addition, SSDH adopts no relaxation optimization scheme in the learning procedure and avoids the large quantization error problem. Moreover, during learning, it leverages both the pairwise similarity matrix and label matrix; thus, more semantic information can be embedded to the learning of hash codes. Extensive experiments are conducted on six benchmark datasets including two large-scale datasets, i.e., NUS-WIDE and ImageNet. The results show that SSDH can outperform state-of-the-art baselines on these datasets, demonstrating its effectiveness and efficiency.

    References

    [1]
    Alexandr Andoni, Piotr Indyk, Thijs Laarhoven, Ilya Razenshteyn, and Ludwig Schmidt. 2015. Practical and optimal LSH for angular distance. In NIPS. 1225--1233.
    [2]
    Fatih Cakir and Stan Sclaroff. 2015 a. Adaptive hashing for fast similarity search. In ICCV. 1044--1052.
    [3]
    Fatih Cakir and Stan Sclaroff. 2015 b. Online supervised hashing. In ICIP. 2606--2610.
    [4]
    Yuan Cao, Heng Qi, Wenrui Zhou, Jien Kato, Keqiu Li, Xiulong Liu, and Jie Gui. 2018. Binary hashing for approximate nearest neighbor search on big data: A survey. IEEE Access Vol. 6 (2018), 2039--2054.
    [5]
    Lianhua Chi and Xingquan Zhu. 2017. Hashing Techniques: A survey and taxonomy. CSUR Vol. 50, 1 (2017), 11.
    [6]
    Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yan-Tao Zheng. 2009. NUS-WIDE: A real-world web image database from National University of Singapore CIVR. 48:1--48:9.
    [7]
    Mayur Datar, Nicole Immorlica, Piotr Indyk, and Vahab S Mirrokni. 2004. Locality-sensitive hashing scheme based on p-stable distributions SoCG. 253--262.
    [8]
    Aristides Gionis, Piotr Indyk, Rajeev Motwani, et almbox. 1999. Similarity search in high dimensions via hashing. In VLDB. 518--529.
    [9]
    Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2013. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. TPAMI Vol. 35, 12 (2013), 2916--2929.
    [10]
    Albert Gordo, Florent Perronnin, Yunchao Gong, and Svetlana Lazebnik. 2014. Asymmetric distances for binary embeddings. TPAMI Vol. 36, 1 (2014), 33--47.
    [11]
    Jie Gui, Tongliang Liu, Zhenan Sun, Dacheng Tao, and Tieniu Tan. 2018. Fast supervised discrete hashing. TPAMI Vol. 40, 2 (2018), 490--496.
    [12]
    Yanbin Hao, Tingting Mu, John Y Goulermas, Jianguo Jiang, Richang Hong, and Meng Wang. 2017. Unsupervised t-distributed video hashing and its deep hashing extension. TIP Vol. 26, 11 (2017), 5531--5544.
    [13]
    Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics SIGIR. ACM, 355--364.
    [14]
    Xiangnan He, Hanwang Zhang, Min-Yen Kan, and Tat-Seng Chua. 2016. Fast matrix factorization for online recommendation with implicit feedback SIGIR. 549--558.
    [15]
    Long-Kai Huang and Sinno Jialin Pan. 2016. Class-wise supervised hashing with label embedding and active bits IJCAI. 1585--1591.
    [16]
    Mark J. Huiskes and Michael S. Lew. 2008. The MIR flickr retrieval evaluation. In MIR. 39--43.
    [17]
    Qing-Yuan Jiang and Wu-Jun Li. 2015. Scalable graph hashing with feature transformation IJCAI. 2248--2254.
    [18]
    Wang-Cheng Kang, Wu-Jun Li, and Zhi-Hua Zhou. 2016. Column sampling based discrete supervised hashing. In AAAI. 1230--1236.
    [19]
    Alex Krizhevsky and Geoffrey Hinton. 2009. Learning multiple layers of features from tiny images. Technical Report, University of Toronto (2009).
    [20]
    Brian Kulis and Kristen Grauman. 2009. Kernelized locality-sensitive hashing for scalable image search ICCV. 2130--2137.
    [21]
    Brian Kulis, Prateek Jain, and Kristen Grauman. 2009. Fast similarity search for learned metrics. TPAMI Vol. 31, 12 (2009), 2143--2157.
    [22]
    Shaishav Kumar and Raghavendra Udupa. 2011. Learning hash functions for cross-view similarity search IJCAI. 1360--1365.
    [23]
    Hanjiang Lai, Yan Pan, Ye Liu, and Shuicheng Yan. 2015. Simultaneous feature learning and hash coding with deep neural networks CVPR. 3270--3278.
    [24]
    Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. P IEEE Vol. 86, 11 (1998), 2278--2324.
    [25]
    Cong Leng, Jian Cheng, Jiaxiang Wu, Xi Zhang, and Hanqing Lu. 2014. Supervised hashing with soft constraints. In CIKM. 1851--1854.
    [26]
    Cong Leng, Jiaxiang Wu, Jian Cheng, Xi Zhang, and Hanqing Lu. 2015. Hashing for distributed data. In ICML. 1642--1650.
    [27]
    Ping Li, Anshumali Shrivastava, Joshua L Moore, and Arnd C König. 2011. Hashing algorithms for large-scale learning. In NIPS. 2672--2680.
    [28]
    Wu-Jun Li, Sheng Wang, and Wang-Cheng Kang. 2016. Feature learning based deep supervised hashing with pairwise labels IJCAI. 1711--1717.
    [29]
    Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton Van den Hengel, and David Suter. 2014. Fast supervised hashing with decision trees for high-dimensional data CVPR. 1963--1970.
    [30]
    Guosheng Lin, Chunhua Shen, David Suter, and Anton van den Hengel. 2013. A general two-step approach to learning-based hashing ICCV. 2552--2559.
    [31]
    Zijia Lin, Guiguang Ding, Mingqing Hu, and Jianmin Wang. 2015. Semantics-preserving hashing for cross-view retrieval CVPR. 3864--3872.
    [32]
    Qi Liu and Hongtao Lu. 2016. Natural supervised hashing. In IJCAI. 1788--1794.
    [33]
    Wei Liu, Cun Mu, Sanjiv Kumar, and Shih-Fu Chang. 2014. Discrete graph hashing. In NIPS. 3419--3427.
    [34]
    Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, and Shih-Fu Chang. 2012. Supervised hashing with kernels. In CVPR. 2074--2081.
    [35]
    Xianglong Liu, Cheng Deng, Bo Lang, Dacheng Tao, and Xuelong Li. 2016. Query-adaptive reciprocal hash tables for nearest neighbor search. TIP Vol. 25, 2 (2016), 907--919.
    [36]
    Xianglong Liu, Lei Huang, Cheng Deng, Jiwen Lu, and Bo Lang. 2015. Multi-view complementary hash tables for nearest neighbor search ICCV. 1107--1115.
    [37]
    Mingsheng Long, Yue Cao, Jianmin Wang, and Philip S Yu. 2016. Composite correlation quantization for efficient multimodal retrieval SIGIR. 579--588.
    [38]
    Liqiang Nie, Meng Wang, Zhengjun Zha, Guangda Li, and Tat-Seng Chua. 2011. Multimedia answering: enriching text QA with media information SIGIR. 695--704.
    [39]
    Liqiang Nie, Meng Wang, Zheng-Jun Zha, and Tat-Seng Chua. 2012 a. Oracle in image search: A content-based approach to performance prediction. TOIS Vol. 30, 2 (2012), 13.
    [40]
    Liqiang Nie, Shuicheng Yan, Meng Wang, Richang Hong, and Tat-Seng Chua. 2012 b. Harvesting visual concepts for image search with complex queries MM. 59--68.
    [41]
    Liqiang Nie, Yi-Liang Zhao, Mohammad Akbari, Jialie Shen, and Tat-Seng Chua. 2015. Bridging the vocabulary gap between health seekers and healthcare knowledge. TKDE Vol. 27, 2 (2015), 396--409.
    [42]
    Mohammad Norouzi and David M Blei. 2011. Minimal loss hashing for compact binary codes. In ICML. 353--360.
    [43]
    Aude Oliva and Antonio Torralba. 2001. Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV Vol. 42, 3 (2001), 145--175.
    [44]
    Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. 2015. ImageNet large scale visual recognition challenge. IJCV Vol. 115, 3 (2015), 211--252.
    [45]
    Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015. Supervised discrete hashing. In CVPR. 37--45.
    [46]
    Fumin Shen, Chunhua Shen, Qinfeng Shi, Anton Hengel, and Zhenmin Tang. 2013. Inductive hashing on manifolds. In CVPR. 1562--1569.
    [47]
    Anshumali Shrivastava and Ping Li. 2014. Asymmetric LSH (ALSH) for sublinear time maximum inner product search (MIPS) NIPS. 2321--2329.
    [48]
    Jun Wang, Sanjiv Kumar, and Shih-Fu Chang. 2010. Semi-supervised hashing for scalable image retrieval CVPR. 3424--3431.
    [49]
    Jingdong Wang, Ting Zhang, Jingkuan Song, Nicu Sebe, and Heng Tao Shen. 2018. A survey on learning to hash. TPAMI Vol. 40, 4 (2018), 769--790.
    [50]
    Meng Wang, Weijie Fu, Shijie Hao, Hengchang Liu, and Xindong Wu. 2017. Learning on big graph: Label inference and regularization with anchor hierarchy. TKDE Vol. 29, 5 (2017), 1101--1114.
    [51]
    Meng Wang, Weijie Fu, Shijie Hao, Dacheng Tao, and Xindong Wu. 2016. Scalable semi-supervised learning by efficient anchor graph regularization. TKDE Vol. 28, 7 (2016), 1864--1877.
    [52]
    Yair Weiss, Antonio Torralba, and Rob Fergus. 2009. Spectral hashing. In NIPS. 1753--1760.
    [53]
    Rongkai Xia, Yan Pan, Hanjiang Lai, Cong Liu, and Shuicheng Yan. 2014. Supervised hashing for image retrieval via image representation learning AAAI. 2156--2162.
    [54]
    Liang Xie, Jialie Shen, and Lei Zhu. 2016. Online cross-modal hashing for web image retrieval AAAI. 294--300.
    [55]
    Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen, and Xuelong Li. 2017. Learning discriminative binary codes for large-scale cross-modal retrieval. TIP Vol. 26, 5 (2017), 2494--2507.
    [56]
    Xun Yang, Meng Wang, Richang Hong, Qi Tian, and Yong Rui. 2017. Enhancing person re-identification in a self-trained subspace. TOMM Vol. 13, 3 (2017), 27:1--27:23.
    [57]
    Felix Yu, Sanjiv Kumar, Yunchao Gong, and Shih-Fu Chang. 2014. Circulant binary embedding. In ICML. 946--954.
    [58]
    Dongqing Zhang and Wu-Jun Li. 2014. Large-scale supervised multimodal hashing with semantic correlation maximization AAAI. 2177--2183.
    [59]
    Hanwang Zhang, Fumin Shen, Wei Liu, Xiangnan He, Huanbo Luan, and Tat-Seng Chua. 2016. Discrete collaborative filtering. In SIGIR. 325--334.
    [60]
    Peichao Zhang, Wei Zhang, Wu-Jun Li, and Minyi Guo. 2014 b. Supervised hashing with latent factor models. In SIGIR. 173--182.
    [61]
    Ruimao Zhang, Liang Lin, Rui Zhang, Wangmeng Zuo, and Lei Zhang. 2015. Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification. TIP Vol. 24, 12 (2015), 4766--4779.
    [62]
    Zhiwei Zhang, Qifan Wang, Lingyun Ruan, and Luo Si. 2014 a. Preference preserving hashing for efficient recommendation SIGIR. 183--192.
    [63]
    Yi Zhen, Piyush Rai, Hongyuan Zha, and Lawrence Carin. 2015. Cross-modal similarity learning via pairs, preferences, and active supervision AAAI. 3203--3209.
    [64]
    Lei Zhu, Zi Huang, Xiaobai Liu, Xiangnan He, Jiande Sun, and Xiaofang Zhou. 2017. Discrete multimodal hashing with canonical views for robust mobile landmark search. TMM Vol. 19, 9 (2017), 2066--2079.

    Cited By

    View all

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '18: Proceedings of the 2018 World Wide Web Conference
    April 2018
    2000 pages
    ISBN:9781450356398
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • IW3C2: International World Wide Web Conference Committee

    In-Cooperation

    Publisher

    International World Wide Web Conferences Steering Committee

    Republic and Canton of Geneva, Switzerland

    Publication History

    Published: 10 April 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. discrete optimization
    2. learning-to-hash
    3. scalable search
    4. supervised hashing

    Qualifiers

    • Research-article

    Funding Sources

    • Natural Science Foundation of Shandong Province
    • National Natural Science Foundation of China

    Conference

    WWW '18
    Sponsor:
    • IW3C2
    WWW '18: The Web Conference 2018
    April 23 - 27, 2018
    Lyon, France

    Acceptance Rates

    WWW '18 Paper Acceptance Rate 170 of 1,155 submissions, 15%;
    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)116
    • Downloads (Last 6 weeks)13
    Reflects downloads up to 11 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Relaxed Energy Preserving Hashing for Image RetrievalIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.335184125:7(7388-7400)Online publication date: Jul-2024
    • (2024)Semantic embedding based online cross-modal hashing methodScientific Reports10.1038/s41598-023-50242-w14:1Online publication date: 6-Jan-2024
    • (2024)Scalable Supervised Asymmetric HashingBinary Representation Learning on Visual Images10.1007/978-981-97-2112-2_2(17-50)Online publication date: 7-Mar-2024
    • (2023)Attention-Oriented Deep Multi-Task Hash LearningElectronics10.3390/electronics1205122612:5(1226)Online publication date: 4-Mar-2023
    • (2023)Self-Distillation Dual-Memory Online Hashing with Hash Centers for Streaming Data RetrievalProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612119(6340-6349)Online publication date: 26-Oct-2023
    • (2023)Stepwise Refinement Short Hashing for Image RetrievalProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3611864(6501-6509)Online publication date: 26-Oct-2023
    • (2023)Fast Asymmetric and Discrete Cross-Modal Hashing With Semantic ConsistencyIEEE Transactions on Computational Social Systems10.1109/TCSS.2022.319570410:2(577-589)Online publication date: Apr-2023
    • (2023)Jointly sparse fast hashing with orthogonal learning for large-scale image retrievalSignal Processing: Image Communication10.1016/j.image.2023.117062119(117062)Online publication date: Nov-2023
    • (2023)RICH: A rapid method for image-text cross-modal hash retrievalDisplays10.1016/j.displa.2023.10248979(102489)Online publication date: Sep-2023
    • (2022)Deep Discrete Hashing for Label Distribution LearningIEEE Signal Processing Letters10.1109/LSP.2022.315822929(832-836)Online publication date: 2022
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media