Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Binary Representation via Jointly Personalized Sparse Hashing

Published: 31 October 2022 Publication History

Abstract

Unsupervised hashing has attracted much attention for binary representation learning due to the requirement of economical storage and efficiency of binary codes. It aims to encode high-dimensional features in the Hamming space with similarity preservation between instances. However, most existing methods learn hash functions in manifold-based approaches. Those methods capture the local geometric structures (i.e., pairwise relationships) of data, and lack satisfactory performance in dealing with real-world scenarios that produce similar features (e.g., color and shape) with different semantic information. To address this challenge, in this work, we propose an effective unsupervised method, namely, Jointly Personalized Sparse Hashing (JPSH), for binary representation learning. To be specific, first, we propose a novel personalized hashing module, i.e., Personalized Sparse Hashing (PSH). Different personalized subspaces are constructed to reflect category-specific attributes for different clusters, adaptively mapping instances within the same cluster to the same Hamming space. In addition, we deploy sparse constraints for different personalized subspaces to select important features. We also collect the strengths of the other clusters to build the PSH module with avoiding over-fitting. Then, to simultaneously preserve semantic and pairwise similarities in our proposed JPSH, we incorporate the proposed PSH and manifold-based hash learning into the seamless formulation. As such, JPSH not only distinguishes the instances from different clusters but also preserves local neighborhood structures within the cluster. Finally, an alternating optimization algorithm is adopted to iteratively capture analytical solutions of the JPSH model. We apply the proposed representation learning algorithm JPSH to the similarity search task. Extensive experiments on four benchmark datasets verify that the proposed JPSH outperforms several state-of-the-art unsupervised hashing algorithms.

References

[1]
Ming Chen, Lu Zhang, and Jan P. Allebach. 2015. Learning deep features for image emotion classification. In Proceedings of the IEEE International Conference on Image Processing. IEEE, 4491–4495.
[2]
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: A real-world web image database from National University of Singapore. In Proceedings of the ACM International Conference on Image and Video Retrieval. 1–9.
[3]
Mayur Datar, Nicole Immorlica, Piotr Indyk, and Vahab S. Mirrokni. 2004. Locality-sensitive hashing scheme based on p-stable distributions. In Proceedings of the Annual Symposium on Computational Geometry. 253–262.
[4]
Xiao Dong, Li Liu, Lei Zhu, Zhiyong Cheng, and Huaxiang Zhang. 2021. Unsupervised deep K-means hashing for efficient image retrieval and clustering. IEEE Trans. Circ. Syst. Video Technol. 31, 8 (2021), 3266–3277.
[5]
Ruben Fernandez-Beltran, Begüm Demir, Filiberto Pla, and Antonio Plaza. 2020. Unsupervised remote sensing image retrieval using probabilistic latent semantic hashing. IEEE Geosci. Remote Sens. Lett. 18, 2 (2020), 256–260.
[6]
Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2012. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 35, 12 (2012), 2916–2929.
[7]
David Hallac, Jure Leskovec, and Stephen Boyd. 2015. Network lasso: Clustering and optimization in large graphs. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 387–396.
[8]
Yanbin Hao, Tingting Mu, John Y. Goulermas, Jianguo Jiang, Richang Hong, and Meng Wang. 2017. Unsupervised t-distributed video hashing and its deep hashing extension. IEEE Trans. Image Process. 26, 11 (2017), 5531–5544.
[9]
Di Hu, Feiping Nie, and Xuelong Li. 2018. Discrete spectral hashing for efficient similarity retrieval. IEEE Trans. Image Process. 28, 3 (2018), 1080–1091.
[10]
Hengtong Hu, Lingxi Xie, Richang Hong, and Qi Tian. 2020. Creating something from nothing: Unsupervised knowledge distillation for cross-modal hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3123–3132.
[11]
Mark J. Huiskes and Michael S. Lew. 2008. The MIR flickr retrieval evaluation. In Proceedings of the ACM International Conference on Multimedia Information Retrieval. 39–43.
[12]
Piotr Indyk. 2001. A small approximately min-wise independent family of hash functions. J. Algor. 38, 1 (2001), 84–90.
[13]
Go Irie, Zhenguo Li, Xiao-Ming Wu, and Shih-Fu Chang. 2014. Locally linear hashing for extracting non-linear manifolds. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2115–2122.
[14]
Rongrong Ji, Hong Liu, Liujuan Cao, Di Liu, Yongjian Wu, and Feiyue Huang. 2017. Toward optimal manifold hashing via discrete locally linear embedding. IEEE Trans. Image Process. 26, 11 (2017), 5411–5420.
[15]
Qing-Yuan Jiang and Wu-Jun Li. 2015. Scalable graph hashing with feature transformation. In Proceedings of the International Joint Conference on Artificial Intelligence, Vol. 15. 2248–2254.
[16]
Lu Jin, Kai Li, Zechao Li, Fu Xiao, Guo-Jun Qi, and Jinhui Tang. 2018. Deep semantic-preserving ordinal hashing for cross-modal similarity search. IEEE Trans. Neural Netw. Learn. Syst. 30, 5 (2018), 1429–1440.
[17]
Sheng Jin, Hongxun Yao, Qin Zhou, Yao Liu, Jianqiang Huang, and Xiansheng Hua. 2021. Unsupervised discrete hashing with affinity similarity. IEEE Trans. Image Process. 30 (2021), 6130–6141.
[18]
Deguang Kong, Ryohei Fujimaki, Ji Liu, Feiping Nie, and Chris Ding. 2014. Exclusive feature learning on arbitrary structures via l1,2-norm. Adv. Neural Info. Process. Syst. 27 (2014), 1655–1663.
[19]
Deguang Kong, Ji Liu, Bo Liu, and Xuan Bao. 2016. Uncorrelated group lasso. In Proceedings of the AAAI Conference on Artificial Intelligence. 1765–1771.
[20]
Alex Krizhevsky, Geoffrey Hinton, et al. 2009. Learning multiple layers of features from tiny images. Citeseer.
[21]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2017. Imagenet classification with deep convolutional neural networks. Commun. ACM 60, 6 (2017), 84–90.
[22]
Zhihui Lai, Yudong Chen, Jian Wu, Wai Keung Wong, and Fumin Shen. 2018. Jointly sparse hashing for image retrieval. IEEE Trans. Image Process. 27, 12 (2018), 6147–6158.
[23]
Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proc. IEEE 86, 11 (1998), 2278–2324.
[24]
Jundong Li, Liang Wu, Harsh Dani, and Huan Liu. 2018. Unsupervised personalized feature selection. In Proceedings of the AAAI Conference on Artificial Intelligence. 3514–3521.
[25]
Jundong Li, Liang Wu, Osmar R. Zaïane, and Huan Liu. 2017. Toward personalized relational learning. In Proceedings of the SIAM International Conference on Data Mining. SIAM, 444–452.
[26]
Shuyan Li, Xiu Li, Jiwen Lu, and Jie Zhou. 2022. Structure-adaptive neighborhood preserving hashing for scalable video search. IEEE Trans. Circ. Syst. Video Technol. 32, 4 (2022), 2441–2454.
[27]
Xuelong Li, Di Hu, and Feiping Nie. 2017. Large graph hashing with spectral rotation. In Proceedings of the AAAI Conference on Artificial Intelligence. 2203–2209.
[28]
Zechao Li, Jinhui Tang, Liyan Zhang, and Jian Yang. 2020. Weakly supervised semantic guided hashing for social image retrieval. Int. J. Comput. Vision 128, 8 (2020), 2265–2278.
[29]
Mingbao Lin, Rongrong Ji, Hong Liu, and Yongjian Wu. 2018. Supervised online hashing via hadamard codebook learning. In Proceedings of the ACM International Conference on Multimedia. 1635–1643.
[30]
Zijia Lin, Guiguang Ding, Mingqing Hu, and Jianmin Wang. 2015. Semantics-preserving hashing for cross-view retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3864–3872.
[31]
Hong Liu, Rongrong Ji, Jingdong Wang, and Chunhua Shen. 2018. Ordinal constraint binary coding for approximate nearest neighbor search. IEEE Trans. Pattern Anal. Mach. Intell. 41, 4 (2018), 941–955.
[32]
Hong Liu, Rongrong Ji, Yongjian Wu, and Wei Liu. 2016. Towards optimal binary code learning via ordinal embedding. In Proceedings of the AAAI Conference on Artificial Intelligence. 1258–1265.
[33]
Wei Liu, Junfeng He, and Shih-Fu Chang. 2010. Large graph construction for scalable semi-supervised learning. In Proceedings of the International Conference on International Conference on Machine Learning. 679–686.
[34]
Wei Liu, Cun Mu, Sanjiv Kumar, and Shih-Fu Chang. 2014. Discrete graph hashing. Advances in Neural Information Processing Systems 27 (2014), 3419–3427.
[35]
Wei Liu, Jun Wang, Sanjiv Kumar, and Shih-Fu Chang. 2011. Hashing with graphs. In Proceedings of the International Conference on Machine Learning. 1–8.
[36]
Aude Oliva and Antonio Torralba. 2001. Modeling the shape of the scene: A holistic representation of the spatial envelope. Int. J. Comput. Vision 42, 3 (2001), 145–175.
[37]
Tianrong Rao, Xiaoxu Li, and Min Xu. 2020. Learning multi-level deep representations for image emotion classification. Neural Process. Lett. 51, 3 (2020), 2043–2061.
[38]
Tianrong Rao, Xiaoxu Li, Haimin Zhang, and Min Xu. 2019. Multi-level region-based convolutional neural network for image emotion classification. Neurocomputing 333 (2019), 429–439.
[39]
Fumin Shen, Chunhua Shen, Qinfeng Shi, Anton Van Den Hengel, and Zhenmin Tang. 2013. Inductive hashing on manifolds. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1562–1569.
[40]
Yichun Shi, Xiang Yu, Kihyuk Sohn, Manmohan Chandraker, and Anil K. Jain. 2020. Towards universal representation learning for deep face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6817–6826.
[41]
Jinhui Tang, Jie Lin, Zechao Li, and Jian Yang. 2018. Discriminative deep quantization hashing for face image retrieval. IEEE Trans. Neural Netw. Learn. Syst. 29, 12 (2018), 6154–6162.
[42]
Zhibao Tian, Hui Zhang, Yong Chen, and Dell Zhang. 2020. Unsupervised hashing based on the recovery of subspace structures. Pattern Recogn. 103 (2020), 107261.
[43]
Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE.J. Mach. Learn. Res. 9, 11 (2008).
[44]
Kilian Weinberger, Anirban Dasgupta, John Langford, Alex Smola, and Josh Attenberg. 2009. Feature hashing for large scale multitask learning. In Proceedings of the Annual International Conference on Machine Learning. 1113–1120.
[45]
Yair Weiss, Antonio Torralba, and Rob Fergus. 2008. Spectral hashing. Adv. Neural Info. Process. Syst. 21 (2008), 1753–1760.
[46]
Zhenyu Weng and Yuesheng Zhu. 2020. Concatenation hashing: A relative position preserving method for learning binary codes. Pattern Recogn. 100 (2020), 107151.
[47]
Yan Xia, Kaiming He, Pushmeet Kohli, and Jian Sun. 2015. Sparse projections for high-dimensional binary codes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3332–3339.
[48]
Xinguang Xiang, Yajie Zhang, Lu Jin, Zechao Li, and Jinhui Tang. 2021. Sub-Region localized hashing for fine-grained image retrieval. IEEE Trans. Image Process. 31 (2021), 314–326.
[49]
Erkun Yang, Cheng Deng, Tongliang Liu, Wei Liu, and Dacheng Tao. 2018. Semantic structure-based unsupervised deep hashing. In Proceedings of the International Joint Conference on Artificial Intelligence. 1064–1070.
[50]
Xingxu Yao, Dongyu She, Haiwei Zhang, Jufeng Yang, Ming-Ming Cheng, and Liang Wang. 2020. Adaptive deep metric learning for affective image retrieval and classification. IEEE Trans. Multimedia 23 (2020), 1640–1653.
[51]
Xingxu Yao, Dongyu She, Sicheng Zhao, Jie Liang, Yu-Kun Lai, and Jufeng Yang. 2019. Attention-aware polarity sensitive embedding for affective image retrieval. In Proceedings of the IEEE International Conference on Computer Vision. 1140–1150.
[52]
Wanqian Zhang, Dayan Wu, Yu Zhou, Bo Li, Weiping Wang, and Dan Meng. 2020. Deep unsupervised hybrid-similarity hadamard hashing. In Proceedings of the ACM International Conference on Multimedia. 3274–3282.
[53]
Zheng Zhang, Guo-sen Xie, Yang Li, Sheng Li, and Zi Huang. 2019. SADIH: Semantic-aware discrete hashing. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 5853–5860.
[54]
Sicheng Zhao, Hongxun Yao, You Yang, and Yanhao Zhang. 2014. Affective image retrieval via multi-graph learning. In Proceedings of the ACM International Conference on Multimedia. 1025–1028.
[55]
Lei Zhu, Zi Huang, Zhihui Li, Liang Xie, and Heng Tao Shen. 2018. Exploring auxiliary context: Discrete semantic transfer hashing for scalable image retrieval. IEEE Trans. Neural Netw. Learn. Syst. 29, 11 (2018), 5264–5276.
[56]
Xiaofeng Zhu, Xuelong Li, Shichao Zhang, Zongben Xu, Litao Yu, and Can Wang. 2017. Graph PCA hashing for similarity search. IEEE Trans. Multimedia 19, 9 (2017), 2033–2044.
[57]
Xiaofeng Zhu, Lei Zhang, and Zi Huang. 2014. A sparse embedding and least variance encoding approach to hashing. IEEE Trans. Image Process. 23, 9 (2014), 3737–3750.
[58]
Hui Zou, Trevor Hastie, and Robert Tibshirani. 2006. Sparse principal component analysis. J. Comput. Graph. Stat. 15, 2 (2006), 265–286.

Cited By

View all
  • (2024)Detecting Logos for Indoor Environmental Perception Using Unsupervised and Few-Shot LearningElectronics10.3390/electronics1312224613:12(2246)Online publication date: 7-Jun-2024
  • (2024)Conflicting Marks Archive Dataset: A Dataset of Conflicting Marks from the Brazilian Intellectual Property OfficeData10.3390/data90200339:2(33)Online publication date: 9-Feb-2024
  • (2024)Online Cross-modal Hashing With Dynamic PrototypeACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366524920:8(1-18)Online publication date: 13-Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications
ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 18, Issue 3s
October 2022
381 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/3567476
  • Editor:
  • Abdulmotaleb El Saddik
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 October 2022
Online AM: 06 September 2022
Accepted: 12 August 2022
Revised: 03 August 2022
Received: 03 December 2021
Published in TOMM Volume 18, Issue 3s

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Binary representation
  2. personalized hashing
  3. manifold hashing
  4. similarity search

Qualifiers

  • Research-article
  • Refereed

Funding Sources

  • Guanxi Natural Science Foundation
  • National Natural Science Foundation of China
  • Natural Science Foundation of Hunan Province
  • Guangxi Key Laboratory of Image and Graphic Intelligent Processing

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)54
  • Downloads (Last 6 weeks)3
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Detecting Logos for Indoor Environmental Perception Using Unsupervised and Few-Shot LearningElectronics10.3390/electronics1312224613:12(2246)Online publication date: 7-Jun-2024
  • (2024)Conflicting Marks Archive Dataset: A Dataset of Conflicting Marks from the Brazilian Intellectual Property OfficeData10.3390/data90200339:2(33)Online publication date: 9-Feb-2024
  • (2024)Online Cross-modal Hashing With Dynamic PrototypeACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366524920:8(1-18)Online publication date: 13-Jun-2024
  • (2024)Bridging the Domain Gap in Scene Flow Estimation via Hierarchical Smoothness RefinementACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366182320:8(1-21)Online publication date: 12-Jun-2024
  • (2024)High Fidelity Makeup via 2D and 3D Identity Preservation NetACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365647520:8(1-24)Online publication date: 13-Jun-2024
  • (2024)RAST: Restorable Arbitrary Style TransferACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363877020:5(1-21)Online publication date: 22-Jan-2024
  • (2024)Learning Offset Probability Distribution for Accurate Object DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363721420:5(1-24)Online publication date: 22-Jan-2024
  • (2024)Supervised Hierarchical Online Hashing for Cross-modal RetrievalACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363252720:4(1-23)Online publication date: 11-Jan-2024
  • (2024)HARR: Learning Discriminative and High-Quality Hash Codes for Image RetrievalACM Transactions on Multimedia Computing, Communications, and Applications10.1145/362716220:5(1-23)Online publication date: 22-Jan-2024
  • (2024)Efficient Crowd Counting via Dual Knowledge DistillationIEEE Transactions on Image Processing10.1109/TIP.2023.334360933(569-583)Online publication date: 1-Jan-2024
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media