Abstract
Hashing learning is efficient for large-scale image retrieval by using the nearest neighbor search with binary codes instead of continuous representations. With the success of deep neural networks in related tasks such as data representation, recent hashing methods based on deep learning can further improve image retrieval quality and classification accuracy. However, most existing methods are primarily designed to maximize the performance of retrieval based on linear scan of hash codes which is still time-consuming on large-scale datasets. Fortunately, Hamming space retrieval is an alternative as it is less time-consuming by retrieving data points that are within a Hamming ball with a given Hamming radius, but few works focus on that. In this paper, we propose a concentrated hashing method with neighborhood embedding (CHNE) for efficient and effective image retrieval and classification. By integrating Cauchy cross-entropy and pair-wise weighted similarity loss, CHNE can enable similar data pairs with smaller Hamming distance and dissimilar data pairs with larger Hamming distance. In addition, existing hashing methods are usually designed for retrieval, thus the performance of classification using the binary codes is not guaranteed. To tackle this problem, we jointly minimize the regression quantization and neighborhood structure reconstruction errors in the loss function to improve the classification accuracy. The proposed end-to-end deep hashing method can be optimized by back-propagation in a standard manner. Experimental results on several datasets demonstrate that the proposed method can improve the performance of retrieval and classification. Due to its generality, the proposed method is expected to be useful for image retrieval and classification in broader areas.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Wang J, Zhang T, Sebe N, Shen HT et al (2017) A survey on learning to hash. IEEE Trans Pattern Anal Mach Intell 40(4):769–790
Di H, Nie F, Li X (2018) Deep binary reconstruction for cross-modal hashing. IEEE Trans Multimed 21(4):973–985
Kuang Z, Zhang X, Jun Y, Li Z, Fan J (2021) Deep embedding of concept ontology for hierarchical fashion recognition. Neurocomputing 425:191–206
Brasnett P, Bober M (2008) Fast and robust image identification. In: 2008 19th International Conference on pattern recognition, pp 1–5. IEEE, 2008
Bronstein MM, Bronstein AM, Michel F, Paragios N (2010) Data fusion through cross-modality metric learning using similarity-sensitive hashing. In: 2010 IEEE computer society conference on computer vision and pattern recognition, IEEE. pp 3594–3601
Torralba A, Fergus R, Weiss Y, et al (2008) Small codes and large image databases for recognition. In: CVPR, volume 1, page 2. Citeseer, 2008
Salakhutdinov R, Hinton G (2009) Semantic hashing. Int J Approx Reason 50(7):969–978
Andoni A, Indyk P (2006) Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. In: 2006 47th Annual IEEE Symposium on foundations of computer science (FOCS’06), pp. 459–468. IEEE, 2006
Gong Y, Lazebnik S, Gordo A, Perronnin F (2012) Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans Pattern Anal Mach Intell 35(12):2916–2929
Strecha C, Bronstein A, Bronstein M, Fua P (2011) Ldahash: improved matching with smaller descriptors. IEEE Trans Pattern Anal Mach Intell 34(1):66–78
Kulis B, Darrell T (2009) Learning to hash with binary reconstructive embeddings. In: Bengio Y, Schuurmans D, Lafferty J, Williams C, Culotta A (eds) Advances in neural information processing systems, vol 2. pp 1042–1050
Liu W, Wang J, Ji R, Jiang Y-G, Chang S-F (2012) Supervised hashing with kernels. In: 2012 IEEE Conference on computer vision and pattern recognition, pp 2074–2081. IEEE, 2012
Weiss Y, Torralba A, Fergus R (2009) Spectral hashing. In: Advances in neural information processing systems, pp 1753–1760, 2009
Shen F, Shen C, Shi Q, Van Den Hengel A, Tang Z (2013) Inductive hashing on manifolds. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 1562–1569, 2013
Liu Y, Feng L, Liu S, Sun M (2019) An elm based local topology preserving hashing. Int J Mach Learn Cybern 10(10):2691–2708
Cao Z, Long M, Wang J, Yu PS (2017) Hashnet: deep learning to hash by continuation. In: Proceedings of the IEEE International Conference on computer vision, pp 5608–5617, 2017
Zhu H, Long M, Wang J, Cao Y (2016) Deep hashing network for efficient similarity retrieval. In: Proceedings of the thirtieth AAAI conference on artificial intelligence (AAAI-16), vol 30(1). pp 2415–2421
Zhou X, Shen F, Liu L, Liu W, Nie L, Yang Y, Shen H-T (2018) Graph convolutional network hashing. IEEE Trans Cybern 50(4):1460–1472
Li J, Ng WWY, Tian X, Kwong S, Wang H (2020) Weighted multi-deep ranking supervised hashing for efficient image retrieval. Int J Mach Learn Cybern 11(4):883–897
van der Maaten L, Hinton G (2008) Visualizing data using t-sne. J Mach Learn Res 9:2579–2605
Cakir F, He K, Adel Bargal S, Sclaroff S (2017) Mihash: online hashing with mutual information. In: Proceedings of the IEEE International Conference on computer vision, pp 437–445, 2017
Gionis A, Indyk P, Motwani R et al (1999) Similarity search in high dimensions via hashing. In Vldb 99:518–529
Wang J, Kumar S, Chang S-F (2012) Semi-supervised hashing for large-scale search. IEEE Trans Pattern Anal Mach Intell 34(12):2393–2406
Dai B, Guo R, Kumar S, He N, Song L (2017) Stochastic generative hashing. In: Proceedings of the 34th International Conference on machine learning-volume 70, pp 913–922. JMLR.org, 2017
Zhang D, Wang J, Cai D, Lu J (2010) Self-taught hashing for fast similarity search. In: Proceedings of the 33rd International ACM SIGIR Conference on research and development in information retrieval, pp 18–25. ACM, 2010
Shen F, Shen C, Liu W, Tao Shen H (2015) Supervised discrete hashing. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 37–45, 2015
Cao Y, Long M, Liu B, Wang J (2018) Deep cauchy hashing for hamming space retrieval. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 1229–1237, 2018
Suykens JAK, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Process Lett 9(3):293–300
Keller JM, Gray MR, Givens JA (1985) A fuzzy k-nearest neighbor algorithm. IEEE Trans Syst Man Cybern 4:580–585
Zhang Z, Lyons M, Schuster M, Akamatsu S (1998) Comparison between geometry-based and Gbor-wavelets-based facial expression recognition using multi-layer perceptron. In: Proceedings Third IEEE International Conference on automatic face and gesture recognition, pp 454–459. IEEE, 1998
Zhang W, Kang P, Fang X, Teng L, Han N (2019) Joint sparse representation and locality preserving projection for feature extraction. Int J Mach Learn Cybern 10(7):1731–1745
Nanni L, Ghidoni S, Brahnam S (2017) Handcrafted vs. non-handcrafted features for computer vision classification. Pattern Recognit 71:158–172
Liao R, Shiqi Yu, An W, Huang Y (2020) A model-based gait recognition method with body pose and human prior knowledge. Pattern Recognit 98:107069
LeCun Y, Bottou L, Bengio Y, Haffner P et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, vol 25. pp 1097–1105
Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 2818–2826, 2016
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 770–778, 2016
Liao R, An W, Li Z, Bhattacharyya SS (2021) A novel view synthesis approach based on view space covering for gait recognition. Neurocomputing 453:13–25
Zhang Z, Chen Y, Saligrama V (2016) Efficient training of very deep neural networks for supervised hashing. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 1487–1495, 2016
Zhang S, Liu S, Cao X, Song Z, Zhou J (2018) Watch fashion shows to tell clothing attributes. Neurocomputing 282:98–110
Liu J, Song X, Chen Z, Ma J (2019) Neural fashion experts: I know how to make the complementary clothing matching. Neurocomputing 359:249–263
Liu L, Zhang H, Ji Y, Jonathan Wu QM (2019) Toward ai fashion design: An attribute-gan model for clothing match. Neurocomputing 341:156–167
Dmochowski JP, Sajda P, Parra LC (2010) Maximum likelihood in cost-sensitive learning: model specification, approximations, and upper bounds. J Mach Learn Res 11:3313–3332
Nl Johnson S, Kotz N Balakrishnan (1970) Continuous univariate distributions, vol 1. Houghton Mifflin, Boston
Willliam F (2008) An introduction to probability theory and its applications, vol 2. Wiley, Hoboken
Morgado P, Li Y, Pereira JC, Saberian M, Vasconcelos N (2021) Deep hashing with hash-consistent large margin proxy embeddings. Int J Comput Vis 129(2):419–438
Wang Y, Xianfeng O, Liang J, Sun Z (2020) Deep semantic reconstruction hashing for similarity retrieval. IEEE Trans Circ Syst Video Technol 31(1):387–400
Li Y, Cao L, Zhu J, Luo J (2017) Mining fashion outfit composition using an end-to-end deep learning approach on set data. IEEE Trans Multimed 19(8):1946–1955
Liu S, Feng J, Domokos C, Hui X, Huang J, Zhenzhen H, Yan S (2013) Fashion parsing with weak color-category labels. IEEE Trans Multimed 16(1):253–265
Srinivasan K, Dastoor PH, Radhakrishnaiah P, Jayaraman S (1992) Fdas: a knowledge-based framework for analysis of defects in woven textile structures. J Text Inst 83(3):431–448
Shen F, Zhou X, Yang Y, Song J, Shen HT, Tao D (2016) A fast optimization method for general binary code learning. IEEE Trans Image Process 25(12):5610–5621
Cao Y, Long M, Wang J, Zhu H, Wen Q (2016) Deep quantization network for efficient image retrieval. In: Thirtieth AAAI Conference on artificial intelligence, 2016
Liu H, Wang R, Shan S, Chen X (2016) Deep supervised hashing for fast image retrieval. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 2064–2072, 2016
Zhang Z, Zou Q, Lin Y, Chen L, Wang S (2019) Improved deep hashing with soft pairwise similarity for multi-label image retrieval. IEEE Trans Multimed 22(2):540–553
Krizhevsky A, Hinton G et al (2009) Learning multiple layers of features from tiny images. Technical report, Citeseer
Russakovsky O, Deng J, Hao S, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252
Chua T-S, Tang J, Hong R, Li H, Luo Z, Zheng Y (2009) Nus-wide: a real-world web image database from national university of Singapore. In: Proceedings of the ACM International Conference on image and video retrieval, p 48. ACM, 2009
Tianchi database obtained from ai challenge in xuelang manufacturing: Visual computing assisted quality inspection. https://tianchi.aliyun.com/competition/entrance/231666/%20introduction. Accessed 10 July 2018
Fabric database collected from the competition of Guangdong industrial intelligence innovation. http://bbs.cvmart.net/topics/706/tianchi. Accessed 8 Aug 2019
Xiao H, Rasul K, Vollgraf R (2017) Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. arXiv:1708.07747
Liu Z, Luo P, Qiu S, Wang X, Tang X (2016) Deepfashion: powering robust clothes recognition and retrieval with rich annotations. In: Proceedings of IEEE Conference on computer vision and pattern recognition, pp 1096–1104
Acknowledgements
This research is supported by Laboratory for Artificial Intelligence in Design (Project Code: RP3-4) under InnoHK Research Clusters, Hong Kong SAR Government.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Mo, D., Wong, W.K., Liu, X. et al. Concentrated hashing with neighborhood embedding for image retrieval and classification. Int. J. Mach. Learn. & Cyber. 13, 1571–1587 (2022). https://doi.org/10.1007/s13042-021-01466-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-021-01466-7