Abstract
This study introduces a robust framework for enhancing Content-Based Image Retrieval (CBIR) systems through the integration of supervised and unsupervised machine learning algorithms. Supervised learning algorithms, such as K-Nearest Neighbors (KNN), Support Vector Machines (SVM), Linear Discriminant Analysis (LDA), and ensemble methods like Bagging and AdaBoost, are used with unsupervised learning techniques, including K-Means and K-Medoids clustering to improve the performance of CBIR. The core of the framework leverages advanced feature extraction methods, specifically ResNet-HOG Visual Word Fusion (RVWF) and ResNet-HOG Feature Fusion (RHFF), which utilize ResNet-50 for capturing high-level semantic information and Histogram of Oriented Gradients (HOG) for detailed texture analysis. A comparison was made between the similarity-based CBIR (standalone CBIR), classification-based CBIR, and clustering-based CBIR methods. The findings reveal that classification-based CBIR methods are superior to standalone and clustering-based CBIR methods in terms of retrieval accuracy and semantic interpretation. The proposed methods outperformed the state-of-the-art methods for different databases used in this study. The proposed frameworks demonstrated superior performance across multiple databases, including VisTex, Brodatz, Corel 10K, and Corel 1K. In the VisTex database, clustering using K-Medoids-based RVWF increased performance from 98.75% to 99.52%, while classification methods like Linear Discriminant or Bagging-based RVWF achieved 100% accuracy. Similarly, in the Brodatz database, K-Medoids-based RVWF clustering improved accuracy from 97.62% to 99.62%, with classification methods such as AdaBoost or Bagging-based RVWF reaching up to 100% accuracy. For the Corel 1K and Corel 10K databases, K-Medoids-based RVWF clustering enhanced results to 95.61% and 99.20% for RVW, respectively, while classification methods further increased accuracy to 98.20% for Corel 1K and 100% for Corel 10K. These results show that combining advanced feature extraction with machine learning algorithms can improve the performance of CBIR systems. CBIR based on clustering proved to outperform standalone CBIR systems, while classification-based CBIR systems offered the best results, making them the most suitable for accurate image retrieval.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
Mentioned in the paper.
Code availability
Not applicable and could be provided.
References
Kakulapati V, Pentapati V (2022) A textual framework for contour retrieval using sub-multiple contour striking spread position learning. Int J Inf Technol 14:1575–1583. https://doi.org/10.1007/s41870-020-00425-9
Agarwal M (2023) Neighborhood ternary co-occurrence for natural and texture image retrieval. Int J Inf Technol 15:1999–2006. https://doi.org/10.1007/s41870-023-01238-2
Tamilkodi R, Nesakumari GR (2022) Image retrieval system based on multi feature extraction and its performance assessment. Int J Inf Technol 14:1161–1173. https://doi.org/10.1007/s41870-020-00556-z
Kanaparthi SK, Raju USN (2022) Content based image retrieval on big image data using local and global features. Int J Inf Technol 14:49–68. https://doi.org/10.1007/s41870-021-00806-8
Ahmad K, Sahu M, Shrivastava M et al (2020) An efficient image retrieval tool: query-based image management system. Int J Inf Technol 12:103–111. https://doi.org/10.1007/s41870-018-0198-9
Zhou Z, Li M, Chen H, Ma Y, Luo J (2017) A review of recent advances in content-based image retrieval. Multimed Tools Appl 76(20):21271–21311
Pang L, Du L, Han J, Ding Y (2017) A review of recent advances in content-based medical image retrieval. Healthc Technol Lett 4(2):45–52
Li X, Hu Y, Zhang L, Zhang L (2017) Deep collaborative learning for content-based image retrieval. Neurocomputing 267:630–640
Wang X, Feng Y, Wang R, Li S (2021) Unsupervised deep learning-based hashing for content-based image retrieval. J Vis Commun Image Represent 73:102934
Yousuf M, Mehmood Z, Habib HA, Mahmood T, Saba T, Rehman A, Rashid M (2018) A novel technique based on visual words fusion analysis of sparse features for effective content-based image retrieval. Math Probl Eng 2018:1–13. https://doi.org/10.1155/2018/2134395
Mehmood Z, Abbas F, Mahmood T, Javid MA, Rehman A, Nawaz T (2018) Content based image retrieval based on visual words fusion versus features fusion of local and global features. Arab J Sci Eng 43(12):7265–7284. https://doi.org/10.1007/s13369-018-3062-0
Ashraf R, Bashir K, Irtaza A, Mahmood M (2015) Content based image retrieval using embedded neural networks with bandletized regions. Entropy 17(6):3552–3580. https://doi.org/10.3390/e17063552
Sarwar A, Mehmood Z, Saba T, Qazi KA, Adnan A, Jamal H (2019) A novel method for content-based image retrieval to improve the effectiveness of the bag-of-words model using a support vector machine. J Inf Sci 45(1):117–135. https://doi.org/10.1177/0165551518782825
Wan J (2014) Deep learning for content-based image retrieval. In: proceedings of the ACM international conference on multimedia-MM ’14, New York, USA, p 157–166
Alzu’bi A, Amira A, Ramzan N (2017) Content-based image retrieval with compact deep convolutional features. Neurocomputing 249:95–105. https://doi.org/10.1016/j.neucom.2017.03.072
Tzelepi M, Tefas A (2018) Deep convolutional learning for content based image retrieval. Neurocomputing 275:2467–2478. https://doi.org/10.1016/j.neucom.2017.11.022
Zheng Q, Tian X, Yang M, Wang H (2019) Differential learning: a powerful tool for interactive content-based image retrieval. Eng Lett 27(1):202–215
Sezavar A, Farsi H, Mohamadzadeh S (2019) Content-based image retrieval by combining convolutional neural networks and sparse representation. Multimed Tools Appl 78(15):20895–20912. https://doi.org/10.1007/s11042-019-7321-1
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of IEEE conference on computer vision and pattern recognition (CVPR), LasVegas, USA, 27–30 June, p 770–778, IEEE, USA
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of IEEE computer society conference on computer vision and pattern recognition (CVPR’05), San Diego, CA, USA, 20–25 June, IEEE, USA
Alrahhal M, Supreethi KP (2022) COVID-19 diagnostic system using medical image classification and retrieval: a novel method for image analysis. Comput J 65(8):2146–2163. https://doi.org/10.1093/comjnl/bxab051
Alrahhal M, Supreethi K (2019) Content-based image retrieval using local patterns and supervised machine learning techniques. In: proceedings of amity international conference on artificial intelligence (AICAI), Dubai, United Arab Emirates, 4– 6 February, IEEE, Amity University, Dubai, United Arab Emirates, p 118–124
Alrahhal M, Supreethi KP (2021) Full direction local neighbors pattern (FDLNP). Int J Adv Comput Sci Appl (IJACSA). https://doi.org/10.14569/IJACSA.2021.0120116
Alrahhal M, Supreethi KP (2020) Multimedia image retrieval system by combining CNN with handcraft features in three different similarity measures. Int J Comput Vis Image Process 10:1–23
Corel 1000 image database. http://wang.ist.psu.edu/docs/related/
Content based image retrieval/image database search engine (SIMPLIcity, WIPE, virtual microscope), Wang.ist.psu.edu, 2021. http://wang.ist.psu.edu/docs/related/
Brodatz P (1996) Textures: A Photographic Album for Artist and Designers. Dover, New York
SIPI Image Database, Sipi.usc.edu, 2021. http://sipi.usc.edu/database/
Index of /pub/VisTex, Vismod.media.mit.edu, 2021. https://vismod.media.mit.edu/pub/VisTex/
Arthur D, Vassilvitskii S (2007) K-means++: the advantages of careful seeding. In: proceedings of the 18th annual ACM-SIAM symposium on discrete algorithms, 7–9 January. Society for industrial and applied mathematics, 3600 University city science center Philadelphia, PA, p 1027–1035
Zhang Z (2010) A comprehensive review of the K-nearest neighbor algorithm. Advances in Artificial Intelligence. Springer, Berlin, pp 7–20
Gou J, Gao Y (2011) Improved KNN text classification algorithm based on cosine similarity. In: international conference on computer science and network technology, IEEE, p 724–727
Rakotomamonjy A (2020) Support vector machines: recent trends and open problems. Neural Netw 133:97–116
Gao Q, Maji P (2021) Support vector machines: a survey. ACM Comput Surv (CSUR) 54(2):1–38
Khan F, Arif M, Saeed A (2020) Feature extraction using linear discriminant analysis and K-nearest neighbors for EEG-based emotion recognition. J Ambient Intell Humaniz Comput 11(10):4831–4843
Xu C, Yang H, Wang H (2021) A novel robust linear discriminant analysis for face recognition. Knowl Based Syst 216:106753
Chrysanthou G, Pavlou A (2020) Decision tree ensemble with bagging for detection of cybersecurity threats. Comput Secur 90:101714
Zhang X, Zhou Z, Tang Y, Li S, Wang S (2021) An improved random forest based on bagging ensemble algorithm. IEEE Access 9:20796–20811
Yang L, Jiang H, Zhao S, Zhou X (2021) AdaBoost-ResNet: a novel network traffic classification approach based on deep learning and ensemble learning. Inf Sci 559:147–166
Wang W, Sun D (2021) The improved AdaBoost algorithms for imbalanced data classification. Inf Sci 563:358–374. https://doi.org/10.1016/j.ins.2021.03.042
Veloso A., Reis J, Vasconcelos F, Silveira J, Abreu P, Sarmento G, Silva T, Rabêlo R (2024) Deep clustering algorithm for load profile business intelligence dashboard for consumer and utility management. Proceedings of the 20th Brazilian Symposium on Information Systems, 1–8. https://doi.org/10.1145/3658321.3658368
Jha PC, Biswas R, Koley S (2022) An efficient K-means clustering algorithm for big data analytics in the internet of things. J Netw Comput Appl 195:105073
Pratap RA, Vani KS, Devi JR, Rao KN (2011) An efficient density-based improved K-Medoids clustering algorithm. Int Adv Comput Sci Appl (IJACSA) 2(6). https://doi.org/10.14569/IJACSA.2011.020607
Deng J, Guo J, Wang Y, Wang Y (2019) A novel K-medoids clustering recommendation algorithm based on probability distribution for collaborative filtering. Knowl-Based Syst 175:12. https://doi.org/10.1016/j.knosys.2019.03.009
Funding
Not applicable.
Author information
Authors and Affiliations
Contributions
The authors, Maher Alrahhal and Supreethi K P, contributed equally to the work.
Corresponding author
Ethics declarations
Conflict of interest
The authors confirm no conflict of interest.
Informed consent
Not applicable.
Research involving human and /or animals
Not applicable.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Alrahhal, M., Supreethi, K.P. Integrating machine learning algorithms for robust content-based image retrieval. Int. j. inf. tecnol. 16, 5005–5021 (2024). https://doi.org/10.1007/s41870-024-02169-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-024-02169-2