Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Public Access

Query Expansion for Content-Based Similarity Search Using Local and Global Features

Published: 31 May 2017 Publication History

Abstract

This article presents an efficient and totally unsupervised content-based similarity search method for multimedia data objects represented by high-dimensional feature vectors. The assumption is that the similarity measure is applicable to feature vectors of arbitrary length. During the offline process, different sets of features are selected by a generalized version of the Laplacian Score in an unsupervised way for individual data objects in the database. Online retrieval is performed by ranking the query object in the feature spaces of candidate objects. Those candidates for which the query object is ranked highly are selected as the query results. The ranking scheme is incorporated into an automated query expansion framework to further improve the semantic quality of the search result. Extensive experiments were conducted on several datasets to show the capability of the proposed method in boosting effectiveness without losing efficiency.

References

[1]
K. Bache and M. Lichman. 2013. UCI Machine Learning Repository. Retrieved from http://archive.ics.uci.edu/ml.
[2]
Claudio Carpineto and Giovanni Romano. 2012. A survey of automatic query expansion in information retrieval. ACM Comput. Surv. 44, 1 (2012).
[3]
Ondrej Chum, James Philbin, Josef Sivic, Michael Isard, and Andrew Zisserman. 2007. Total recall: Automatic query expansion with a generative feature model for object retrieval. In Proceedings of the 11th International Conference on Computer Vision. 1--8.
[4]
Wei Dong, Moses Charikar, and Kai Li. 2011. Efficient k-nearest neighbor graph construction for generic similarity measures. In Proceedings of the 20th International Conference on World Wide Web. 577--586.
[5]
Thomas Drugman, Mihai Gurban, and Jean-Philippe Thiran. 2007. Relevant feature selection for audio-visual speech recognition. In Proceedings of the International Workshop on Multimedia Signal Processing. 179--182.
[6]
Jennifer G. Dy, Carla E. Brodley, Avinash C. Kak, Lynn S. Broderick, and Alex M. Aisen. 2003. Unsupervised feature selection applied to content-based retrieval of lung images. IEEE Trans. Pattern Anal. Mach. Intell. 25, 3 (2003), 373--378.
[7]
M. Everingham, J. Sivic, and A. Zisserman. 2006. “Hello! My name is … Buffy”—Automatic naming of characters in TV video. In Proceedings of the British Machine Vision Conference. 899--908.
[8]
Mark A. Fanty and Ronald A. Cole. 1991. Spoken letter recognition. In Advances in Neural Information Processing Systems 3. Morgan-Kaufmann, 220--226.
[9]
Bailan Feng, Juan Cao, Zhineng Chen, Yongdong Zhang, and Shouxun Lin. 2010. Multi-modal query expansion for web video search. In SIGIR. 721--722.
[10]
François Fleuret. 2004. Fast binary feature selection with conditional mutual information. J. Mach. Learn. Res. 5 (2004), 1531--1555.
[11]
Lianli Gao, Jingkuan Song, Xingyi Liu, Junming Shao, Jiajun Liu, and Jie Shao. 2015. Learning in high-dimensional multimedia data: The state of the art. Multimedia Systems (2015), 1--11.
[12]
Aristides Gionis, Piotr Indyk, and Rajeev Motwani. 1999. Similarity search in high dimensions via hashing. In Proceedings of the 25th International Conference on Very Large Data Bases. 518--529.
[13]
Esin Guldogan and Moncef Gabbouj. 2008. Feature selection for content-based image retrieval. Sign. Image Video Process. 2, 3 (2008), 241--250.
[14]
Xiaofei He, Deng Cai, and Partha Niyogi. 2005. Laplacian score for feature selection. In Advances in Neural Information Processing Systems 18. MIT Press, 507--514.
[15]
Laura Hollink, Guus Schreiber, and Bob Wielinga. 2006. Query expansion for image content search. Pattern Recogn. 39 (2006), 210--222.
[16]
Michael E. Houle, Xiguo Ma, Vincent Oria, and Jichao Sun. 2014. Improving the quality of k-nn graphs for image databases through vector sparsification. In Proceedings of the International Conference on Multimedia Retrieval. 89--96.
[17]
Michael E. Houle and Michael Nett. 2015. Rank-based similarity search: Reducing the dimensional dependence. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1 (2015), 136--150.
[18]
Michael E. Houle, Vincent Oria, Shin’ichi Satoh, and Jichao Sun. 2013. Annotation propagation in image databases using similarity graphs. TOMCCAP 10, 1 (2013), 7.
[19]
Michael E. Houle and Jun Sakuma. 2005. Fast approximate similarity search in extremely high-dimensional data sets. In Proceedings of the 21st International Conference on Data Engineering. 619--630.
[20]
Wei Jiang, Guihua Er, Qionghai Dai, and Jinwei Gu. 2006. Similarity-based online feature selection in content-based image retrieval. IEEE Trans. Image Process. 15, 3 (2006), 702--712.
[21]
David R. Karger and Matthias Ruhl. 2002. Finding nearest neighbors in growth-restricted metrics. In Proceedings of the 34th Symposium on Theory of Computing. 741--750.
[22]
Yin-Hsi Kuo, Kuan-Ting Chen, Chien-Hsing Chiang, and Winston H. Hsu. 2009. Query expansion for hash-based image object retrieval. In Proceedings of the ACM Multimedia. 65--74.
[23]
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. 1998. Gradient-based learning applied to document recognition. Proc. IEEE 86, 11 (1998), 2278--2324.
[24]
Chee Wee Leong, Samer Hassan, Miguel E. Ruiz, and Rada Mihalcea. 2011. Improving query expansion for image retrieval via saliency and picturability. In Proceedings of the 2nd International Conference on Multilingual and Multimodal Information Access Evaluation (CLEF’11). 137--142.
[25]
Fei-Fei Li, Robert Fergus, and Pietro Perona. 2007. Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. Comput. Vis, Image Und. 106, 1 (2007), 59--70.
[26]
Yingfei Li, Bo Geng, Zheng-Jun Zha, Yangxi Li, Dacheng Tao, and Chao Xu. 2011a. Query expansion by spatial co-occurrence for image retrieval. In Proceedings of the ACM Multimedia. 1177--1180.
[27]
Yang Li, Fei-Fei Li, Ke Yi, Bin Yao, and Min Wang. 2011b. Flexible aggregate similarity search. In Proceedings of the International Conference on Management of Data (SIGMOD’11). 1009--1020.
[28]
Yan Liu and John R. Kender. 2004. Video feature selection using fast-converging sort-merge tree. In Proceedings of the IEEE Conference on Multimedia and Expo. 2083--2086.
[29]
Ying Liu, Dengsheng Zhang, Guojun Lu, and Wei-Ying Ma. 2007. A survey of content-based image retrieval with high-level semantics. Pattern Recogn. 40, 1 (2007), 262--282.
[30]
Nhu-Van Nguyen, Alain Boucher, Jean-Marc Ogier, and Salvatore Tabbone. 2010. Clusters-based relevance feedback for CBIR: A combination of query movement and query expansion. In Proceedings of the IEEE International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future. 1--6.
[31]
Michael Ortega-Binderberger and Sharad Mehrotra. 2004. Relevance feedback techniques in the MARS image retrieval system. Multimedia Syst. 9, 6 (2004), 535--547.
[32]
Md. Mahmudur Rahman, Sameer Antani, and George R. Thoma. 2011. A query expansion framework in image retrieval domain based on local and global analysis. Inf. Process. Manage. 47, 5 (2011), 676--691.
[33]
Esmat Rashedi, Hossein Nezamabadi-pour, and Saeid Saryazdi. 2009. GSA: A gravitational search algorithm. Inf. Sci. 179, 13 (2009), 2232--2248.
[34]
Esmat Rashedi, Hossein Nezamabadi-pour, and Saeid Saryazdi. 2013. A simultaneous feature adaptation and feature selection method for content-based image retrieval systems. Knowl.-Based Syst. 39 (2013), 85--94.
[35]
Juha Reunanen. 2007. Model selection and assessment using cross-indexing. In Proceedings of the International Joint Conference on Neural Networks. 2581--2585.
[36]
Pasi Saari, Tuomas Eerola, and Olivier Lartillot. 2011. Generalizability and simplicity as criteria in feature selection: Application to mood classification in music. IEEE Trans. Aud. Speech Lang. Process. 19, 6 (2011), 1802--1812.
[37]
S. Srinivasan, D. Ponceleon, D. Petkovic, and M. Viswanathan. 2000. Query expansion for imperfect speech: Applications in distributed learning. In Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries. 50--54.
[38]
Yu Sun and B. Bhanu. 2010. Image retrieval with feature selection and relevance feedback. In Proceedings of the 17th IEEE International Conference on Image Processing. 3209--3212.
[39]
Nuno Vasconcelos and Manuela Vasconcelos. 2004. Scalable discriminant feature selection for image retrieval and recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Vol. 2. 770--775.
[40]
Yang Wang, Xuemin Lin, Lin Wu, and Wenjie Zhang. 2015. Effective multi-query expansions: Robust landmark retrieval. In Proceedings of the 23rd Annual ACM Conference on Multimedia Conference (MM’15). 79--88.
[41]
Yi Yang, Heng Tao Shen, Zhigang Ma, Zi Huang, and Xiaofang Zhou. 2011. l2, 1-Norm regularized discriminative feature selection for unsupervised learning. In Proceedings of the International Joint Conferences on Artificial Intelligence. 1589--1594.
[42]
Yun Zhai, Jingen Liu, and Mubarak Shah. 2006. Automatic query expansion for news video retrieval. In Proceedings of the IEEE Conference on Multimedia and Expo. 965--968.
[43]
Zheng Zhao and Huan Liu. 2007. Spectral feature selection for supervised and unsupervised learning. In Proceedings of the 24th International Conference on Machine Learning. 1151--1157.

Cited By

View all
  • (2024)An Empirical Evaluation of Search Strategies for Locality-Sensitive Hashing: Lookup, Voting, and Natural Classifier SearchSimilarity Search and Applications10.1007/978-3-031-75823-2_13(155-169)Online publication date: 25-Oct-2024
  • (2023)RbQE: An Efficient Method for Content-Based Medical Image Retrieval Based on Query ExpansionJournal of Digital Imaging10.1007/s10278-022-00769-736:3(1248-1261)Online publication date: 26-Jan-2023
  • (2020)Query Expansion Based on Top-Ranked Images for Content-Based Medical Image RetrievalIEEE Access10.1109/ACCESS.2020.30335048(194541-194550)Online publication date: 2020
  • Show More Cited By

Index Terms

  1. Query Expansion for Content-Based Similarity Search Using Local and Global Features

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Multimedia Computing, Communications, and Applications
    ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 13, Issue 3
    August 2017
    233 pages
    ISSN:1551-6857
    EISSN:1551-6865
    DOI:10.1145/3104033
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 31 May 2017
    Accepted: 01 February 2017
    Revised: 01 January 2017
    Received: 01 August 2016
    Published in TOMM Volume 13, Issue 3

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Content-based similarity search
    2. flexible aggregation
    3. query expansion
    4. subjective feature space
    5. unsupervised feature selection

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)96
    • Downloads (Last 6 weeks)22
    Reflects downloads up to 03 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)An Empirical Evaluation of Search Strategies for Locality-Sensitive Hashing: Lookup, Voting, and Natural Classifier SearchSimilarity Search and Applications10.1007/978-3-031-75823-2_13(155-169)Online publication date: 25-Oct-2024
    • (2023)RbQE: An Efficient Method for Content-Based Medical Image Retrieval Based on Query ExpansionJournal of Digital Imaging10.1007/s10278-022-00769-736:3(1248-1261)Online publication date: 26-Jan-2023
    • (2020)Query Expansion Based on Top-Ranked Images for Content-Based Medical Image RetrievalIEEE Access10.1109/ACCESS.2020.30335048(194541-194550)Online publication date: 2020
    • (2020)LTR-expand: query expansion model based on learning to rank association rulesJournal of Intelligent Information Systems10.1007/s10844-020-00596-8Online publication date: 21-Mar-2020
    • (2018)Improved biomedical term selection in pseudo relevance feedbackDatabase10.1093/database/bay0562018:1Online publication date: 2-Jul-2018
    • (2018)Intrinsic Degree: An Estimator of the Local Growth Rate in GraphsSimilarity Search and Applications10.1007/978-3-030-02224-2_15(195-208)Online publication date: 7-Oct-2018
    • (2018)On the Correlation Between Local Intrinsic Dimensionality and OutliernessSimilarity Search and Applications10.1007/978-3-030-02224-2_14(177-191)Online publication date: 7-Oct-2018
    • (2017)A Discriminatively Learned CNN Embedding for Person ReidentificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/315917114:1(1-20)Online publication date: 13-Dec-2017

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Full Access

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media