Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1460096.1460126acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Learning tag relevance by neighbor voting for social image retrieval

Published: 30 October 2008 Publication History

Abstract

Social image retrieval is important for exploiting the increasing amounts of amateur-tagged multimedia such as Flickr images. Since amateur tagging is known to be uncontrolled, ambiguous, and personalized, a fundamental problem is how to reliably interpret the relevance of a tag with respect to the visual content it is describing. Intuitively, if different persons label similar images using the same tags, these tags are likely to reflect objective aspects of the visual content. Starting from this intuition, we propose a novel algorithm that scalably and reliably learns tag relevance by accumulating votes from visually similar neighbors. Further, treated as tag frequency, learned tag relevance is seamlessly embedded into current tag-based social image retrieval paradigms.
Preliminary experiments on one million Flickr images demonstrate the potential of the proposed algorithm. Overall comparisons for both single-word queries and multiple-word queries show substantial improvement over the baseline by learning and using tag relevance. Specifically, compared with the baseline using the original tags, on average, retrieval using improved tags increases mean average precision by 24%, from 0.54 to 0.67. Moreover, simulated experiments indicate that performance can be improved further by scaling up the amount of images used in the proposed neighbor voting algorithm.

References

[1]
E. Auchard. Flickr to map the world's latest photo hotspots, November 2007. Reuters, http://www.reuters.com/article/technologyNews/idUSHO94233920071119?sp=t%rue, retrieved on 2008--06--16.
[2]
K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. JMLR, pages 1107--1135, 2003.
[3]
G. Begelman, P. Keller, and F. Smadja. Automated tag clustering: Improving search and exploration in the tag space. In WWW Collaborative Web Tagging Workshop, 2006.
[4]
E. Chang, G. Kingshy, G. Sychay, and G. Wu. Cbsa: content-based soft annotation for multimodal image retrieval using bayes point machines. TCSVT, 13(2):26--38, 2003.
[5]
T.-S. Chua, S.-Y. Neo, K.-Y. Li, G. Wang, R. Shi, M. Zhao, and H. Xu. TRECVID 2004 search and feature extraction task by NUS PRIS. In TRECVID Workshop, 2004.
[6]
R. Datta, D. Joshi, J. Li, and J. Z. Wang. Image retrieval: Ideas, in uences, and trends of the new age. ACM Computing Survey, 40(2):1--60, 2008.
[7]
C. Fellbaum, editor. WordNet: An Electronic Lexical Database. The MIT Press, 1998.
[8]
R. Fergus, P. Perona, and A. Zisserman. A visual category filter for google images. In ECCV, pages 242--256, 2004.
[9]
H. Ferhatosmanoglu, E. Tuncel, D. Agrawal, and A. E. Abbadi. Approximate nearest neighbor searching in multimedia databases. In ICDE, pages 503--511, 2001.
[10]
S. A. Golder and B. A. Huberman. Usage patterns of collaborative tagging systems. Jour. Information Science, 32(2):198--208, 2006.
[11]
W. Hsu, L. Kennedy, and S.-F. Chang. Video search reranking via information bottleneck principle. In ACM Multimedia, pages 35--44, 2006.
[12]
J. Huang, S. Kumar, M. Mitra, W. Zhu, and R. Zabih. Image indexing using color correlograms. In CVPR, pages 762--768, 1997.
[13]
B. J. Jansen and A. Spink. How are we searching the world wide web?: a comparison of nine search engine transaction logs. JIPM, 42(1):248--263, 2006.
[14]
Y. Jin, L. Khan, L. Wang, and M. Awad. Image annotations by combining multiple evidence & Wordnet. In ACM Multimedia, pages 706--715, 2005.
[15]
K. S. Jones, S. Walker, and S. E. Robertson. A probabilistic model of information retrieval: development and comparative experiments - part 2. Jour. Information Processing and Management, 36(6):809--840, 2000.
[16]
L. Kennedy, M. Naaman, S. Ahern, R. Nair, and T. Rattenbury. How ickr helps us make sense of the world: Context and content in community-contributed media collections. In ACM Multimedia, pages 631--640, 2007.
[17]
J. Li and J. Z. Wang. Real-time computerized annotation of pictures. TPAMI, 30(6):985--1002, 2008.
[18]
X. Li, L. Chen, L. Zhang, F. Lin, and W.-Y. Ma. Image annotation by large-scale content-based image retrieval. In ACM Multimedia, pages 607--610, 2006.
[19]
X. Li, X.-J. Wang, C. Wang, and L. Zhang. SBIA: search-based image annotation by leveraging web-scale images. In ACM Multimedia, pages 467--468, 2007.
[20]
W.-H. Lin and A. Hauptmann. Web image retrieval re-ranking with relevance model. In Web Intelligence, pages 242--248, 2003.
[21]
K. Matusiak. Towards user-centered indexing in digital image collections. OCLC Systems and Services, 22(4):283--298, 2006.
[22]
F. F.-H. Nah. A study on tolerable waiting time: how long are Web users willing to wait? Jour. Behaviour and Information Technology, 23(3):153--163, 2004.
[23]
G. Park, Y. Baek, and H.-K. Lee. Majority based ranking approach in web image retrieval. In CIVR, pages 499--504, 2003.
[24]
N. Sebe and Q. Tian. Personalized multimedia retrieval: the new trend? In ACM MIR Workshop, pages 299--306, 2007.
[25]
A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. TPAMI, 22(2):1349--1380, 2000.
[26]
A. Torralba, R. Fergus, and W. Freeman. 80 million tiny images: a large dataset for non-parametric object and scene recognition. TPAMI, 2008 (in press).
[27]
C. Wang, F. Jing, L. Zhang, and H.-J. Zhang. Scalable search-based image annotation of personal images. In ACM MIR Workshop, pages 269--278, 2006.
[28]
C. Wang, F. Jing, L. Zhang, and H.-J. Zhang. Content-based image annotation refinement. In CVPR, pages 1--8, 2007.
[29]
X.-J. Wang, L. Zhang, X. Li, and W.-Y. Ma. Annotating images by mining image search results. TPAMI, 2008 (in press).
[30]
Y. Wu, J.-Y. Bouguet, A. Nefian, and I. V. Kozintsev. Learning concept templates from web images to query personal image database. In ICME, pages 1986--1989, 2007.
[31]
R. Yan, A. Hauptmann, and R. Jin. Multimedia search with pseudo-relevance feedback. In CIVR, pages 649--654, 2003.
[32]
H. Yu, M. Li, H. Zhang, and J. Feng. Color texture moment for content-based image retrieval. In ICIP, pages 929--932, 2002.

Cited By

View all
  • (2024)The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language ModelProceedings of the ACM on Human-Computer Interaction10.1145/36410198:CSCW1(1-31)Online publication date: 26-Apr-2024
  • (2022)A survey on social image semantic analysisChinese Science Bulletin10.1360/TB-2022-093868:25(3368-3384)Online publication date: 11-Nov-2022
  • (2021)How to Make a Query in Image Retrieval with Partial Information Extracted from Multiple Image Samples?International Journal of Pattern Recognition and Artificial Intelligence10.1142/S021800142154021535:07(2154021)Online publication date: 27-Mar-2021
  • Show More Cited By

Index Terms

  1. Learning tag relevance by neighbor voting for social image retrieval

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MIR '08: Proceedings of the 1st ACM international conference on Multimedia information retrieval
      October 2008
      506 pages
      ISBN:9781605583129
      DOI:10.1145/1460096
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 30 October 2008

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. neighbor voting
      2. social image retrieval
      3. tag relevance

      Qualifiers

      • Research-article

      Conference

      MM08
      Sponsor:
      MM08: ACM Multimedia Conference 2008
      October 30 - 31, 2008
      British Columbia, Vancouver, Canada

      Upcoming Conference

      MM '24
      The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)12
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 04 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language ModelProceedings of the ACM on Human-Computer Interaction10.1145/36410198:CSCW1(1-31)Online publication date: 26-Apr-2024
      • (2022)A survey on social image semantic analysisChinese Science Bulletin10.1360/TB-2022-093868:25(3368-3384)Online publication date: 11-Nov-2022
      • (2021)How to Make a Query in Image Retrieval with Partial Information Extracted from Multiple Image Samples?International Journal of Pattern Recognition and Artificial Intelligence10.1142/S021800142154021535:07(2154021)Online publication date: 27-Mar-2021
      • (2021)Image Tagging by Fine-tuning Class Semantics Using Text Data from Web Scraping2021 24th International Conference on Computer and Information Technology (ICCIT)10.1109/ICCIT54785.2021.9689793(1-6)Online publication date: 18-Dec-2021
      • (2021)Transfer Learning-Based Image Tagging Using Word Embedding Technique for Image Retrieval ApplicationsSoft Computing and its Engineering Applications10.1007/978-981-16-0708-0_14(157-168)Online publication date: 5-Mar-2021
      • (2019)Image annotation refinement via 2P-KNN based group sparse reconstructionMultimedia Tools and Applications10.1007/s11042-018-5925-578:10(13213-13225)Online publication date: 1-May-2019
      • (2019)Saliency detection on sampled images for tag rankingMultimedia Systems10.1007/s00530-017-0546-925:1(35-47)Online publication date: 1-Feb-2019
      • (2019)Socially-Enriched Multimedia Data Co-clusteringIndigenous Knowledge and Education in Africa10.1007/978-3-030-02985-2_5(111-135)Online publication date: 1-May-2019
      • (2019)IntroductionIndigenous Knowledge and Education in Africa10.1007/978-3-030-02985-2_1(3-14)Online publication date: 1-May-2019
      • (2018)IMAGE SEARCH BY COMPARING GABOR FILTER WITH SVM AND SIFTi-manager's Journal on Information Technology10.26634/jit.7.3.144037:3(10)Online publication date: 2018
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media