Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1178677.1178689acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

An adaptive graph model for automatic image annotation

Published: 26 October 2006 Publication History

Abstract

Automatic keyword annotation is a promising solution to enable more effective image search by using keywords. In this paper, we propose a novel automatic image annotation method based on manifold ranking learning, in which the visual and textual information are well integrated. Due to complex and unbalanced data distribution and limited prior information in practice, we design two new schemes to make manifold ranking efficient for image annotation. Firstly, we design a new scheme named the Nearest Spanning Chain (NSC) to generate an adaptive similarity graph, which is robust across data distribution and easy to implement. Secondly, the word-to-word correlations obtained from WordNet and the pairwise co-occurrence are taken into consideration to expand the annotations and prune irrelevant annotations for each image. Experiments conducted on standard Corel dataset and web image dataset demonstrate the effectiveness and efficiency of the proposed method for image annotation.

References

[1]
Budanitsky, A. and Hirst, G. Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In Workshop on WordNet and Other Lexical Resources, 2 nd of the North American Chapter of the ACL, Pittsburgh, 2001.
[2]
Claudio, C., Gianluigi, C., Raimondo, S. Image annotation using SVM. In Proceeding Of Internet imaging IV, Vol. SPIE, 2004.
[3]
Cai, D., Yu, S., Wen, J.R. and Ma, W.Y. VIPS: a vision-based page segmentation algorithm. Microsoft Technical Report (MSR-TR-2003-79), 2003.
[4]
Edward Chang, Kingshy Goh, Gerard Sychay, Gang Wu. CBSA: content-base soft annotation for multimodal image retrieval using bayes point machines. CirSysVideo, pp. 26--38, 13(1), 2003.
[5]
David, M., Michael, I. Jordan. Modeling annotated data. Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 127--134, July 2003.
[6]
Gustavo, C. and Nuno, V. A database centric view of semantic image annotation and retrieval. In ACM SIGIR Conf. on Information Retrieval, Salvador, Brazil. 2005.
[7]
He, J., Li, M., Zhang, H.J. Tong, H. and Zhang, C. Manifold-ranking based Image Retrieval. Proceedings of the 12thannual ACM international conference on Multimedia, pp. 9--16, New York, 2004.
[8]
He, X., Cai, D., Liu, H. and Ma, W.Y. Learning a locality preserving subspace for visual recognition. In Proc. IEEE Conf. on Computer Vision, Nice, France, 2003.
[9]
He, X., Cai, D., Liu, H. and Ma, W.Y. Locality preserving indexing for document representation. In ACM SIGIR Conf. on Information Retrieval, Sheffield, 2004.
[10]
He, X., Ma, W.Y. and Zhang, H.J. Learning an image manifold for retrieval. In Proc. of ACM international conference on Multimedia, New York, USA, 2004.
[11]
Hua, Z.G., Wang, X.J., Liu, Q.S. and Lu, H.Q. Semantic knowledge Extraction and Annotation for web images. Proceedings of the 13th Annual ACM International Conference On Multimedia, pp. 467--470, Singapore, 2005.
[12]
J. Jeon, V. Lavrenko and R. Manmatha. Automatic Image Annotation and Retrieval Using Cross-media Relevance Models. In Proc. of ACM SIGIR conference on Research and development in information retrieval, pp. 119--126, July 2003.
[13]
Jiang, J. and Conrath, D. Semantic similarity based on corpus statistics and lexical taxonomy. In Proceedings on International Conference on Research in Computational Linguistics, 1997.
[14]
Kobus, B., Pinar, D. et al. Matching words and pictures. Journal of Machine Learning Research, pp. 1107--1135, 2003.
[15]
Li, J. and Wang, J. Z. Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans. On Pattern Analysis and Machine Intelligence, pp. 1075--1088, 25(19), 2003.
[16]
Pucher, M. Performance Evaluation of WordNet-based Semantic Relatedness Measures for Word Prediction in Conversational Speech. In Sixth International Workshop on Computational Semantics, Tilburg, Netherlands, 2005.
[17]
Pinar, D. and Kobus, B. Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Seventh European Conference on Computer Vision, 4:97--112, 2002.
[18]
Pan, J.Y., Yang, H.J. and Pinar, D. Automatic multimedia cross-modal correlation discovery. The Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 653--658, August 2004.
[19]
R. Manmatha, V. Lavrenko, and J. Jeon, A Model for Learning the Semantics of Pictures. In Proc. of the 17th Annual Conf. on Neural Information Processing Systems, 2003.
[20]
S. L. Feng, R. Manmatha and V. Lavrenko. Multiple Bernouli Relevance Models for Image and Video Annotation. In Proc. Of CVPR, Washington, DC, June, 2004.
[21]
Shi, J.B. and J. Malik. Normalized Cuts and Image Segmentation. IEEE Conference Computer Vision and Pattern Recognition(CVPR), pp. 731--737, June 1997.
[22]
Tong, H., He, J., Li, M.J., Zhang, C., and Ma W.Y., Graph Based Multi-Modality Learning. Proceedings of the 13th Annual ACM international conference on Multimedia, pp. 862--871, Singapore, 2005.
[23]
Wojciech, M., Hanspeter, P., Matt, B. A data-driven reflectance model. In Proc. of SIGGRAPH, 2003.
[24]
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., and Schölkopf, B. Ranking on Data Manifolds. 18th Annual Conf. on Neural Information Processing System, pp. 169--176, 2003.
[25]
Zhou, D., J. Huang and B. Schölkopf. Learning with local and global consistency. 18 th Annual Conference on Neural Information Processing Systems, 2003.
[26]
Yohan Jin, Khan, L., Wang, L. and Awad, M. Image Annotations By Combining Multiple Evidence & WordNet. Proceedings of the 13th Annual ACM International Conference On Multimedia, pp. 706--715, Singapore, 2005.

Cited By

View all
  • (2021)Generic Multi-label Annotation via Adaptive Graph and Marginalized AugmentationACM Transactions on Knowledge Discovery from Data10.1145/345188416:1(1-20)Online publication date: 20-Jul-2021
  • (2021)Semi-Supervised Dual Relation Learning for Multi-Label ClassificationIEEE Transactions on Image Processing10.1109/TIP.2021.312200330(9125-9135)Online publication date: 2021
  • (2019)Adaptive Hypergraph Embedded Semi-Supervised Multi-Label Image AnnotationIEEE Transactions on Multimedia10.1109/TMM.2019.290986021:11(2837-2849)Online publication date: Nov-2019
  • Show More Cited By

Index Terms

  1. An adaptive graph model for automatic image annotation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval
    October 2006
    344 pages
    ISBN:1595934952
    DOI:10.1145/1178677
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 October 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. image annotation
    2. image retrieval
    3. manifold ranking

    Qualifiers

    • Article

    Conference

    MM06
    MM06: The 14th ACM International Conference on Multimedia 2006
    October 26 - 27, 2006
    California, Santa Barbara, USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 09 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Generic Multi-label Annotation via Adaptive Graph and Marginalized AugmentationACM Transactions on Knowledge Discovery from Data10.1145/345188416:1(1-20)Online publication date: 20-Jul-2021
    • (2021)Semi-Supervised Dual Relation Learning for Multi-Label ClassificationIEEE Transactions on Image Processing10.1109/TIP.2021.312200330(9125-9135)Online publication date: 2021
    • (2019)Adaptive Hypergraph Embedded Semi-Supervised Multi-Label Image AnnotationIEEE Transactions on Multimedia10.1109/TMM.2019.290986021:11(2837-2849)Online publication date: Nov-2019
    • (2018)Adaptive graph guided embedding for multi-label annotationProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304889.3305049(2798-2804)Online publication date: 13-Jul-2018
    • (2018)Multimedia automatic annotation by mining label set correlationMultimedia Tools and Applications10.1007/s11042-017-5170-377:3(3473-3491)Online publication date: 1-Feb-2018
    • (2018)Kernel Based Approaches for Context Based Image AnnotatıonComputational Vision and Bio Inspired Computing10.1007/978-3-319-71767-8_21(249-258)Online publication date: 20-Feb-2018
    • (2016)Automatic image annotation refinement2016 39th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)10.1109/MIPRO.2016.7522345(1324-1329)Online publication date: May-2016
    • (2016)Annotation-retrieval reinforcement by visual cognition modeling on manifoldNeurocomputing10.1016/j.neucom.2015.07.162215:C(150-159)Online publication date: 26-Nov-2016
    • (2016)Multi-scale salient region and relevant visual keywords based model for automatic image annotationMultimedia Tools and Applications10.1007/s11042-014-2318-275:20(12477-12498)Online publication date: 1-Oct-2016
    • (2015)Graph Learning on K Nearest Neighbours for Automatic Image AnnotationProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749383(403-410)Online publication date: 22-Jun-2015
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media