research-article

Efficient large-scale image annotation by probabilistic collaborative multi-label propagation

Authors:

Tat-Seng ChuaAuthors Info & Claims

MM '10: Proceedings of the 18th ACM international conference on Multimedia

Pages 35 - 44

https://doi.org/10.1145/1873951.1873959

Published: 25 October 2010 Publication History

Abstract

Annotating large-scale image corpus requires huge amount of human efforts and is thus generally unaffordable, which directly motivates recent development of semi-supervised or active annotation methods. In this paper we revisit this notoriously challenging problem and develop a novel multi-label propagation scheme, whereby both the efficacy and accuracy of large-scale image annotation are further enhanced. Our investigation starts from a survey of previous graph propagation based annotation approaches, wherein we analyze their main drawbacks when scaling up to large-scale datasets and handling multi-label setting. Our proposed scheme outperforms the state-of-the-art algorithms by making the following contributions. 1) Unlike previous approaches that propagate over individual label independently, our proposed large-scale multi-label propagation (LSMP) scheme encodes the tag information of an image as a unit label confidence vector, which naturally imposes inter-label constraints and manipulates labels interactively. It then utilizes the probabilistic Kullback-Leibler divergence for problem formulation on multi-label propagation. 2) We perform the multi-label propagation on the so-called hashing-based L1-graph, which is efficiently derived with Locality Sensitive Hashing approach followed by sparse L1-graph construction within the individual hashing buckets. 3) An efficient and convergency provable iterative procedure is presented for problem optimization. Extensive experiments on NUS-WIDE dataset (both lite version with 56k images and full version with 270k images) well validate the effectiveness and scalability of the proposed approach.

References

[1]

A. Andoni and P. Indyk. Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Commun. ACM, 51(1):117--122, February 2008.

Digital Library

[2]

S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, 2004.

Digital Library

[3]

E. J. Candes, J. K. Romberg, and T. Tao. Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Transactions on Information Theory, 52(2):489--509, February 2006.

Digital Library

[4]

G. Chen, Y. Song, F. Wang, and C. Zhang. Semi-supervised multi-label learning by solving a sylvester equation. In SIAM International Conference on Data Mining, 2008.

[5]

T.-S. Chua, J. Tang, R. Hong, H. Li, Z. Luo, and Y.-T. Zheng. Nus-wide: A real-world web image database from national university of singapore. In CIVR, July 2009.

Digital Library

[6]

R. Collobert, F. H. Sinz, J.Weston, and L. Bottou. Large scale transductive svms. Journal of Machine Learning Research, 7:1687--1712, September 2006.

Digital Library

[7]

T. M. Cover and J. A. Thomas. Elements of Information Theory. Wiley Series in Telecommunications, 1991.

Digital Library

[8]

O. Delalleau, Y. Bengio, and N. Le Roux. Efficient non-parametric function induction in semi-supervised learning. In Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics, pages 96--103, 2005.

[9]

R. Duda, D. Stork, and P. Hart. Pattern Classification. JOHN WILEY, 2000.

Digital Library

[10]

P. Indyk and R. Motwani. Approximate nearest neighbors: Towards removing the curse of dimensionality. In Proceedings of the Symposium on Theory Computing, 1998.

Digital Library

[11]

M. Karlen, J. Weston, A. Erkan, and R. Collobert. Large-scale manifold transduction. In ICML, 2008.

Digital Library

[12]

D. Liu, X.-S. Hua, L. Yang, M. Wang, and H. jiang Zhang. Tag ranking. In WWW, 2009.

Digital Library

[13]

Y. Liu, R. Jin, and L. Yang. Semi-supervised multi-label learning by constrained non-negative matrix factorization. In AAAI, 2006.

Digital Library

[14]

Y. Mu, J. Shen, and S. Yan. Weakly-supervised hashing in kernel space. In CVPR, 2010.

[15]

G.-J. Qi, X.-S. Hua, Y. Rui, J. Tang, T. Mei, and H.-J. Zhang. Correlative multi-label video annotation. In MM, 2007.

Digital Library

[16]

S.T.Roweis and L.K.Saul. Nonlinear dimensionality reduction by locally linear embedding. Science, 290:2323--2326, 2000.

[17]

A. Subramanya and J. Bilmes. Entropic graph regularization in non-parametric semi-supervised classification. In NIPS, 2009.

[18]

J. Tang, S. Yan, R. Hong, G.-J. Qi, and T.-S. Chua. Inferring semantic concepts from community-contributed images and noisy tags. In MM, 2009.

Digital Library

[19]

I. W. Tsang and J. T. Kwok. Large-scale sparsified manifold regularization. In NIPS, 2006.

[20]

F. Wang and C. Zhang. Label propagation through linear neighborhoods. In ICML, June 2006.

Digital Library

[21]

J. Yuan, J. Li, and B. Zhang. Exploiting spatial context constraints for automatic image region annotation. In MM, 2007.

Digital Library

[22]

Z.-J. Zha, T. Mei, J. Wang, Z.Wang, and X.-S. Hua. Graph-based semi-supervised learning with multiple labels. Journal of Visual Communication and Image Representation, 20(2):97--103, February 2009.

Digital Library

[23]

X. Zhu. Semi-supervised learning with graphs. Carnegie Mellon University, 2005.

[24]

X. Zhu. Semi-Supervised Learning Literature Survey. Carnegie Mellon University, 2006.

Cited By

Stureborg RDhingra BYang J(2023)Interface Design for Crowdsourcing Hierarchical Multi-Label Text AnnotationsProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581431(1-17)Online publication date: 19-Apr-2023
https://dl.acm.org/doi/10.1145/3544548.3581431
Li ZLin LZhang CMa HZhao WShi Z(2021)A Semi-supervised Learning Approach Based on Adaptive Weighted Fusion for Automatic Image AnnotationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/342697417:1(1-23)Online publication date: 16-Apr-2021
https://dl.acm.org/doi/10.1145/3426974
Chen YLin XGe KHe WLi D(2020)Tag Pollution Detection in Web Videos via Cross-Modal Relevance Estimation2020 IEEE/ACM 28th International Symposium on Quality of Service (IWQoS)10.1109/IWQoS49365.2020.9212971(1-10)Online publication date: Jun-2020
https://doi.org/10.1109/IWQoS49365.2020.9212971
Show More Cited By

Index Terms

Efficient large-scale image annotation by probabilistic collaborative multi-label propagation
1. Information systems
  1. Information retrieval
    1. Document representation
    2. Search engine architectures and scalability
      1. Search engine indexing

Recommendations

Instance Annotation for Multi-Instance Multi-Label Learning
Special Issue on ACM SIGKDD 2012

Multi-instance multi-label learning (MIML) is a framework for supervised classification where the objects to be classified are bags of instances associated with multiple labels. For example, an image can be represented as a bag of segments and ...
Multi-label learning with missing labels for image annotation and facial action unit recognition

Many problems in computer vision, such as image annotation, can be formulated as multi-label learning problems. It is typically assumed that the complete label assignment for each training image is available. However, this is often not the case in ...
Multi-view based multi-label propagation for image annotation

Multi-view learning and multi-label propagation are two common approaches to address the problem of image annotation. Traditional multi-view methods disregard the consistencies among different views while existing algorithms toward multi-label ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '10: Proceedings of the 18th ACM international conference on Multimedia

October 2010

1836 pages

ISBN:9781605589336

DOI:10.1145/1873951

General Chairs:
Alberto del Bimbo
University of Florence, Italy
,
Shih-Fu Chang
Columbia University, USA
,
Program Chair:
Arnold Smeulders
University of Amsterdam, NL

Copyright © 2010 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 October 2010

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

MM '10

Sponsor:

SIGMM

MM '10: ACM Multimedia Conference

October 25 - 29, 2010

Firenze, Italy

Acceptance Rates

Overall Acceptance Rate 995 of 4,171 submissions, 24%

Upcoming Conference

MM '24

Sponsor:
sigmm

The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

54
Total Citations
View Citations
636
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Reflects downloads up to 15 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Stureborg RDhingra BYang J(2023)Interface Design for Crowdsourcing Hierarchical Multi-Label Text AnnotationsProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581431(1-17)Online publication date: 19-Apr-2023
https://dl.acm.org/doi/10.1145/3544548.3581431
Li ZLin LZhang CMa HZhao WShi Z(2021)A Semi-supervised Learning Approach Based on Adaptive Weighted Fusion for Automatic Image AnnotationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/342697417:1(1-23)Online publication date: 16-Apr-2021
https://dl.acm.org/doi/10.1145/3426974
Chen YLin XGe KHe WLi D(2020)Tag Pollution Detection in Web Videos via Cross-Modal Relevance Estimation2020 IEEE/ACM 28th International Symposium on Quality of Service (IWQoS)10.1109/IWQoS49365.2020.9212971(1-10)Online publication date: Jun-2020
https://doi.org/10.1109/IWQoS49365.2020.9212971
Li ZLin LZhang CMa HZhao WEl Saddik ADel Bimbo AZhang ZHauptmann ACandan KBertini MXie LWei X(2019)Collaborating CNN and SVM for Automatic Image AnnotationProceedings of the 2019 on International Conference on Multimedia Retrieval10.1145/3323873.3325023(63-67)Online publication date: 5-Jun-2019
https://dl.acm.org/doi/10.1145/3323873.3325023
Cartwright MDove GMéndez Méndez ABello JNov OBrewster SFitzpatrick GCox AKostakos V(2019)Crowdsourcing Multi-label Audio Annotation Tasks with Citizen ScientistsProceedings of the 2019 CHI Conference on Human Factors in Computing Systems10.1145/3290605.3300522(1-11)Online publication date: 2-May-2019
https://dl.acm.org/doi/10.1145/3290605.3300522
Shao QLiu B(2019)Laplacian Eigenmaps Regularized Feature Mapping for Image Annotation2019 IEEE International Conference on Systems, Man and Cybernetics (SMC)10.1109/SMC.2019.8913981(3901-3906)Online publication date: Oct-2019
https://doi.org/10.1109/SMC.2019.8913981
Li JJing MXie YLu KHuang Z(2019)Agile Domain Adaptation2019 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN.2019.8852479(1-8)Online publication date: Jul-2019
https://doi.org/10.1109/IJCNN.2019.8852479
Li QWu QZhu CZhang JZhao W(2019)An Inferable Representation Learning for Fraud Review Detection with Cold-start Problem2019 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN.2019.8852437(1-8)Online publication date: Jul-2019
https://doi.org/10.1109/IJCNN.2019.8852437
Li HLi XChen XXie XMu YFeng Z(2019)Cross-project Defect Prediction via ASTToken2Vec and BLSTM-based Neural Network2019 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN.2019.8852135(1-8)Online publication date: Jul-2019
https://doi.org/10.1109/IJCNN.2019.8852135
Li ZLin LZhang CMa HZhao W(2019)Automatic Image Annotation based on Co-Training2019 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN.2019.8852047(1-8)Online publication date: Jul-2019
https://doi.org/10.1109/IJCNN.2019.8852047
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents