research-article

Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images

Authors:

Ramesh JainAuthors Info & Claims

ACM Transactions on Intelligent Systems and Technology (TIST), Volume 2, Issue 2

Article No.: 14, Pages 1 - 15

https://doi.org/10.1145/1899412.1899418

Published: 24 February 2011 Publication History

Abstract

In this article, we exploit the problem of annotating a large-scale image corpus by label propagation over noisily tagged web images. To annotate the images more accurately, we propose a novel kNN-sparse graph-based semi-supervised learning approach for harnessing the labeled and unlabeled data simultaneously. The sparse graph constructed by datum-wise one-vs-kNN sparse reconstructions of all samples can remove most of the semantically unrelated links among the data, and thus it is more robust and discriminative than the conventional graphs. Meanwhile, we apply the approximate k nearest neighbors to accelerate the sparse graph construction without loosing its effectiveness. More importantly, we propose an effective training label refinement strategy within this graph-based learning framework to handle the noise in the training labels, by bringing in a dual regularization for both the quantity and sparsity of the noise. We conduct extensive experiments on a real-world image database consisting of 55,615 Flickr images and noisily tagged training labels. The results demonstrate both the effectiveness and efficiency of the proposed approach and its capability to deal with the noise in the training labels.

References

[1]

Belkin, M. and Niyogi, P. 2003. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput.

Digital Library

[2]

Chang, C.-C. and Lin, C.-J. 2001. LIBSVM: A library for support vector machines. http://www.csie.ntu.edu.tw/&ctilde;jlin/libsvm.

[3]

Chapelle, O., Zien, A., and Scholkopf, B. 2006. Semi-Supervised Learning. MIT Press.

Digital Library

[4]

Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., and Zheng, Y.-T. 2009. NUS-WIDE: A real-world web image database from national university of singapore. In Proceedings of the ACM Conference on Image and Video Retrieval.

Digital Library

[5]

Donoho, D. L. 2006. For most large underdetermined systems of linear equations the minimal &ell;₁-norm solution is also the sparsest solution. Comm. Pure Appl. Math. 59, 6, 797--829.

[6]

Duda, R., Stork, D., and Hart, P. 2000. Pattern Classification. J. Wiley.

Digital Library

[7]

Fergus, R., Fei-Fei, L., Perona, P., and Zisserman, A. 2005. Learning object categories from google's image search. In Proceedings of the IEEE International Conference on Computer Vision.

Digital Library

[8]

Goh, K.-S., Chang, E. Y., and Lai, W.-C. 2004. Multimodal concept-dependent active learning for image retrieval. In Proceedings of the 12th Annual ACM International Conference on Multimedia. 564--571.

Digital Library

[9]

He, J., Li, M., Zhang, H.-J., Tong, H., and Zhang, C. 2004. Manifold-ranking based image retrieval. In Proceedings of the 12th Annual ACM International Conference on Multimedia.

Digital Library

[10]

&ell;₁ MAGIC. http://www.acm.caltech.edu/l1magic/.

[11]

Li, X., Chen, L., Zhang, L., Lin, F., and Ma, W.-Y. 2006. Image annotation by large-scale content-based image retrieval. In Proceedings of the 14th Annual ACM International Conference on Multimedia.

Digital Library

[12]

Mount, D. and Arya, S. 1997. Ann: A library for approximate nearest neighbor searching. In Proceedings of the CGC 2nd Annual Fall Workship on Computational Geometry.

[13]

Rao, R., Olshausen, B., and Lewicki, M. 2002. Probabilistic Models of the Brain: Perception and Neural Function. MIT Press.

[14]

Roweis, S. T. and Saul, L. K. 2000. Nonlinear dimensionality reduction by locally linear embedding. Science 290, 2323--2326.

[15]

Saad, Y. 2003. Iterative Methods for Sparse Linear Systems 2nd Ed. Society for Industrial and Applied Mathematics.

Digital Library

[16]

Saad, Y. and Schultz, M. 1986. GMRES: A generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM J. Sci. Stat. Comp. 7, 856--869.

Digital Library

[17]

Sun, Y., Shimada, S., Taniguchi, Y., and Kojima, A. 2008. A novel region-based approach to visual concept modeling using web images. In Proceedings of the 16th ACM International Conference on Multimedia.

Digital Library

[18]

Tang, J., Hua, X.-S., Qi, G.-J., Wang, M., Mei, T., and Wu, X. 2007. Structure-sensitive manifold ranking for video concept detection. In Proceedings of the 15th ACM International Conference on Multimedia.

Digital Library

[19]

Tang, J., Hua, X.-S., Song, Y., Qi, G.-J., and Wu, X. 2008. Video annotation based on kernel linear neighborhood propagation. IEEE Trans. Multimedia 10, 4.

Digital Library

[20]

Tang, J., Yan, S., Hong, R., Qi, G.-J., and Chua, T.-S. 2009. Inferring semantic concepts from community-contributed images and noisy tags. In Proceedings of the 17th ACM International Conference on Multimedia. 223--232.

Digital Library

[21]

Torralba, A., Fergus, R., and Freeman, W. 2008. 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Trans. Patt. Anal. Mach. Intell. 30, 11.

Digital Library

[22]

TREC. Trec-10 proceedings appendix on common evaluation measures. http://trec.nist.gov/pubs/trec10/ appendices/measures.pdf.

[23]

Wang, C., Jing, F., Zhang, L., and Zhang, H.-J. 2006a. Image annotation refinement using random walk with restarts. In Proceedings of the 14th ACM International Conference on Multimedia.

Digital Library

[24]

Wang, F. and Zhang, C. 2008. Label propagation through linear neighborhoods. IEEE Trans. Knowl. Data Engin. 20, 1, 55--67.

Digital Library

[25]

Wang, X.-J., Zhang, L., Jing, F., and Ma, W.-Y. 2006b. Annosearch: Image auto-annotation by search. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

Digital Library

[26]

Wang, X.-J., Zhang, L., Li, X., and Ma, W.-Y. 2008. Annotating images by mining image search results. IEEE Trans. Patt. Anal. Mach. Intell. 30, 11, 1919--1932.

Digital Library

[27]

Wright, J., Yang, A., Ganesh, A., Sastry, S., and Ma, Y. 2009. Robust face recognition via sparse representation. IEEE Trans. Patt. Anal. Mach. Intell. 31, 2 (Feb.), 210--227.

Digital Library

[28]

Zhou, D., Bousquet, O., Lal, T. N., Weston, J., and Scholkopf, B. 2003. Learning with local and global consistency. In Proceedings of the 17th Annual Conference on Neural Information Processing Systems.

[29]

Zhu, X. 2005. Semi-Supervised Learning with Graphs. Ph.D. dissertation, Carnegie Mellon University.

Digital Library

[30]

Zhu, X., Ghahramani, Z., and Lafferty, J. 2003. Semi-supervised learning using gaussian fields and harmonic function. In Proceedings of the 20th International Conference on Machine Learning.

Cited By

Hasan MYu NKhan Mirani I(2024)Efficient Dual-Attention-Based Knowledge Distillation Network for Unsupervised Wafer Map Anomaly DetectionIEEE Transactions on Semiconductor Manufacturing10.1109/TSM.2024.341605537:3(293-303)Online publication date: Aug-2024
https://doi.org/10.1109/TSM.2024.3416055
Li YLi YZhu XFang HYe L(2024)A method for extracting buildings from remote sensing images based on 3DJA-UNet3+Scientific Reports10.1038/s41598-024-70019-z14:1Online publication date: 17-Aug-2024
https://doi.org/10.1038/s41598-024-70019-z
Lei JShu CXu QYu YYang S(2024)FCPFNet: Feature Complementation Network with Pyramid Fusion for Semantic SegmentationNeural Processing Letters10.1007/s11063-024-11464-956:2Online publication date: 20-Feb-2024
https://doi.org/10.1007/s11063-024-11464-9
Show More Cited By

Index Terms

Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Scene understanding
2. Information systems
  1. Information retrieval
    1. Document representation
    2. Search engine architectures and scalability
      1. Search engine indexing

Recommendations

Inferring semantic concepts from community-contributed images and noisy tags
MM '09: Proceedings of the 17th ACM international conference on Multimedia

In this paper, we exploit the problem of inferring images' semantic concepts from community-contributed images and their associated noisy tags. To infer the concepts more accurately, we propose a novel sparse graph-based semi-supervised learning ...
Semi-supervised partial label learning algorithm via reliable label propagation
Abstract
Partial label learning (PLL) is a weakly supervised learning method that is able to predict one label as the correct answer from a given candidate label set. In PLL, when all possible candidate labels are as signed to real-world training examples, ...
Transductive Multilabel Learning via Label Set Propagation

The problem of multilabel classification has attracted great interest in the last decade, where each instance can be assigned with a set of multiple class labels simultaneously. It has a wide variety of real-world applications, e.g., automatic image ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology

ACM Transactions on Intelligent Systems and Technology Volume 2, Issue 2

February 2011

175 pages

ISSN:2157-6904

EISSN:2157-6912

DOI:10.1145/1899412

Issue’s Table of Contents

Copyright © 2011 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 February 2011

Accepted: 01 August 2010

Revised: 01 June 2010

Received: 01 February 2010

Published in TIST Volume 2, Issue 2

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Funding Sources

National Research Foundation-Prime Minister's office, Republic of Singapore

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

197
Total Citations
View Citations
1,455
Total Downloads

Downloads (Last 12 months)25
Downloads (Last 6 weeks)2

Reflects downloads up to 02 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Hasan MYu NKhan Mirani I(2024)Efficient Dual-Attention-Based Knowledge Distillation Network for Unsupervised Wafer Map Anomaly DetectionIEEE Transactions on Semiconductor Manufacturing10.1109/TSM.2024.341605537:3(293-303)Online publication date: Aug-2024
https://doi.org/10.1109/TSM.2024.3416055
Li YLi YZhu XFang HYe L(2024)A method for extracting buildings from remote sensing images based on 3DJA-UNet3+Scientific Reports10.1038/s41598-024-70019-z14:1Online publication date: 17-Aug-2024
https://doi.org/10.1038/s41598-024-70019-z
Lei JShu CXu QYu YYang S(2024)FCPFNet: Feature Complementation Network with Pyramid Fusion for Semantic SegmentationNeural Processing Letters10.1007/s11063-024-11464-956:2Online publication date: 20-Feb-2024
https://doi.org/10.1007/s11063-024-11464-9
Shu WCao DQian WLi S(2024)Neighborhood relation-based incremental label propagation algorithm for partially labeled hybrid dataMachine Learning10.1007/s10994-024-06560-9Online publication date: 19-Jun-2024
https://doi.org/10.1007/s10994-024-06560-9
Peng KPeng YLiao HYang ZFeng W(2023)Automated bone marrow cell classification through dual attention gates dense neural networksJournal of Cancer Research and Clinical Oncology10.1007/s00432-023-05384-9149:19(16971-16981)Online publication date: 23-Sep-2023
https://doi.org/10.1007/s00432-023-05384-9
Zhou GLu BHu XNi T(2022)Sparse Representation-Based Discriminative Metric Learning for Brain MRI Image RetrievalFrontiers in Neuroscience10.3389/fnins.2021.82904015Online publication date: 14-Jan-2022
https://doi.org/10.3389/fnins.2021.829040
Rijati NPurwitasar DSumpeno SPurnomo M(2022)A Rule-Generation Model for Class Imbalances to Detect Student Entrepreneurship Based on the Theory of Planned BehaviorCybernetics and Information Technologies10.2478/cait-2022-002322:2(160-178)Online publication date: 1-Jun-2022
https://dl.acm.org/doi/10.2478/cait-2022-0023
Li ZTang J(2022)A survey on social image semantic analysisChinese Science Bulletin10.1360/TB-2022-093868:25(3368-3384)Online publication date: 11-Nov-2022
https://doi.org/10.1360/TB-2022-0938
Wu HDong BDing LDong Y(2022)Attention Feature Pyramid Network for Scene Text Detection2022 IEEE 8th International Conference on Computer and Communications (ICCC)10.1109/ICCC56324.2022.10065815(1726-1731)Online publication date: 9-Dec-2022
https://doi.org/10.1109/ICCC56324.2022.10065815
Ren DWu ZLi JYu PGuo JWei MGuo Y(2022)Point attention network for point cloud semantic segmentationScience China Information Sciences10.1007/s11432-021-3387-765:9Online publication date: 29-Aug-2022
https://doi.org/10.1007/s11432-021-3387-7
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Issue’s Table of Contents