Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images

Published: 24 February 2011 Publication History

Abstract

In this article, we exploit the problem of annotating a large-scale image corpus by label propagation over noisily tagged web images. To annotate the images more accurately, we propose a novel kNN-sparse graph-based semi-supervised learning approach for harnessing the labeled and unlabeled data simultaneously. The sparse graph constructed by datum-wise one-vs-kNN sparse reconstructions of all samples can remove most of the semantically unrelated links among the data, and thus it is more robust and discriminative than the conventional graphs. Meanwhile, we apply the approximate k nearest neighbors to accelerate the sparse graph construction without loosing its effectiveness. More importantly, we propose an effective training label refinement strategy within this graph-based learning framework to handle the noise in the training labels, by bringing in a dual regularization for both the quantity and sparsity of the noise. We conduct extensive experiments on a real-world image database consisting of 55,615 Flickr images and noisily tagged training labels. The results demonstrate both the effectiveness and efficiency of the proposed approach and its capability to deal with the noise in the training labels.

References

[1]
Belkin, M. and Niyogi, P. 2003. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput.
[2]
Chang, C.-C. and Lin, C.-J. 2001. LIBSVM: A library for support vector machines. http://www.csie.ntu.edu.tw/&ctilde;jlin/libsvm.
[3]
Chapelle, O., Zien, A., and Scholkopf, B. 2006. Semi-Supervised Learning. MIT Press.
[4]
Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., and Zheng, Y.-T. 2009. NUS-WIDE: A real-world web image database from national university of singapore. In Proceedings of the ACM Conference on Image and Video Retrieval.
[5]
Donoho, D. L. 2006. For most large underdetermined systems of linear equations the minimal ℓ1-norm solution is also the sparsest solution. Comm. Pure Appl. Math. 59, 6, 797--829.
[6]
Duda, R., Stork, D., and Hart, P. 2000. Pattern Classification. J. Wiley.
[7]
Fergus, R., Fei-Fei, L., Perona, P., and Zisserman, A. 2005. Learning object categories from google's image search. In Proceedings of the IEEE International Conference on Computer Vision.
[8]
Goh, K.-S., Chang, E. Y., and Lai, W.-C. 2004. Multimodal concept-dependent active learning for image retrieval. In Proceedings of the 12th Annual ACM International Conference on Multimedia. 564--571.
[9]
He, J., Li, M., Zhang, H.-J., Tong, H., and Zhang, C. 2004. Manifold-ranking based image retrieval. In Proceedings of the 12th Annual ACM International Conference on Multimedia.
[10]
ℓ1 MAGIC. http://www.acm.caltech.edu/l1magic/.
[11]
Li, X., Chen, L., Zhang, L., Lin, F., and Ma, W.-Y. 2006. Image annotation by large-scale content-based image retrieval. In Proceedings of the 14th Annual ACM International Conference on Multimedia.
[12]
Mount, D. and Arya, S. 1997. Ann: A library for approximate nearest neighbor searching. In Proceedings of the CGC 2nd Annual Fall Workship on Computational Geometry.
[13]
Rao, R., Olshausen, B., and Lewicki, M. 2002. Probabilistic Models of the Brain: Perception and Neural Function. MIT Press.
[14]
Roweis, S. T. and Saul, L. K. 2000. Nonlinear dimensionality reduction by locally linear embedding. Science 290, 2323--2326.
[15]
Saad, Y. 2003. Iterative Methods for Sparse Linear Systems 2nd Ed. Society for Industrial and Applied Mathematics.
[16]
Saad, Y. and Schultz, M. 1986. GMRES: A generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM J. Sci. Stat. Comp. 7, 856--869.
[17]
Sun, Y., Shimada, S., Taniguchi, Y., and Kojima, A. 2008. A novel region-based approach to visual concept modeling using web images. In Proceedings of the 16th ACM International Conference on Multimedia.
[18]
Tang, J., Hua, X.-S., Qi, G.-J., Wang, M., Mei, T., and Wu, X. 2007. Structure-sensitive manifold ranking for video concept detection. In Proceedings of the 15th ACM International Conference on Multimedia.
[19]
Tang, J., Hua, X.-S., Song, Y., Qi, G.-J., and Wu, X. 2008. Video annotation based on kernel linear neighborhood propagation. IEEE Trans. Multimedia 10, 4.
[20]
Tang, J., Yan, S., Hong, R., Qi, G.-J., and Chua, T.-S. 2009. Inferring semantic concepts from community-contributed images and noisy tags. In Proceedings of the 17th ACM International Conference on Multimedia. 223--232.
[21]
Torralba, A., Fergus, R., and Freeman, W. 2008. 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Trans. Patt. Anal. Mach. Intell. 30, 11.
[22]
TREC. Trec-10 proceedings appendix on common evaluation measures. http://trec.nist.gov/pubs/trec10/ appendices/measures.pdf.
[23]
Wang, C., Jing, F., Zhang, L., and Zhang, H.-J. 2006a. Image annotation refinement using random walk with restarts. In Proceedings of the 14th ACM International Conference on Multimedia.
[24]
Wang, F. and Zhang, C. 2008. Label propagation through linear neighborhoods. IEEE Trans. Knowl. Data Engin. 20, 1, 55--67.
[25]
Wang, X.-J., Zhang, L., Jing, F., and Ma, W.-Y. 2006b. Annosearch: Image auto-annotation by search. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
[26]
Wang, X.-J., Zhang, L., Li, X., and Ma, W.-Y. 2008. Annotating images by mining image search results. IEEE Trans. Patt. Anal. Mach. Intell. 30, 11, 1919--1932.
[27]
Wright, J., Yang, A., Ganesh, A., Sastry, S., and Ma, Y. 2009. Robust face recognition via sparse representation. IEEE Trans. Patt. Anal. Mach. Intell. 31, 2 (Feb.), 210--227.
[28]
Zhou, D., Bousquet, O., Lal, T. N., Weston, J., and Scholkopf, B. 2003. Learning with local and global consistency. In Proceedings of the 17th Annual Conference on Neural Information Processing Systems.
[29]
Zhu, X. 2005. Semi-Supervised Learning with Graphs. Ph.D. dissertation, Carnegie Mellon University.
[30]
Zhu, X., Ghahramani, Z., and Lafferty, J. 2003. Semi-supervised learning using gaussian fields and harmonic function. In Proceedings of the 20th International Conference on Machine Learning.

Cited By

View all
  • (2024)Efficient Dual-Attention-Based Knowledge Distillation Network for Unsupervised Wafer Map Anomaly DetectionIEEE Transactions on Semiconductor Manufacturing10.1109/TSM.2024.341605537:3(293-303)Online publication date: Aug-2024
  • (2024)A method for extracting buildings from remote sensing images based on 3DJA-UNet3+Scientific Reports10.1038/s41598-024-70019-z14:1Online publication date: 17-Aug-2024
  • (2024)FCPFNet: Feature Complementation Network with Pyramid Fusion for Semantic SegmentationNeural Processing Letters10.1007/s11063-024-11464-956:2Online publication date: 20-Feb-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology
ACM Transactions on Intelligent Systems and Technology  Volume 2, Issue 2
February 2011
175 pages
ISSN:2157-6904
EISSN:2157-6912
DOI:10.1145/1899412
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 February 2011
Accepted: 01 August 2010
Revised: 01 June 2010
Received: 01 February 2010
Published in TIST Volume 2, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. kNN
  2. Sparse graph
  3. label propagation
  4. noisy tags
  5. semi-supervised learning
  6. web image

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)24
  • Downloads (Last 6 weeks)1
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Efficient Dual-Attention-Based Knowledge Distillation Network for Unsupervised Wafer Map Anomaly DetectionIEEE Transactions on Semiconductor Manufacturing10.1109/TSM.2024.341605537:3(293-303)Online publication date: Aug-2024
  • (2024)A method for extracting buildings from remote sensing images based on 3DJA-UNet3+Scientific Reports10.1038/s41598-024-70019-z14:1Online publication date: 17-Aug-2024
  • (2024)FCPFNet: Feature Complementation Network with Pyramid Fusion for Semantic SegmentationNeural Processing Letters10.1007/s11063-024-11464-956:2Online publication date: 20-Feb-2024
  • (2024)Neighborhood relation-based incremental label propagation algorithm for partially labeled hybrid dataMachine Language10.1007/s10994-024-06560-9113:9(6293-6339)Online publication date: 1-Sep-2024
  • (2023)Automated bone marrow cell classification through dual attention gates dense neural networksJournal of Cancer Research and Clinical Oncology10.1007/s00432-023-05384-9149:19(16971-16981)Online publication date: 23-Sep-2023
  • (2022)Sparse Representation-Based Discriminative Metric Learning for Brain MRI Image RetrievalFrontiers in Neuroscience10.3389/fnins.2021.82904015Online publication date: 14-Jan-2022
  • (2022)A Rule-Generation Model for Class Imbalances to Detect Student Entrepreneurship Based on the Theory of Planned BehaviorCybernetics and Information Technologies10.2478/cait-2022-002322:2(160-178)Online publication date: 1-Jun-2022
  • (2022)A survey on social image semantic analysisChinese Science Bulletin10.1360/TB-2022-093868:25(3368-3384)Online publication date: 11-Nov-2022
  • (2022)Attention Feature Pyramid Network for Scene Text Detection2022 IEEE 8th International Conference on Computer and Communications (ICCC)10.1109/ICCC56324.2022.10065815(1726-1731)Online publication date: 9-Dec-2022
  • (2022)Point attention network for point cloud semantic segmentationScience China Information Sciences10.1007/s11432-021-3387-765:9Online publication date: 29-Aug-2022
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media