Article

Efficient semantic annotation method for indexing large personal video database

Authors:

Xian-Sheng Hua,

Hong-Jiang Zhang

MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval

Pages 289 - 296

https://doi.org/10.1145/1178677.1178716

Published: 26 October 2006

Abstract

Because of the large gap between high-level semantics and low-level features, it is difficult to obtain high-accuracy video semantic annotation automatically through general statistical learning methods. In this paper, we propose a novel annotation framework, based on active learning and a semi-supervised ensemble method, that is specially designed for personal video databases. To annotate a home video database efficiently, an initial training set is first carefully constructed based on a distribution analysis of the entire video dataset. Then, both a semi-supervised ensemble-based method and an active-learning-based method are proposed, which aim to minimize a margin cost function of the ensemble in order to ensure its generalization capacity. Experimental results on about 50 hours of home videos show that the proposed method outperforms both existing semi-supervised learning algorithms and general active learning algorithms in terms of annotation accuracy and performance stability.
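
The abstract does not give the form of the margin cost function or the sample selection rule, so the following is only a minimal illustrative sketch of the general idea, not the authors' algorithm. It assumes a weighted-vote ensemble of binary classifiers, an exponential margin cost (a common choice in boosting), a pseudo-margin |F(x)| for unlabeled samples, and a selection rule that queries the unlabeled samples with the smallest pseudo-margin. All function names (`ensemble_score`, `margin_cost`, `select_queries`, `stump`) and the toy data are hypothetical.

```python
import math
from typing import Callable, List, Sequence, Tuple

# Assumption: a weak classifier maps a feature vector to a label in {-1, +1}.
Classifier = Callable[[Sequence[float]], int]
Member = Tuple[float, Classifier]  # (vote weight, classifier)


def ensemble_score(x: Sequence[float], members: List[Member]) -> float:
    """Normalized weighted vote F(x), which lies in [-1, 1]."""
    total = sum(w for w, _ in members)
    return sum(w * h(x) for w, h in members) / total


def margin_cost(labeled, unlabeled, members: List[Member]) -> float:
    """Exponential margin cost (an assumed cost function).

    Labeled samples contribute exp(-y * F(x)); unlabeled samples
    contribute exp(-|F(x)|), using |F(x)| as a pseudo-margin.
    """
    cost = sum(math.exp(-y * ensemble_score(x, members)) for x, y in labeled)
    cost += sum(math.exp(-abs(ensemble_score(x, members))) for x in unlabeled)
    return cost


def select_queries(unlabeled, members: List[Member], k: int) -> List[int]:
    """Indices of the k unlabeled samples with the smallest |F(x)|."""
    ranked = sorted(
        range(len(unlabeled)),
        key=lambda i: abs(ensemble_score(unlabeled[i], members)),
    )
    return ranked[:k]


def stump(dim: int, threshold: float) -> Classifier:
    """Hypothetical weak classifier: threshold test on one feature."""
    return lambda x: 1 if x[dim] > threshold else -1


# Toy data: three equally weighted stumps and three unlabeled samples.
members = [(1.0, stump(0, 0.5)), (1.0, stump(1, 0.5)), (1.0, stump(0, 0.2))]
unlabeled = [(0.9, 0.9), (0.3, 0.1), (0.1, 0.1)]

# The stumps disagree only on the second sample, so it is queried first.
queries = select_queries(unlabeled, members, k=1)
print(queries)
```

In this sketch the sample on which the ensemble members disagree most has the smallest pseudo-margin and therefore the largest contribution to the cost, which is why it is the one sent to the annotator.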

Index Terms

Efficient semantic annotation method for indexing large personal video database
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Video summarization
2. Information systems
  1. Information retrieval
    1. Document representation
    2. Search engine architectures and scalability
      1. Search engine indexing

Published In

MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval

October 2006

344 pages

ISBN:1595934952

DOI:10.1145/1178677

General Chairs:
James Z. Wang (The Pennsylvania State University),
Nozha Boujemaa (INRIA Rocquencourt, France),
Yixin Chen (The University of Mississippi)

Copyright © 2006 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Article

Conference

MM06: The 14th ACM International Conference on Multimedia 2006

October 26 - 27, 2006

Santa Barbara, California, USA

Bibliometrics

Total Citations: 6
Total Downloads: 434
Downloads (Last 12 months): 0
Downloads (Last 6 weeks): 0

Reflects downloads up to 16 Oct 2024

Cited By

Haloi P, Bhuyan M (2021). Video Searching and Retrieval using Scene Classification in Multimedia Databases. 2021 2nd International Conference for Emerging Technology (INCET), pp. 1-7. Online publication date: 21-May-2021
https://doi.org/10.1109/INCET51464.2021.9456317
Hu W, Xie N, Li, Zeng X, Maybank S (2018). A Survey on Visual Content-Based Video Indexing and Retrieval. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 41:6, pp. 797-819. Online publication date: 25-Dec-2018
https://dl.acm.org/doi/10.1109/TSMCC.2011.2109710
Tang J, Hua X (2018). Typicality ranking. Multimedia Tools and Applications 70:2, pp. 647-660. Online publication date: 31-Dec-2018
https://dl.acm.org/doi/10.1007/s11042-011-0892-0
Meng Wang, Xian-Sheng Hua, Richang Hong, Jinhui Tang, Guo-Jun Qi, Yan Song (2009). Unified Video Annotation via Multigraph Learning. IEEE Transactions on Circuits and Systems for Video Technology 19:5, pp. 733-746. Online publication date: May-2009
https://doi.org/10.1109/TCSVT.2009.2017400
https://dl.acm.org/doi/10.5555/1641661.1641671
Tang J, Qi G, Wang M, Hua X (2009). Video semantic analysis based on structure-sensitive anisotropic manifold ranking. Signal Processing 89:12, pp. 2313-2323. Online publication date: 1-Dec-2009
https://dl.acm.org/doi/10.1016/j.sigpro.2009.01.020
