Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1178677.1178716acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Efficient semantic annotation method for indexing large personal video database

Published: 26 October 2006 Publication History

Abstract

As there is a large gap between high-level semantics and low-level features, it is difficult to automatically obtain high-accuracy video semantic annotation through general statistical learning based methods. In this paper, we propose a novel annotation framework based on active learning and semi-supervised ensemble method, which is specially designed for personal video database. To efficiently annotate the home video database, an initial training set is first elaborately constructed based on the distribution analysis of the entire video dataset. Then, both a semi-supervised ensemble based method and an active learning based method are proposed, which aims at minimizing a margin cost function of ensemble to ensure the generalization capacity. The experiment results on about 50 hours home videos show that the proposed method performs superior to both existing semi-supervised learning algorithms and the general active learning algorithms in terms of annotation accuracy and performance stability.

References

[1]
Jun Wu, Xian-Sheng Hua, Hong-Jiang Zhang, An Online-Optimized Incremental Learning Framework for Video Semantic Classification. ACM Multimedia, 2004
[2]
D'Alché Buc, F., Grandvalet, Y. and Ambroise, C. Semi-supervised marginboost. Advances in Neural Information Processing Systems MIT Press, 2002.
[3]
K.P. Bennett, A. Demiriz and R. Maclin, Exploiting unlabeled Data in Ensemble Methods. SIGKDD'02 2002.
[4]
A. Blum, T.M., Combining labeled and unlabeled data with co-training. Proceedings of the Workshop on Computational Learning Theory, 1998.
[5]
K. P. Bennett and A.Demiriz, Semi-supervised support vector machine, Advance in Neural Information Processing Systems 11, MIT Press, 1999 page 368--374
[6]
Y. Freund and R. E. Schapire. Experiments with a new boosting algorithm. In International Conference on Machine Learning, pages 148--156, 1996.
[7]
L. Mason, J. Baxter, P. Bartlett and M. Frean. Boosting algorithms as gradient descent. Advances in Neural Information Processing Systems 12, pages 512--518. MIT Press, 2000.
[8]
J Magalhães and S Rüger, Mining Multimedia Salient Concepts for Incremental Information Extraction. Proc 28th Int'l ACM Information Retrieval Conf (SIGIR, Salvador, Brazil, Aug 2005), pp 641--642, 2005
[9]
Cohn, D. A., Ghahramani, Z., & Jordan, M. I. (1996). Active learning with statistical models. Journal of Artificial Intelligence research, 4, 129--145.
[10]
Tong, S.K., D., Support vector machine active learning with applications to text classification. Journal of Machine Learning Research, 2001: p. 45--66.
[11]
Campbell, C., Cristianini, N. Smola, A. Query learning with large margin classifiers. Proc. 17th International Conf. on Machine Learning, 2000, pp. 111--118
[12]
King-SHy Goh and E. Y. Chang., Multimodal concept-dependent active learning for image retrieval. ACM MM, 2004.
[13]
Yan Song, Xian-Sheng Hua, Li-Rong Dai, Meng Wang. Semi-Automatic Video Annotation Based on Active Learning with Multiple Complementary Predictors. 7th ACM SIGMM International Workshop on Multimedia Information Retrieval. Singapore. Nov 10-11, 2005.
[14]
M. Yeung, B.L.Y., Segmentation of video by clustering and Graph Analysis. Computer Vision and Image understanding, 1998.
[15]
Yan Song, Xian-Sheng Hua, Li-Rong Dai, Ren-Hua Wang, Semi-Automatic Video Semantic Annotation Based on Active Learning. VCIP, 2005
[16]
R. E. Schapire, The Boosting Approach to Machine Learning An Overview, MSRI Workshop on Nonlinear Estimation and Classification, 2002
[17]
R. Collobert, S. Bengio, "SVMTorch: Support vector machines for large-scale regression problems". Journal of Mach. Learning, pp 143--160. 2001.
[18]
Guidelines for the TRECVID 2003 Evaluation http://wwwnlpir.nist.gov/projects/tv2003/tv2003.html
[19]
Whitley, D. A Genetic Algorithm Tutorial. Statistics and Computing, Vol. 4, 64--85, 1994.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval
October 2006
344 pages
ISBN:1595934952
DOI:10.1145/1178677
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 October 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. active learning
  2. semi-supervised ensemble
  3. video annotation
  4. video structure analysis

Qualifiers

  • Article

Conference

MM06
MM06: The 14th ACM International Conference on Multimedia 2006
October 26 - 27, 2006
California, Santa Barbara, USA

Upcoming Conference

MM '24
The 32nd ACM International Conference on Multimedia
October 28 - November 1, 2024
Melbourne , VIC , Australia

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Video Searching and Retrieval using Scene Classification in Multimedia Databases.2021 2nd International Conference for Emerging Technology (INCET)10.1109/INCET51464.2021.9456317(1-7)Online publication date: 21-May-2021
  • (2018)A Survey on Visual Content-Based Video Indexing and RetrievalIEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews10.1109/TSMCC.2011.210971041:6(797-819)Online publication date: 25-Dec-2018
  • (2018)Typicality rankingMultimedia Tools and Applications10.1007/s11042-011-0892-070:2(647-660)Online publication date: 31-Dec-2018
  • (2009)Unified video annotation via multigraph learningIEEE Transactions on Circuits and Systems for Video Technology10.5555/1641661.164167119:5(733-746)Online publication date: 1-May-2009
  • (2009)Unified Video Annotation via Multigraph LearningIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2009.201740019:5(733-746)Online publication date: May-2009
  • (2009)Video semantic analysis based on structure-sensitive anisotropic manifold rankingSignal Processing10.1016/j.sigpro.2009.01.02089:12(2313-2323)Online publication date: 1-Dec-2009

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media