Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Maximum entropy model-based baseball highlight detection and classification

Published: 01 November 2004 Publication History

Abstract

In this paper, we propose a novel system that is able to automatically detect and classify baseball highlights by seamlessly integrating image, audio, and speech clues using a unique framework based on maximum entropy model (MEM). What distinguishes our system is that we emphasize on the integration of multimedia features and the acquisition of domain knowledge through automatic machine learning processes. Our MEM-based framework provides a simple platform capable of integrating multimedia features as well as their contextual information in a uniform fashion. Unlike the Hidden Markov Model and the Bayes Network-based approaches, this framework does not need to explicitly segment and classify the input video into states during its data modeling process, and hence remarkably simplifies the training data creation and the highlight detection/classification tasks. Experimental evaluations demonstrate the superiority of our proposed baseball highlight detection system in its capability of detecting more major baseball highlights, and in its overall accuracy of detecting these major highlights.

References

[1]
{1} Y. Gong, L.T. Sin, C.H. Chuan, H. Zhang, M. Sakauchi, Automatic parsing of tv soccer programs, in IEEE International Conference on Multimedia Computing and Systems, (1995) 167-174.
[2]
{2} Y. Rui, A. Gupta, A. Acero, Automatically extracting highlights for tv baseball programs, in Eighth ACM International Conference on Multimedia, (2000) 105-115.
[3]
{3} P. Xu, L. Xie, S.F. Chang, A. Divakaran, A. Vetro, H. Sun, Algorithms and system for segmentation and structure analysis in soccer video, in IEEE Conference on Multimedia and Expo, (2001) 928-931.
[4]
{4} D. Zhong, S.F. Chang, Structure analysis of sports video using domain models, in IEEE Conference on Multimedia and Expo, (2001) 920-923.
[5]
{5} N. Babaguchi, Towards abstracting sports video by highlights, in IEEE Conference on Multimedia and Expo, (2000) 1519-1522.
[6]
{6} A. Bonzanini, R. Leonardi, P. Migliorati, Event recognition in sport programs using low-level motion indices, in IEEE Conference on Multimedia and Expo, (2001) 1208-1211.
[7]
{7} T. Syeda-Mahmood, S. Srinivasan, Detecting topical events in digital video, in Proc. ACM Multimedia, (Los Angeles), (2000) 85-94.
[8]
{8} M. Naphade, T. Huang, Semantic filtering of video content, in Proc. SPIE, Storage and Retrieval for Media Databases, (San Jose), 2001.
[9]
{9} S. Intille, A. Bobick, Representation and visual recognition of complex, multi-agent actions using belief networks, Tech. Rep. 454, MIT Media Lab., Jan. 1998.
[10]
{10} Y.L. Chang, W. Zeng, I. Kamel, R. Alonso, Integrated image and speech analysis for content-based video indexing, in International Conference on Multimedia Communication and Systems, (1996) 306-313.
[11]
{11} S. Srinivasan et al, Query expansion for imperfect speech: applications in distributed learning, in Proc. IEEE Workshop on Content-Based Access of Image and Video Libraries, 2000.
[12]
{12} A.L. Berger, S.A.D. Pietra, V.J.D. Pietra, A maximum entropy approach to natural language processing, Comput. Linguistics 22 (1) (1996) 1-36.
[13]
{13} J.N. Darroch, D. Ratcliff, Generalized iterative scaling for log-linear models, Ann. Math. Stat. 43 (1972) 1470-1480.
[14]
{14} Y.T. Tse, R.L. Baker, Camera zoom/pan estimation and compensation for video compression, Image Process. Algorithms Tech. 1452 (1991) 468-479.
[15]
{15} R. Szeliski, H. Shum, Creating full view panoramic image mosaics and texture-mapped models, in SIGGRAPH97 (1997) 251-258.
[16]
{16} M. Fischler, R. Bolles, Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography, CACM 24 (1981) 381-395.
[17]
{17} A. Dempster, N. Laird, D. Rubin, Maximum-likelihood from incomplete data via em algorithm, R. Star. Soc. Ser. B 39 (1977).
[18]
{18} G. Schwarts, Estimating the dimension of a model, Ann. Stat. 6 (1990) 461-464.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Computer Vision and Image Understanding
Computer Vision and Image Understanding  Volume 96, Issue 2
Special issue on event detection in video
November 2004
172 pages

Publisher

Elsevier Science Inc.

United States

Publication History

Published: 01 November 2004

Author Tags

  1. baseball highlight detection
  2. event detection
  3. machine learning
  4. maximum entropy method
  5. video content analysis

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Incorporating frequent pattern analysis into multimodal HMM event classification for baseball videosMultimedia Tools and Applications10.1007/s11042-015-2447-275:9(4913-4932)Online publication date: 1-May-2016
  • (2013)A spatio-temporal pyramid matching for video retrievalComputer Vision and Image Understanding10.1016/j.cviu.2013.02.003117:6(660-669)Online publication date: 1-Jun-2013
  • (2012)Ice hockey shooting event modeling with mixture hidden Markov modelMultimedia Tools and Applications10.1007/s11042-010-0722-957:1(131-144)Online publication date: 1-Mar-2012
  • (2010)Sports Information Retrieval for Video AnnotationInternational Journal of Digital Library Systems10.4018/jdls.20101027041:1(62-88)Online publication date: 1-Jan-2010
  • (2009)Ice hockey shot event modeling with mixture hidden Markov modelProceedings of the 1st ACM international workshop on Events in multimedia10.1145/1631024.1631031(25-32)Online publication date: 23-Oct-2009
  • (2009)Using continuous features in the maximum entropy modelPattern Recognition Letters10.1016/j.patrec.2009.06.00530:14(1295-1300)Online publication date: 1-Oct-2009
  • (2008)Automatic score scene detection for baseball videoProceedings of the 3rd international conference on Large-scale knowledge resources: construction and application10.5555/1787800.1787825(226-240)Online publication date: 3-Mar-2008
  • (2008)Spatio-temporal pyramid matching for sports videosProceedings of the 1st ACM international conference on Multimedia information retrieval10.1145/1460096.1460144(291-297)Online publication date: 30-Oct-2008
  • (2008)Real-time multiview analysis of soccer matches for understanding interactions between ball and playersProceedings of the 2008 international conference on Content-based image and video retrieval10.1145/1386352.1386419(525-534)Online publication date: 7-Jul-2008
  • (2007)Situated models of meaning for sports video retrievalHuman Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers10.5555/1614108.1614118(37-40)Online publication date: 22-Apr-2007
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media