
A personal look back at twenty years of research in multimedia content analysis

Published: 17 October 2013

Abstract

This paper is a personal look back at twenty years of research in multimedia content analysis. It addresses the areas of audio, photo and video analysis for the purpose of indexing and retrieval from the perspective of a multimedia researcher. Whereas a general analysis of content is impossible due to the personal bias of the user, significant progress was made in the recognition of specific objects or events. The paper concludes with a brief outlook on the future.

Published In

ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 9, Issue 1s
Special Sections on the 20th Anniversary of ACM International Conference on Multimedia, Best Papers of ACM Multimedia 2012
October 2013
218 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/2523001
Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2013
Accepted: 01 June 2013
Revised: 01 June 2013
Received: 01 May 2013
Published in TOMM Volume 9, Issue 1s

Author Tags

  1. music analysis
  2. brain-computer interface
  3. face recognition
  4. multimedia content analysis
  5. query by example
  6. text recognition
  7. user feedback
  8. video analysis

Qualifiers

  • Research-article
  • Research
  • Refereed
