Abstract
Two basic requirements for a digital video library to be “browsable” are a precisely indexed content and informative abstracts. Nowadays such solutions are not common in video search engines or generic digital video platforms, therefore, the authors suggest developing some computer applications resolving the problems of at least abstracts’ creation. The abstracts cannot be constructed without a deep video content analysis, including some low level processing like a shot detection towards a video sequence segmented to a series of “camera takes”. The presented method, aimed at a shot detection, deploys a concept of a Motion Factor (of frame transitions). The basic definition considers the motion factor as a very sudden peak of difference between two successive frames. In some specific areas, the intrashot motion factor may suppress the shot-boundary motion factor. In order to avoid misrecognition of both motion factors during a shot detection process a concept of a differential motion factor was implemented. The full-resolution algorithm achieves the accuracy of up to 80%, however, it is very time-consuming. The shot detection accuracy was measured including true and false shots detected as well as real shots that were bounded visually. The authors’ research of a representative number of movies (from various categories) has revealed that the shot detection process can be accelerated up to 500 times without any significant deterioration of shot recognition accuracy. The shot detection algorithm was accelerated in a simple manner by two-dimensional reduction of a frame resolution (in pixels).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
J. R. Smith, “Digital Video Libraries and the Internet”, IEEE Communications Magazine, January 1999, pp. 92–97.
B.-L. Ye, B. Liu, “Unified approach to temporal segmentation...” Proc. 2nd Int. Conf. Multimedia Computing and Systems May 1995.
“Office Automation”, http://www.irisusa.com/Support, 2000.
P. Kruizinga, “The Face Recognition Home Page”, University of Groningen-http://www.cs.rug.nl/~peterkr, 2000.
“Face Recognition Technology”, Visionicshttp://www.visionics.com/, 2000.
J. R. Smith and S.-F. Chang, “Searching for Images and Videos on the World-Wide Web”, CU/CTR Technical Report 459-96-25, 1996.
J. R. Smith and S.-F. Chang, “An Image and Video Search Engine for the World-Wide Web”, Proc. EI’1997, San Jose, CA, February 1997.
B.-L. Yeo and B. Liu, “On the Extraction of DC Sequence...” Proc. Int. Conf. Image Processing, October 1995.
M. Leszczuk, Z. Papir, “Integration of a Voice Recognition-based Indexing with Multimedia Applications”, Proc. PROMS’ 2000, Krakow, Poland, Oct. 2000, pp. 375–381.
M. Leszczuk, Z. Papir, “Developing of Digital Video Libraries Indexed by a Speech...” AI’2001, Innsbruck, Austria, February 2001, pp. 107–113.
M. Leszczuk, Z. Papir,“Developing Digital Video Libraries Indexed by a Speech Recognition Engine”, ICIMADE’2001, Fargo-ND, USA, June 2001.
J. Mitchell, W. Pennebaker, C. Fogg, D. J. LeGall, “MPEG video compression standard”, International Thomson Publishing, New York, 1996, p. 58.
M. Krunz, S. Tripathi, “Scene-based characterization of VBR MPEG-compressed video traffic”, Proc. of ACM SIGMETRICS’1997, 1997.
O. Rose, “Simple and efficient models for variable bit rate MPEG video traffic”, Performance Evaluation, vol. 30, pp. 69–85, July 1997.
J. Roberts, U. Mocci, J. Virtamo, “Broadband network teletraffic”, Springer-Verlag, Berlin 1996, pp. 20–25.
Y. Sang-Jo, K. Seong-Dae, “Traffic Modelling and QoS Prediction for MPEG-Coded Video Services over ATM Networks Using Scene...“, Journal of High-Speed Networks”, vol. 8, no. 3, 1999, pp. 211–224.
A. Mashat, M. Kara, “Performance Evaluation of a Scene-based Model...”, “System Performance Evaluation...” CRC Press, 2000, pp. 123–142.
D. Heyman, A. Tabatabai, T. Lakshman, “Statistical Analysis of MPEG-2 Coded VBR Video Traffic”, sixth Int. Workshop on Packet Video, September 1994.
D. P. Heyman, T. Lakshman, “Source models for VBR broadcast-video traffic”, IEEE/ACM Transactions on Networking, vol. 4, no. 1, February 1996.
M. F. Scheffer, K. Wajda, J. S. Kunicki, “Fuzzy Logic Adaptive Traffic Enforcement Mechanisms for ATM Networks”, Proc. of ITC, Pretoria, South Africa, 1995.
A. Winter, “Video Capture Software and Scene Detection-Scenalyzer”, http://www.scenalyzer.com/, 2001-08-02
MGI Software Corp, “Video Wave”, http://www.mgisoft.com/products/vw, 2001
IBM, “DB2 Video Extender”, http://www-4.ibm.com/software/data/db2
M. Leszczuk, P. Pacyna, Z. Papir, “Video Content Streaming Service Using IP/RSVP Protocol Stack”, IEEE Workshop on Internet Applications WIAPP’99, 7, 1999
L. Sanghoon, M.S. Pattichis, A.C. Bovik, “Foveae video compression with optimal rate control” IEEE Trans. on Img. Proc. Volume: 10 Issue: 7, July 2001.
Sherman Networks, “KaZaA”, http://www.kazaa.com/, 2002.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Leszczuk, M., Papir, Z. (2002). Accuracy vs. Speed Trade-Off in Detecting of Shots in Video Content for Abstracting Digital Video Libraries. In: Boavida, F., Monteiro, E., Orvalho, J. (eds) Protocols and Systems for Interactive Distributed Multimedia. IDMS 2002. Lecture Notes in Computer Science, vol 2515. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36166-9_16
Download citation
DOI: https://doi.org/10.1007/3-540-36166-9_16
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00169-0
Online ISBN: 978-3-540-36166-4
eBook Packages: Springer Book Archive