Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Compact Representations of Videos Through Dominant and Multiple Motion Estimation

Published: 01 August 1996 Publication History

Abstract

An explosion of on-line image and video data in digital form is already well underway. With the exponential rise in interactive information exploration and dissemination through the World-Wide Web (WWW), the major inhibitors of rapid access to on-line video data are costs and management of capture and storage, lack of real-time delivery, and nonavailability of content-based intelligent search and indexing techniques. The solutions for capture, storage, and delivery may be on the horizon or a little beyond. However, even with rapid delivery, the lack of efficient authoring and querying tools for visual content-based indexing may still inhibit as widespread a use of video information as that of text and traditional tabular data is currently.In order to be able to nonlinearly browse and index into videos through visual content, it is necessary to develop authoring tools that can automatically separate moving objects and significant components of the scene, and represent these in a compact form. Given that video data comes in torrents almost a megabyte every 30th of a second it will be highly inefficient to search for objects and scenes in every frame of a video. In this paper, we present techniques to automatically derive compact representations of scenes and objects from the motion information.Image motion is a significant cue in videos for the separation of scenes into their significant components and for the separation of moving objects. Motion analysis is useful in capturing the visual content of videos for indexing and browsing in two different ways. First, separation of the static scene from moving objects can be accomplished by employing dominant 2D/3D motion estimation methods. Alternatively, if the goal is to be able to represent the fixed scene too as a composition of significant structures and objects, then simultaneous multiple motion methods might be more appropriate. In either case, view-based summarized representations of the scene can be created by video compositing/mosaicing based on the estimated motions. We present robust algorithms for both kinds of representations: 1) dominant motion estimation based techniques which exploit a fairly common occurrence in videos that a mostly fixed background (scene) is imaged with or without independently moving objects, and 2) simultaneous multiple motion estimation and representation of motion video using layered representations. Ample examples of the representations achieved by each method are included in the paper.

References

[1]
E.H. Adelson and P. Anandan, "Ordinal Characteristics of Transparency," Proc. AAAI Workshop Qualitative Vision, 1990.
[2]
G. Adiv, "Determining 3D Motion and Structure from Optical Flows Generated by Several Moving Objects," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 7, no. 4, pp. 384-401, 1985.
[3]
J. Ashley M. Flickner J. Hafner, et al., "Automatic and Semi-Automatic Methods for Image Annotation and Retrieval in QBIC," Image and Video Storage Retrieval III, vol. 2, 420. San Jose, Calif.: SPIE, 1995.
[4]
S. Ayer, "Sequential and Competitive Methods for the Estimation of Multiple Motions," PhD thesis, EPFL, Lausanne, 1995.
[5]
S. Ayer and H.S. Sawhney, "Layered Representation of Motion Video Using Robust Maximum-Likelihood Estimation of Mixture Models and MDL Encoding," Proc. Int'l Conf. Computer Vision, pp. 777-784, 1995.
[6]
S. Ayer P. Schroeter and J. Bigün, "Segmentation of Moving Objects by Robust Motion Parameter Estimation Over Multiple Frames," Proc. ECCV, Stockholm, May 1994.
[7]
J.R. Bergen P. Anandan K.J. Hanna and R. Hingorani, "Hierarchical Model-Based Motion Estimation," Proc. Second ECCV, pp. 237-252, 1992.
[8]
M.J. Black and P. Anandan, "The Robust Estimation of Multiple Motions: Affine and Piecewise-Smooth Flow Fields," Technical Report TR, Xerox PARC, CA, Dec. 1993.
[9]
M. Bober and J. Kittler, "Robust Motion Analysis," Proc. CVPR, pp. 947-952, Seattle, June 1994.
[10]
T. Darrell and A.P. Pentland, "Cooperative Robust Estimation Using Layers of Support," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 5, pp. 474-487, May 1995.
[11]
M. Flickner H.S. Sawhney W. Niblack, et al., "Query by Image Video Content: The QBIC System," Computer, pp. 23-32, Sept. 1995.
[12]
F.R. Hampel E.M. Ronchetti P.J. Rousseeuw and W.A. Stahel, Robust Statistics: The Approach Based on Influence Functions. New York: J. Wiley & Sons, 1986.
[13]
K.J. Hanna, "Direct Multi-Resolution Estimation of Ego-Motion and Structure from Motion," Proc. IEEE Workshop Visual Motion, pp. 156-162, 1991.
[14]
R.I. Hartley, "Euclidean Reconstruction from Uncalibrated Views," Proc. Joint European-US Workshop Applications of Invariance in Computer Vision, 1993.
[15]
H. Hashihara J. Takahashi and J.-K. Hong, "Scene Retrieval Method for Motion Image Databases," technical report, IBM Tokyo Research Laboratory, 1991.
[16]
V. Hasselblad, "Estimation of Parameters for a Mixture of Normal Distributions," Technometrics, vol. 8, no. 3, pp. 431-446, Aug. 1966.
[17]
K. Hirata and T. Kato, "Rough Sketch-Based Image Information Retrieval," NEC R&D, vol. 34, no. 2, pp. 263-273, Apr. 1993.
[18]
S. Hsu P. Anandan and S. Peleg, "Accurate Computation of Optical Flow by Using Layered Motion Representation," Proc. ICPR, pp. 743-746, Jerusalem, Oct. 1994.
[19]
M. Irani P. Anandan and S. Hsu, "Mosaic Based Representations of Video Sequences and Their Applications," Proc. Int'l Conf. Computer Vision, pp. 605-611, 1995.
[20]
M. Irani B. Rousso and S. Peleg, "Detecting and Tracking Multiple Moving Objects Using Temporal Integration," Proc. ECCV, pp. 282-287, Santa Margherita, Italy, May 1992.
[21]
A. Jepson and M.J. Black, "Mixture Models for Optical Flow Computation," Proc. CVPR, pp. 760-761, New York, June 1993.
[22]
R. Kumar P. Anandan and K. Hanna, "Direct Recovery of Shape from Multiple Views: A Parallax Based Approach," Proc. ICPR, pp. 685-688, 1994.
[23]
R. Kumar P. Anandan M. Irani, et al., "Representation of Scenes from Collection of Images," Proc. IEEE Workshop Representation of Visual Scenes, 1995.
[24]
Y.G. Leclerc, "Constructing Simple Stable Descriptions for Image Partitioning," Int'l J. Computer Vision, vol. 3, no. 1, pp. 73-102, 1989.
[25]
G. Li, "Robust Regression," Exploring Data Tables, Trends and Shapes, D.C. Hoaglin, F. Mosteller, and J.W. Tukey, eds., chap. 8. New York: John Wiley & Sons, 1985.
[26]
W.J. MacLean A.D. Jepson and R.C. Frecker, "Recovery of Egomotion and Segmentation of Independent Object Motion Using the EM Algorithm," Proc. BMVC, 1994.
[27]
S. Mann and R.W. Picard, "Virtual Bellows: Constructing High Quality Stills from Video," Proc. ICIP, 1994.
[28]
G.J. McLachlan and K.E. Basford, Mixture Models Inference and Applications to Clustering. New York and Basel: Marcel Dekker, Inc., 1988.
[29]
W. Niblack R. Barber W. Equitz, et al., "The QBIC Project: Querying Images by Content Using Color, Texture, and Shape," SPIE 1980, Storage and Retrieval for Image and Video Databases, pp. 173-187, Feb. 1993.
[30]
J.M. Odobez and P. Bouthemy, "Detection of Multiple Moving Objects Using Multiscale mrf with Camera Motion Compensation," Proc. ICIP, pp. 257-261, Austin, Tex., Nov. 1994.
[31]
J.M. Odobez and P. Bouthemy, "Robust Multiresolution Estimation of Parametric Motion Models in Complex Image Sequences," Proc. Seventh EUSIPCO European Conf. Signal Processing, pp. 411-414, Edinburgh, Sept. 1994.
[32]
A. Pentland R.W. Picard and S. Sclaroff, "Photobook: Tools for Content-Based Manipulation of Image Databases," Proc. Storage and Retrieval for Image and Video Databases II. SPIE, 1994.
[33]
J. Rissanen, "A Universal Prior for Integers and Estimation by Minimum Description Length," Annals of Statistics, vol. 11, no. 2, pp. 416-431, 1983.
[34]
P.J. Rousseeuw and A.M. Leroy, Robust Regression & Outlier Detection. New York: J. Wiley & Sons, 1987.
[35]
H.S. Sawhney, "Simplifying Motion and Structure Analysis Using Planar Parallax and Image Warping," Proc. Int'l Conf. Pattern Recognition, pp. A403-A408, 1994.
[36]
H.S. Sawhney S. Ayer and M. Gorkani, "Model-Based 2D & 3D Dominant Motion Estimation for Mosaicing and Video Representation," Proc. Int'l Conf. Computer Vision, pp. 583-590, 1995.
[37]
G.A.F. Seber and C.J. Wild, Nonlinear Regression. New York: Wiley, 1989.
[38]
A. Shashua and N. Navab, "Relative Affine Structure: Theory and Application to 3D Reconstruction from Perspective Views," Proc. Computer Vision & Pattern Recognition Conf., pp. 483-489, 1994.
[39]
R. Szelski, "Image Mosaicing for Tele-Reality Applications," Proc. IEEE Workshop Applications of Computer Vision, pp. 44-53, 1994.
[40]
R. Szeliski and S.B. Kang, "Direct Methods for Visual Scene Reconstruction," Proc. IEEE Workshop Representation of Visual Scenes, 1995.
[41]
L.A. Teodosio and W. Bender, "Salient Video Stills: Content and Context Preserved," Proc. ACM Int'l Conf. Multimedia, 1993.
[42]
Y. Tonomura A. Akutsu K. Otsuji and T. Sadakata, "VideoMAP and VideoSpaceIcon: Tools for Anatomizing Video Content," Proc. ACM INTERCHI, pp. 131-136, 1993.
[43]
J.Y.A. Wang and E.H. Adelson, "Layered Representation for Motion Analysis," Proc. Computer Vision & Pattern Recognition Conf., pp. 361-366, New York, June 1993.

Cited By

View all
  • (2023)A Survey on Deep Learning Technique for Video SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.322557345:6(7099-7122)Online publication date: 1-Jun-2023
  • (2019)Dynamic mosaicking: region-based method using edge detection for an optimal seamlineMultimedia Tools and Applications10.1007/s11042-019-7603-778:16(23225-23253)Online publication date: 1-Aug-2019
  • (2018)Multiple object tracking by employing shaped-based features and Kalman filterInternational Journal of Business Intelligence and Data Mining10.5555/3192182.319220213:1-3(331-346)Online publication date: 1-Jan-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image IEEE Transactions on Pattern Analysis and Machine Intelligence
IEEE Transactions on Pattern Analysis and Machine Intelligence  Volume 18, Issue 8
August 1996
92 pages
ISSN:0162-8828
Issue’s Table of Contents

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 August 1996

Author Tags

  1. Compact video representations
  2. expectation-maximization (EM) algorithm.
  3. layered motion representations
  4. mixture models
  5. motion segmentation
  6. robust estimation
  7. video indexing
  8. video mosaics
  9. video motion analysis

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2023)A Survey on Deep Learning Technique for Video SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.322557345:6(7099-7122)Online publication date: 1-Jun-2023
  • (2019)Dynamic mosaicking: region-based method using edge detection for an optimal seamlineMultimedia Tools and Applications10.1007/s11042-019-7603-778:16(23225-23253)Online publication date: 1-Aug-2019
  • (2018)Multiple object tracking by employing shaped-based features and Kalman filterInternational Journal of Business Intelligence and Data Mining10.5555/3192182.319220213:1-3(331-346)Online publication date: 1-Jan-2018
  • (2018)Automatic Clustering of Natural Scene Using Color Spatial Envelope FeatureProceedings of the 2018 International Conference on Computing and Artificial Intelligence10.1145/3194452.3194476(144-148)Online publication date: 12-Mar-2018
  • (2018)A dynamic mosaicking method based on histogram equalization for an improved seamlineProcedia Computer Science10.1016/j.procs.2018.01.131127:C(344-352)Online publication date: 1-May-2018
  • (2015)Image Mosaicing Using a Self-Calibration Camera3D Research10.1007/s13319-015-0048-56:2(1-15)Online publication date: 1-Jun-2015
  • (2013)A survey of appearance models in visual object trackingACM Transactions on Intelligent Systems and Technology10.1145/2508037.25080394:4(1-48)Online publication date: 8-Oct-2013
  • (2012)Video SummarizationProceedings of the International Conference on Computer Vision and Graphics - Volume 759410.5555/2942031.2942033(1-13)Online publication date: 24-Sep-2012
  • (2011)Foreground objects segmentation for moving camera scenarios based on SCGMMProceedings of the 2011 international conference on Computational Intelligence for Multimedia Understanding10.1007/978-3-642-32436-9_17(195-206)Online publication date: 13-Dec-2011
  • (2010)Dynamic video narrativesACM SIGGRAPH 2010 papers10.1145/1833349.1778825(1-9)Online publication date: 26-Jul-2010
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media