3D representation of videoconference image sequences using VRML 2.0

Kompatsiaris, Ioannis; Strintzis, Michael G.

doi:10.1007/3-540-64594-2_81

Ioannis Kompatsiaris¹ &
Michael G. Strintzis¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1425))

Included in the following conference series:

European Conference on Multimedia Applications, Services, and Techniques

Abstract

In this paper a procedure for visualisation of videoconference image sequences using Virtual Reality Modeling Language (VRML) 2.0 is described. First image sequence analysis is performed in order to estimate the shape and motion parameters of the person talking in front of the camera. For this purpose, we propose the K-Means with connectivity constraint algorithm as a general segmentation algorithm combining information of various types such as colour and motion. The algorithm is applied “hierarchically” in the image sequence and it is first used to separate the background from the foreground object and then to further segment the foreground object into the head and shoulders regions. Based on the above information, personal 3D shape parameters are estimated. The rigid 3D motion is estimated next for each sub-object. Finally a VRML file is created containing all the above estimated information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Video Pop-up: Monocular 3D Reconstruction of Dynamic Scenes

First International Workshop on Video Segmentation - Panel Discussion

Comparative Analysis of Different Clustering Techniques for Video Segmentation

References

D. Tzovaras, N. Grammalidis, and M. G. Strintzis, “Object — Based Coding of Stereo Image Sequences using Joint 3-D Motion/Disparity Compensation,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 7, Apr. 1997.
Google Scholar
K. Aizawa, H. Harashima, and T. Saito, “Model-based analysis-synthesis image coding (MBASIC) system for a person's face,” Signal Processing: Image Communication, vol. 1, pp. 139–152, Oct. 1989.
Google Scholar
H. G. Musmann, M. Hotter, and J. Ostermann, “Object-oriented analysis-synthesis coding of moving images,” Signal Processing: Image Communication, vol. 1, pp. 117–138, Oct. 1989.
Google Scholar
“Overview of the MPEG-4 Standard,” tech. rep., ISO/IEC JTC1/SC29/WG11 N1730, Stockholm Jul. 1997.
Google Scholar
VRML 2.0 Specification, http://vrml.sgi.com/moving-worlds.
Google Scholar
T. Kanade and P. J. Narayanan, “Virtualised reality: Constructing virtual worlds from real scenes,” IEEE Multimedia, pp. 34–46, Jan.-March 1997.
Google Scholar
P. E. Eren, C. Toklu, and M. Tekalp, “Object-based video manipulation and composition using 2d meshes in VRML,” in IEEE Workshop on Multimedia Signal Processing, (Princeton, New Jersey, USA), pp. 257–261, June 1997.
Google Scholar
S. Z. Selim and M. A. Ismail, “K-means-type algorithms,” IEEE Trans. Pattern Anal. and Mach. Intell., vol. 6, pp. 81–87, January 1984.
Google Scholar
M. J. T. Reindrs, Model Adaptation for image Coding. Delft University Press, 1995.
Google Scholar
A. W. Fitzgibbon, M. Pilu, and R. B. Fisher, “Direct Least Squares Fitting of Ellipses,” in International Conference on Pttern Recognition, (Vienna, Austria), August 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

Information Processing Laboratory Electrical and Computer Engineering Department, Aristotle University of Thessaloniki, 54006, Thessaloniki, Greece
Ioannis Kompatsiaris & Michael G. Strintzis

Authors

Ioannis Kompatsiaris
View author publications
You can also search for this author in PubMed Google Scholar
Michael G. Strintzis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

David Hutchison Ralf Schäfer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kompatsiaris, I., Strintzis, M.G. (1998). 3D representation of videoconference image sequences using VRML 2.0. In: Hutchison, D., Schäfer, R. (eds) Multimedia Applications, Services and Techniques — ECMAST'98. ECMAST 1998. Lecture Notes in Computer Science, vol 1425. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64594-2_81

Download citation

DOI: https://doi.org/10.1007/3-540-64594-2_81
Published: 29 July 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64594-8
Online ISBN: 978-3-540-69344-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

3D representation of videoconference image sequences using VRML 2.0

Abstract

Access this chapter

Preview

Similar content being viewed by others

Video Pop-up: Monocular 3D Reconstruction of Dynamic Scenes

First International Workshop on Video Segmentation - Panel Discussion

Comparative Analysis of Different Clustering Techniques for Video Segmentation

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

3D representation of videoconference image sequences using VRML 2.0

Abstract

Access this chapter

Preview

Similar content being viewed by others

Video Pop-up: Monocular 3D Reconstruction of Dynamic Scenes

First International Workshop on Video Segmentation - Panel Discussion

Comparative Analysis of Different Clustering Techniques for Video Segmentation

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation