Abstract
In this paper, we present an Orthogonal Locality Preserving Projection based (OLPP) approach to capture three-dimensional human motion from monocular images. From the motion capture data residing in high dimension space of human activities, we extract the motion base space in which human pose can be described essentially and concisely by more controllable way. This is actually a dimensionality reduction process completed in the framework of OLPP. And then, the structure of this space corresponding to special activity such as walking motion is explored with data clustering. Pose recovering is performed in the generative framework. For the single image, Gaussian mixture model is used to generate candidates of the 3D pose. The shape context is the common descriptor of image silhouette feature and synthetical feature of human model. We get the shortlist of 3D poses by measuring the shape contexts matching cost between image features and the synthetical features. In tracking situation, an AR model trained by the example sequence produces almost accurate pose predictions. Experiments demonstrate that the proposed approach works well.
Chapter PDF
Similar content being viewed by others
References
Aggarwal, J.K., Cai, Q.: Human Motion Analysis: A Review. Computer Vision and Image Understanding 73(3), 428–440 (1999)
Moeslund, T.B., Granum, E.: A Survey of Computer Vision-Based Human Motion Capture. Computer Vision and Image Understanding 81, 231–268 (2001)
Sminchisescu, C.: Optimization and Learning Algorithms for Visual Inference. In: ICCV 2005 tutorial (2005)
Cai, D., He, X., Han, J., Zhang, H.-J.: Orthogonal Laplacianfaces for Face Recognition. IEEE Transactions on Image Processing 15(11), 3608–3614 (2006)
He, X., Yan, S., Hu, Y., Niyogi, P., Zhang, H.-J.: Face recognition using laplacianfaces. IEEE Trans. on Pattern Analysis and Machine Intelligence, 27(3) (2005)
Agarwal, A., Triggs, B.: Tracking Articulated Motion Using a Mixture of Autoregressive Models. In: Proc. European Conf. Computer Vision (2004)
Belongie, S., Malik, J., Puzicha, J.: Shape Matching and Object Recognition Using Shape Contexts. IEEE Trans. Pattern Analysis and Machine Intelligence 24(4), 509–522 (2002)
Urtasun, R., Fleet, D.J., Fua, P.: Monocular 3D Tracking of The Golf Swing. In: Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 932–938 (2005)
Urtasun, R., Fua, P.: 3D Human Body Tracking Using Deterministic Temporal Motion Models. In: Proc. European Conf. Computer Vision, Prague, Czech Republic (May 2004)
Sidenbladh, H., Black, M., Sigal, L.: Implicit Probabilistic Models of Human Motion for Synthesis and Tracking. In: Proc. European Conf. Computer Vision, vol. 1 (2002)
Ormoneit, D., Sidenbladh, H., Black, M., Hastie, T.: Learning and Tracking Cyclic Human. In: Advances in Neural Information Processing Systems, vol. 13, pp. 894–900. The MIT Press, Cambridge (2001)
Mori, G., Malik, J.: Recovering 3D Human Body Configurations Using Shape Contexts. IEEE Trans. Pattern Analysis and Machine Intelligence 28(4), 1052–1062 (2006)
Mori, G., Belongie, S., Malik, J.: Efficient Shape Matching Using Shape Contexts. IEEE Trans. Pattern Analysis and Machine Intelligence 27(11), 1832–1837 (2005)
Ning, H., Tan, T., Wang, L., Hu, W.: People Tracking Based on Motion Model and Motion Constraints with Automatic Initialization. Pattern Recognition 37, 1423–1440 (2004)
Agarwal, A., Triggs, B.: Recovering 3D Human Pose from Monocular Images. IEEE Trans. Pattern Analysis and Machine Intelligence 28(1), 44–58 (2006)
Sminchisescu, C., Kanaujia, A., Li, Z., Metaxas, D.: Discriminative Density Propagation for 3D Human Motion Estimation. In: Proc. IEEE CS Conf. Computer Vision and Pattern Recognition. vol. 1(1), pp. 390–397 (2005)
Rosales, R., Sclaroff, S.: Learning Body Pose Via Specialized Maps. In: NIPS (2002)
Taylor, C.J.: Reconstruction of Articulated Objects from Point Correspondences in a Single Uncalibrated Image. Computer Vision and Image Understanding 80, 349–363 (2000)
Jin, X., Tai, C.-L.: Convolution Surfaces for Arcs and Quadratic Curves with a Varying Kernel. The Visual Computer 18, 530–546 (2002)
CMU database: http://mocap.cs.cmu.edu/
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhao, X., Liu, Y. (2007). Capturing 3D Human Motion from Monocular Images Using Orthogonal Locality Preserving Projection. In: Duffy, V.G. (eds) Digital Human Modeling. ICDHM 2007. Lecture Notes in Computer Science, vol 4561. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73321-8_36
Download citation
DOI: https://doi.org/10.1007/978-3-540-73321-8_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73318-8
Online ISBN: 978-3-540-73321-8
eBook Packages: Computer ScienceComputer Science (R0)