Gesture Recognition Using Template Based Random Forest Classifiers

Camgöz, Necati Cihan; Kindiroglu, Ahmet Alp; Akarun, Lale

doi:10.1007/978-3-319-16178-5_41

Necati Cihan Camgöz¹⁶,
Ahmet Alp Kindiroglu¹⁶ &
Lale Akarun¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8925))

Included in the following conference series:

European Conference on Computer Vision

5548 Accesses
7 Citations

Abstract

This paper presents a framework for spotting and recognizing continuous human gestures. Skeleton based features are extracted from normalized human body coordinates to represent gestures. These features are then used to construct spatio-temporal template based Random Decision Forest models. Finally, predictions from different models are fused at decision-level to improve overall recognition performance. Our method has shown competitive results on the ChaLearn 2014 Looking at People: Gesture Recognition dataset. Trained on a dataset of 20 gesture vocabulary and 7754 gesture samples, our method achieved a Jaccard Index of \(0.74663\) on the test set, reaching 7th place among contenders. Among methods that exclusively used skeleton based features, our method obtained the highest recognition performance.

Download to read the full chapter text

Chapter PDF

A Study of Feature Combination in Gesture Recognition with Kinect

Kinect vs. Low-cost Inertial Sensing for Gesture Recognition

Transfer Learning Decision Forests for Gesture Recognition

Keywords

References

Agarwal, A., Triggs, B.: Tracking articulated motion using a mixture of autoregressive models. In: Pajdla, T., Matas, J.G. (eds.) ECCV 2004. LNCS, vol. 3023, pp. 54–65. Springer, Heidelberg (2004)
Chapter Google Scholar
Bishop, C.M.: Pattern Recognition and Machine Learning (Information Science and Statistics). Springer, New York (2006)
MATH Google Scholar
Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(3), 257–267 (2001)
Article Google Scholar
Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
Article MathSciNet MATH Google Scholar
Chang, J.Y.: Nonparametric gesture labeling from multi-modal data. In: European Conference on Computer Vision (ECCV) 2014 ChaLearn Workshop, Zurich (2014)
Google Scholar
Chen, G., Clarke, D., Weikersdorfer, D., Giuliani, M.: Multi-modality gesture detection and recognition with un-supervision, randomization and discrimination. In: European Conference on Computer Vision (ECCV) 2014 ChaLearn Workshop, Zurich (2014)
Google Scholar
Erol, A., Bebis, G., Nicolescu, M., Boyle, R.D., Twombly, X.: Vision-based hand pose estimation: A review. Computer Vision and Image Understanding 108(1–2), 52–73 (2007)
Article Google Scholar
Escalera, S., Baró, X., Gonzàlez, J., Bautista, M.A., Madadi, M., Reyes, M., Ponce, V., Escalante, H.J., Shotton, J., Guyon, I.: ChaLearn looking at people challenge 2014: dataset and results. In: ECCV Workshop, Zurich (2014)
Google Scholar
Evangelidis; G., Singh; G., Horaud, R.: Continuous gesture recognition from articulated poses. In: European Conference on Computer Vision (ECCV) 2014 ChaLearn Workshop, Zurich (2014)
Google Scholar
Kuznetsova, A., Leal-Taixe, L., Rosenhahn, B.: Real-time sign language recognition using a consumer depth camera. In: ICCV 2013 (2013)
Google Scholar
Liang, B., Zheng, L.: Multi-modal gesture recognition using skeletal joints and motion trail model. In: European Conference on Computer Vision (ECCV) 2014 ChaLearn Workshop, Zurich, pp. 1–16 (2014)
Google Scholar
Mitra, S., Acharya, T.: Gesture Recognition: A Survey. IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews) 37(3), 311–324 (2007)
Article Google Scholar
Monnier, C., German, S., Ost, A.: A multi-scale boosted detector for efficient and robust gesture recognition. In: European Conference on Computer Vision (ECCV) 2014 ChaLearn Workshop (2014)
Google Scholar
Mori, G.: Max-margin hidden conditional random fields for human action recognition. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 872–879. IEEE (2009)
Google Scholar
Neverova, N., Wolf, C., Taylor, G.W., Nebout, F.: Multi-scale deep learning for gesture detection and localization. In: European Conference on Computer Vision (ECCV) 2014 ChaLearn Workshop, Zurich (2014)
Google Scholar
Peng, X., Wang, L.: Action and gesture temporal spotting with. In: European Conference on Computer Vision (ECCV) 2014 ChaLearn Workshop (2014)
Google Scholar
Pigou, L., Dieleman, S., Kindermans, P.J., Schrauwen, B.: Sign language recognition using convolutional neural networks. In: European Conference on Computer Vision (ECCV) 2014 ChaLearn Workshop, Zurich (2014)
Google Scholar
Poppe, R.: A survey on vision-based human action recognition. Image and Vision Computing 28(6), 976–990 (2010)
Article Google Scholar
Rabiner, L., Juang, B.: An introduction to hidden Markov models. IEEE ASSP Magazine (1986)
Google Scholar
Rautaray, S.S., Agrawal, A.: Vision based hand gesture recognition for human computer interaction: a survey (2012)
Google Scholar
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local svm approach. In: Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004, vol. 3, pp. 32–36, August 2004
Google Scholar
Sempena, S., Maulidevi, N.U., Aryan, P.R.: Human action recognition using Dynamic Time Warping. In: Proceedings of the 2011 International Conference on Electrical Engineering and Informatics, pp. 1–5 (2011)
Google Scholar
Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: CVPR, vol. 2 (2011)
Google Scholar
Starner, T., Pentland, A.: Real-time American sign language recognition from video using hidden Markov models. In: Proceedings of Computer Vision (1995)
Google Scholar
Suarez, J., Murphy, R.R.: Hand gesture recognition with depth images: a review. In: Proceedings - IEEE International Workshop on Robot and Human Interactive Communication, pp. 411–417 (2012)
Google Scholar
Sullivan, J., Carlsson, S.: Recognizing and tracking human action. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002, Part I. LNCS, vol. 2350, pp. 629–644. Springer, Heidelberg (2002)
Chapter Google Scholar
Wachs, J.P., Kölsch, M., Stern, H., Edan, Y.: Vision-based hand-gesture applications (2011)
Google Scholar
Wu, D., Shao, L.: Deep dynamic neural networks for gesture segmentation and recognition. In: European Conference on Computer Vision (ECCV) 2014 ChaLearn Workshop, Zurich (2014)
Google Scholar
Yamato, J., Ohya, J., Ishii, K.: Recognizing human action in time-sequential images using hidden Markov model. In: Proceedings CVPR 1992, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 379–385 (1992)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Engineering Department, Bogazici University, Istanbul, Turkey
Necati Cihan Camgöz, Ahmet Alp Kindiroglu & Lale Akarun

Authors

Necati Cihan Camgöz
View author publications
You can also search for this author in PubMed Google Scholar
Ahmet Alp Kindiroglu
View author publications
You can also search for this author in PubMed Google Scholar
Lale Akarun
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Necati Cihan Camgöz .

Editor information

Editors and Affiliations

University College London, London, United Kingdom
Lourdes Agapito
University of Lugano, Lugano, Switzerland
Michael M. Bronstein
Technische Universität Dresden, Dresden, Germany
Carsten Rother

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Camgöz, N.C., Kindiroglu, A.A., Akarun, L. (2015). Gesture Recognition Using Template Based Random Forest Classifiers. In: Agapito, L., Bronstein, M., Rother, C. (eds) Computer Vision - ECCV 2014 Workshops. ECCV 2014. Lecture Notes in Computer Science(), vol 8925. Springer, Cham. https://doi.org/10.1007/978-3-319-16178-5_41

Download citation

DOI: https://doi.org/10.1007/978-3-319-16178-5_41
Published: 19 March 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16177-8
Online ISBN: 978-3-319-16178-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Gesture Recognition Using Template Based Random Forest Classifiers

Abstract

Chapter PDF

Similar content being viewed by others

A Study of Feature Combination in Gesture Recognition with Kinect

Kinect vs. Low-cost Inertial Sensing for Gesture Recognition

Transfer Learning Decision Forests for Gesture Recognition

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Gesture Recognition Using Template Based Random Forest Classifiers

Abstract

Chapter PDF

Similar content being viewed by others

A Study of Feature Combination in Gesture Recognition with Kinect

Kinect vs. Low-cost Inertial Sensing for Gesture Recognition

Transfer Learning Decision Forests for Gesture Recognition

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation