research-article

Discriminative poses for early recognition in multi-camera networks

Authors:

Scott Spurlock,

Richard SouvenirAuthors Info & Claims

ICDSC '15: Proceedings of the 9th International Conference on Distributed Smart Cameras

Pages 74 - 79

https://doi.org/10.1145/2789116.2789117

Published: 08 September 2015 Publication History

Abstract

We present a framework for early action recognition in a multi-camera network. Our approach balances recognition accuracy with speed by dynamically selecting the best camera for classification. We follow an iterative clustering approach to learn sets of keyposes that are discriminative for recognition as well as for predicting the best camera for classification of future frames. Experiments on multi-camera datasets demonstrate the applicability of our view-shifting framework to the problem of early recognition.

References

[1]

S. Cheema, A. Eweiwi, C. Thurau, and C. Bauckhage. Action recognition by learning discriminative key poses. In IEEE Intl Conf. on Computer Vision Workshops, pages 1302--1309, 2011.

[2]

J. W. Davis and A. Tyagi. Minimal-latency human action recognition using reliable-inference. Image and Vision Computing, 24(5):455--472, 2006.

Digital Library

[3]

C. Doersch, S. Singh, A. Gupta, J. Sivic, and A. A. Efros. What makes paris look like paris? ACM Trans. Graph., 31(4):101, 2012.

Digital Library

[4]

N. Gkalelis, H. Kim, A. Hilton, N. Nikolaidis, and I. Pitas. The i3dpost multi-view and 3d human action/interaction database. In Visual Media Production, pages 159--168. IEEE, 2009.

Digital Library

[5]

M. Hoai and F. De la Torre. Max-margin early event detectors. Intl Journal of Computer Vision, 107(2):191--202, 2014.

Digital Library

[6]

A. Jain, A. Gupta, M. Rodriguez, and L. S. Davis. Representing videos using mid-level discriminative patches. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pages 2571--2578. IEEE, 2013.

Digital Library

[7]

Z. Jiang, G. Zhang, and L. S. Davis. Submodular dictionary learning for sparse coding. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pages 3418--3425. IEEE, 2012.

Digital Library

[8]

I. Laptev. On space-time interest points. Intl Journal of Computer Vision, 64(2-3):107--123, 2005.

Digital Library

[9]

L. Liu, L. Shao, X. Zhen, and X. Li. Learning discriminative key poses for action recognition. IEEE T. Cybernetics, 43(6):1860--1870, 2013.

[10]

T. Määttä, A. Härmä, and H. Aghajan. On efficient use of multi-view data for activity recognition. In Proc. Intl Conf. on Distributed Smart Cameras, ICDSC '10, pages 158--165, New York, NY, USA, 2010. ACM.

Digital Library

[11]

T. Malisiewicz, A. Gupta, and A. A. Efros. Ensemble of exemplar-svms for object detection and beyond. In Proc. Intl Conf. on Computer Vision, pages 89--96. IEEE, 2011.

Digital Library

[12]

D. Rudoy and L. Zelnik-Manor. Viewpoint selection for human actions. Intl Journal of Computer Vision, 97(3):243--254, 2012.

Digital Library

[13]

M. Ryoo. Human activity prediction: Early recognition of ongoing activities from streaming videos. In Proc. Intl Conf. on Computer Vision, pages 1036--1043. IEEE, 2011.

Digital Library

[14]

K. Schindler and L. Van Gool. Action snippets: How many frames does human action recognition require? In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pages 1--8, 2008.

[15]

C. Shen, C. Zhang, and S. Fels. A multi-camera surveillance system that estimates quality-of-view measurement. In Proc. Intl Conf. on Image Processing, volume 3, pages III--193. IEEE, 2007.

[16]

S. Singh, A. Gupta, and A. A. Efros. Unsupervised discovery of mid-level discriminative patches. In Proc. European Conf. on Computer Vision, pages 73--86. Springer, 2012.

Digital Library

[17]

D. Tran and A. Sorokin. Human activity recognition with metric learning. In Proc. European Conf. on Computer Vision, pages 548--561. Springer-Verlag, 2008.

Digital Library

[18]

D. Weinland, E. Boyer, and R. Ronfard. Action recognition from arbitrary views using 3d exemplars. In Proc. Intl Conf. on Computer Vision, pages 1--7, 2007.

[19]

D. Weinland, R. Ronfard, and E. Boyer. Free viewpoint action recognition using motion history volumes. Computer Vision and Image Understanding, 104(2):249--257, 2006.

Digital Library

[20]

D. Weinland, R. Ronfard, and E. Boyer. A survey of vision-based methods for action representation, segmentation and recognition. Computer Vision and Image Understanding, 115(2):224--241, 2011.

Digital Library

[21]

C. Wu, A. H. Khalili, and H. Aghajan. Multiview activity recognition in smart homes with spatio-temporal features. In Proc. Intl Conf. on Distributed Smart Cameras, pages 142--149. ACM, 2010.

Digital Library

[22]

X. Wu, D. Xu, L. Duan, and J. Luo. Action recognition using context and appearance distribution features. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pages 489--496, 2011.

Digital Library

[23]

Z. Zhao and A. M. Elgammal. Information theoretic key frame selection for action recognition. In Proc. of the British Machine Vision Conf., pages 1--10, 2008.

Cited By

Trehan SAakur S(2022)Towards Active Vision for Action Localization with Reactive Control and Predictive Learning2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV51458.2022.00345(3391-3400)Online publication date: Jan-2022
https://doi.org/10.1109/WACV51458.2022.00345
Wang BHuang LHoai M(2020)Active Vision for Early Recognition of Human Actions2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR42600.2020.00116(1078-1088)Online publication date: Jun-2020
https://doi.org/10.1109/CVPR42600.2020.00116

Index Terms

Discriminative poses for early recognition in multi-camera networks

Recommendations

Early Facial Expression Recognition Using Hidden Markov Models
ICPR '14: Proceedings of the 2014 22nd International Conference on Pattern Recognition

Although it is often necessary to recognize users' expressions as soon as possible after it starts and before it ends in many applications, few methods have been proposed explicitly for early facial expression recognition. In this paper, we propose an ...
Probabilistic recognition of human faces from video
Special issue on Face recognition

Recognition of human faces using a gallery of still or video images and a probe set of videos is systematically investigated using a probabilistic framework. In still-to-video recognition, where the gallery consists of still images, a time series state ...
Collaborative discriminative multi-metric learning for facial expression recognition in video

We present a new metric learning approach for facial expression recognition in videos.Our approach combines both audio and visual features and achieves better facial expression recognition performance.Experimental results clearly show the advantages of ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICDSC '15: Proceedings of the 9th International Conference on Distributed Smart Cameras

September 2015

225 pages

ISBN:9781450336819

DOI:10.1145/2789116

General Chairs:
Ricardo Carmona-Galán,
Ángel Rodríguez-Vázquez
IMSE-CNM (CSIC-Universidad de Sevilla), Spain

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Escuela Técnica superier de Ingeniería Informática, Universidad de Seville, Spain: Escuela Técnica superier de Ingeniería Informática, Universidad de Seville, Spain

In-Cooperation

SIGBED: ACM Special Interest Group on Embedded Systems

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 September 2015

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

ICDSC '15

Sponsor:

Escuela Técnica superier de Ingeniería Informática, Universidad de Seville, Spain

ICDSC '15: International Conference on distributed Smart Cameras

September 8 - 11, 2015

Seville, Spain

Acceptance Rates

ICDSC '15 Paper Acceptance Rate 43 of 48 submissions, 90%;

Overall Acceptance Rate 92 of 117 submissions, 79%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
79
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 16 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Trehan SAakur S(2022)Towards Active Vision for Action Localization with Reactive Control and Predictive Learning2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV51458.2022.00345(3391-3400)Online publication date: Jan-2022
https://doi.org/10.1109/WACV51458.2022.00345
Wang BHuang LHoai M(2020)Active Vision for Early Recognition of Human Actions2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR42600.2020.00116(1078-1088)Online publication date: Jun-2020
https://doi.org/10.1109/CVPR42600.2020.00116

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents