Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2733373.2806310acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Human Action Recognition With Trajectory Based Covariance Descriptor In Unconstrained Videos

Published: 13 October 2015 Publication History

Abstract

Human action recognition from realistic videos plays a key role in multimedia event detection and understanding. In this paper, a novel Trajectory Based Covariance (TBC) descriptor is proposed, which is formulated along the dense trajectories. To map the descriptor matrix to vector space and trim out the redundancy of data, the TBC descriptor matrix is projected to Euclidean space by the Logarithm Principal Components Analysis (LogPCA). Our method is tested on the challenging Hollywood2 and TV Human Interaction datasets. Experimental results show that the proposed TBC descriptor outperforms three baseline descriptors (i.e., histogram of oriented gradient, histogram of optical flow and motion boundary histogram), and our method achieves better recognition performances than a number of state-of-the-art approaches.

References

[1]
Nicolas Ballas, Bertrand Delezoide, and Françoise Prêteux. Trajectory signature for action recognition in video. In ACM MM'12, pages 1429--1432, 2012.
[2]
Mihir Jain, Hervé Jégou, and Patrick Bouthemy. Better exploiting motion for better action recognition. In CVPR'13, pages 2555--2562, 2013.
[3]
Heng Wang and Cordelia Schmid. Action recognition with improved trajectories. In ICCV'13, pages 3551--3558, 2013.
[4]
Oncel Tuzel, Fatih Porikli, and Peter Meer. Pedestrian detection via classification on riemannian manifolds. IEEE Trans. Pattern Anal. Mach. Intell., 30(10):1713--1727, 2008.
[5]
Navneet Dalal and Bill Triggs. Histograms of oriented gradients for human detection. In CVPR'05, pages 886--893, 2005.
[6]
Ivan Laptev, Marcin Marszalek, Cordelia Schmid, and Benjamin Rozenfeld. Learning realistic human actions from movies. In CVPR'08, pages 1--8, 2008.
[7]
Navneet Dalal, Bill Triggs, and Cordelia Schmid. Human detection using oriented histograms of flow and appearance. In ECCV'06, pages 428--441, 2006.
[8]
Florent Perronnin and Christopher Dance. Fisher kernels on visual vocabularies for image categorization. In CVPR'07, pages 1--8, 2007.
[9]
Dan Oneata, Jakob Verbeek, and Cordelia Schmid. Action and event recognition with fisher vectors on a compact feature set. In ICCV'13, pages 1817--1824, 2013.
[10]
Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. LIBLINEAR: A library for large linear classification. J. Machine Learning Research, 9:1871--1874, 2008.
[11]
Marcin Marszalek, Ivan Laptev, and Cordelia Schmid. Actions in context. In CVPR'09, pages 2929--2936, 2009.
[12]
Alonso Patron-Perez, Marcin Marszalek, Ian Reid, and Andrew Zisserman. Structured learning of human interactions in TV shows. IEEE Trans. Pattern Anal. Mach. Intell., 34(12):2441--2453, 2012.
[13]
Gang Yu, Junsong Yuan, and Zicheng Liu. Propagative hough voting for human activity recognition. In ECCV'12, pages 693--706, 2012.
[14]
Minh Hoai and Andrew Zisserman. Talking heads: Detecting humans and recognizing their interactions. In CVPR'14, pages 875--882, 2014.
[15]
Minh Hoai and Andrew Zisserman. Improving human action recognition using score distribution and ranking. In ACCV'14, pages 3--20, 2014.

Cited By

View all
  • (2023)Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual dataNeural Computing and Applications10.1007/s00521-023-08440-035:18(13321-13341)Online publication date: 13-Mar-2023
  • (2019)Human Action Recognition Based on Foreground Trajectory and Motion Difference DescriptorsApplied Sciences10.3390/app91021269:10(2126)Online publication date: 24-May-2019
  • (2019)Multi-modal learning for affective content analysis in moviesMultimedia Tools and Applications10.1007/s11042-018-5662-978:10(13331-13350)Online publication date: 1-May-2019
  • Show More Cited By

Index Terms

  1. Human Action Recognition With Trajectory Based Covariance Descriptor In Unconstrained Videos

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '15: Proceedings of the 23rd ACM international conference on Multimedia
    October 2015
    1402 pages
    ISBN:9781450334594
    DOI:10.1145/2733373
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 October 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. covariance
    2. logpca
    3. motion trajectory
    4. tbc descriptor

    Qualifiers

    • Short-paper

    Funding Sources

    • Fundamental Research Funds for the Central Universities
    • ``Shu Guang' project of Shanghai Municipal Education Commission and Shanghai Education Development Foundation
    • Program for Professor of Special Appointment (Eastern Scholar) at Shanghai Institutions of Higher Learning
    • National Natural Science Foundation of China

    Conference

    MM '15
    Sponsor:
    MM '15: ACM Multimedia Conference
    October 26 - 30, 2015
    Brisbane, Australia

    Acceptance Rates

    MM '15 Paper Acceptance Rate 56 of 252 submissions, 22%;
    Overall Acceptance Rate 995 of 4,171 submissions, 24%

    Upcoming Conference

    MM '24
    The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne , VIC , Australia

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 14 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual dataNeural Computing and Applications10.1007/s00521-023-08440-035:18(13321-13341)Online publication date: 13-Mar-2023
    • (2019)Human Action Recognition Based on Foreground Trajectory and Motion Difference DescriptorsApplied Sciences10.3390/app91021269:10(2126)Online publication date: 24-May-2019
    • (2019)Multi-modal learning for affective content analysis in moviesMultimedia Tools and Applications10.1007/s11042-018-5662-978:10(13331-13350)Online publication date: 1-May-2019
    • (2018)Motion keypoint trajectory and covariance descriptor for human action recognitionThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-016-1345-634:3(391-403)Online publication date: 1-Mar-2018
    • (2017)Attention Transfer from Web Images for Video RecognitionProceedings of the 25th ACM international conference on Multimedia10.1145/3123266.3123432(1-9)Online publication date: 23-Oct-2017
    • (2017)Learning correlations for human action recognition in videosMultimedia Tools and Applications10.1007/s11042-017-4416-476:18(18891-18913)Online publication date: 1-Sep-2017
    • (2016)Real-Time Action Recognition with Enhanced Motion Vector CNNs2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR.2016.297(2718-2726)Online publication date: Jun-2016
    • (2016)Accelerating Large-Scale Human Action Recognition with GPU-Based SparkAdvances in Multimedia Information Processing - PCM 201610.1007/978-3-319-48896-7_66(670-679)Online publication date: 27-Nov-2016
    • (2016)Anomaly Detection and Activity Perception Using Covariance Descriptor for TrajectoriesComputer Vision – ECCV 2016 Workshops10.1007/978-3-319-48881-3_51(728-742)Online publication date: 3-Nov-2016

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media