Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3538806.3538811acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccbnConference Proceedingsconference-collections
research-article

An Improved Deep-learning Network for Abnormal Action Detection

Published: 14 July 2022 Publication History
  • Get Citation Alerts
  • Abstract

    Detecting actions in untrimmed videos is a challenging task. In order to improve the accuracy of abnormal action detection, an improved network combining Boundary-Matching Network (BMN) and Structured Segment Network (SSN) is proposed in this paper to achieve the temporal detection of abnormal actions from public places. BMN is used to generate the temporal proposals of abnormal actions in the video, and then SSN acts on the proposals generated by BMN to classify them into specific categories. By modifying the feature dimensions to adapt to the length of the video, and transforming the output generated by the BMN network into a proposal, the BMN and SSN networks are well combined. The experimental results prove that the proposed method achieves good results on the abnormal action dataset collected from public places.

    References

    [1]
    Talari S, Shafie-Khah M, and Siano P. 2017. A review of smart cities based on the internet of things concept J. Energies. 10(4): 421.
    [2]
    Miller M. 2015. The internet of things: How smart TVs, smart cars, smart homes, and smart cities are changing the world M. Pearson Education.
    [3]
    Ohnishi K, Hidaka M, and Harada T. 2016. Improved dense trajectory with cross streams. Proceedings of the 24th ACM international conference on Multimedia. pp. 257-261.
    [4]
    Wang H, Schmid C. 2013. Action recognition with improved trajectories. Proceedings of the IEEE international conference on computer vision. pp. 3551-58.
    [5]
    Dalal N, Triggs B, and Schmid. 2006. Human detection using oriented histograms of flow and appearance. European conference on computer vision, Springer. pp. 428-441.
    [6]
    Dalal N and Triggs B. 2005. Histograms of oriented gradients for human detection. IEEE computer society conference on computer vision and pattern recognition (CVPR'05). Ieee.1 pp. 886-893.
    [7]
    Simonyan K and Zisserman A. 2014. Two-stream convolutional networks for action recognition in videos. arXiv preprint arXiv. 1406 2199.
    [8]
    Tran D, Bourdev L D, and Fergus R. 2014. C3D: generic features for video analysis. CoRR 2(7): 8.
    [9]
    Zhu W, Lan C, and Xing J. 2016. Co-occurrence feature learning for skeleton based action recognition using regularized deep LSTM networks. Proceedings of the AAAI Conference on Artificial Intelligence, vol 30(1).
    [10]
    Gao J. 2017. Turn tap: Temporal unit regression network for temporal action proposals. Proceedings of the IEEE international conference on computer vision. pp. 3628-36.
    [11]
    Xiong Y. 2017. A pursuit of temporal accuracy in general activity detection. arXiv preprint arXiv.1703 02716.
    [12]
    Gao J. 2018. Ctap: Complementary temporal action proposal generation. Proceedings of the European conference on computer vision (ECCV). pp. 68-83.
    [13]
    Lin T. 2018. Bsn: Boundary sensitive network for temporal action proposal generation. Proceedings of the European Conference on Computer Vision (ECCV). pp. 3-19.
    [14]
    Liu Y. 2019. Multi-granularity generator for temporal action proposal. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 3604-13.
    [15]
    Lin T. 2019. Bmn: Boundary-matching network for temporal action proposal generation. Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 3889-98.
    [16]
    Zhao Y. 2017. Temporal action detection with structured segment networks. Proceedings of the IEEE International Conference on Computer Vision. pp. 2914-23.
    [17]
    Wang L, Xiong Y, and Wang Z. 2016. Temporal segment networks: Towards good practices for deep action recognition. European conference on computer vision, Springer. pp. 20-36.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCBN '22: Proceedings of the 10th International Conference on Communications and Broadband Networking
    February 2022
    82 pages
    ISBN:9781450387439
    DOI:10.1145/3538806
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 14 July 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Abnormal action detection
    2. Deep-learning
    3. Temporal detection

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICCBN 2022

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 33
      Total Downloads
    • Downloads (Last 12 months)14
    • Downloads (Last 6 weeks)3

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media