Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3376067.3376101acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvipConference Proceedingsconference-collections
research-article

Dangerous Driving Behavior Detection with Attention Mechanism

Published: 25 February 2020 Publication History

Abstract

In order to reduce the incidence of traffic accidents caused by dangerous driving, a dangerous driving behavior recognition model based on convolutional neural network (CNN) and long short-term memory network (LSTM) is proposed. Aiming at the problem of low accuracy of the network model identification, the algorithm is optimized by introducing the unsupervised attention mechanism. The model focuses on a specific visual area and improves the recognition accuracy of the algorithm to some extent by integrating the attention weighted module and the convolution LSTM. The experimental results show that the detection accuracy and detection rate of the algorithm are improved compared with the Two-Stream method and C3D behavior recognition algorithm in the dangerous driving behavior recognition task.

References

[1]
Zhuang M K, Bai H F, Xie X F. A study on risky driving behavior and related factors[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2008, 44(3): 475--482. DOI= https://doi.org/10.13209/j.0479-8023.2008.074
[2]
Simonyan K, Zisserman A. Two-stream convolutional networks for action recognition in videos[C]//Advances in neural information processing systems. 2014: 568--576. Retrieved from https://arxiv.org/pdf/1406.2199.pdf
[3]
Wang L, Xiong Y, Wang Z, et al. Temporal segment networks: Towards good practices for deep action recognition[C]//European conference on computer vision. Springer, Cham, 2016: 20--36. DOI= https://doi.org/10.1007/978-3-319-46484-8_2
[4]
Tran D, Bourdev L, Fergus R, et al. Learning spatiotemporal features with 3d convolutional networks[C]//Proceedings of the IEEE international conference on computer vision. 2015: 4489--4497. DOI= https://doi.org/10.1109/iccv.2015.510
[5]
Ma C Y, Chen M H, Kira Z, et al. Ts-lstm and temporal-inception: Exploiting spatiotemporal dynamics for activity recognition[J]. Signal Processing: Image Communication, 2019, 71: 76--87. DOI= https://doi.org/10.1016/j.image.2018.09.003
[6]
Abtahi S, Omidyeganeh M, Shirmohammadi S, et al. YawDD: A yawning detection dataset[C]//Proceedings of the 5th ACM Multimedia Systems Conference. ACM, 2014: 24--28. DOI= https://doi.org/10.1145/2557642.2563678
[7]
Sandler M, Howard A, Zhu M, et al. Mobilenetv2: Inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 4510--4520. DOI= https://doi.org/10.1109/cvpr.2018.00474
[8]
He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 770--778. DOI= https://doi.org/10.1109/cvpr.2016.90
[9]
Olah C. Understanding lstm networks[J]. 2015. Retrieved from https://colah.github.io/posts/2015-08-Understanding-LSTMs/
[10]
Sharma S, Kiros R, Salakhutdinov R. Action recognition using visual attention[J]. arXiv preprint arXiv:1511.04119, 2015. Retrieved from https://arxiv.org/pdf/1511.04119.pdf
[11]
Xingjian S H I, Chen Z, Wang H, et al. Convolutional LSTM network: A machine learning approach for precipitation nowcasting[C]//Advances in neural information processing systems. 2015: 802--810. Retrieved from https://arxiv.org/pdf/1506.04214v1.pdf
[12]
Lin T Y, RoyChowdhury A, Maji S. Bilinear cnn models for fine-grained visual recognition[C]//Proceedings of the IEEE international conference on computer vision. 2015: 1449--1457. DOI= https://doi.org/10.1109/iccv.2015.170
[13]
Selvaraju R R, Cogswell M, Das A, et al. Grad-cam: Visual explanations from deep networks via gradient-based localization[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017: 618--626. DOI= https://doi.org/10.1109/iccv.2017.74

Cited By

View all
  • (2022)End-to-end deep learning-based framework for driver action recognition2022 International Conference on Multimedia Analysis and Pattern Recognition (MAPR)10.1109/MAPR56351.2022.9924944(1-6)Online publication date: Oct-2022
  • (2022)Multi-level Attention Fusion for Multimodal Driving Maneuver Recognition2022 IEEE International Symposium on Circuits and Systems (ISCAS)10.1109/ISCAS48785.2022.9937710(2609-2613)Online publication date: 28-May-2022
  • (2021)Soft Spatial Attention-Based Multimodal Driver Action Recognition Using Deep LearningIEEE Sensors Journal10.1109/JSEN.2020.301925821:2(1918-1925)Online publication date: 15-Jan-2021

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICVIP '19: Proceedings of the 3rd International Conference on Video and Image Processing
December 2019
270 pages
ISBN:9781450376822
DOI:10.1145/3376067
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • Shanghai Jiao Tong University: Shanghai Jiao Tong University
  • Xidian University
  • TU: Tianjin University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 February 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Attention mechanism
  2. Behavior recognition
  3. CNN
  4. Dangerous driving
  5. LSTM

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICVIP 2019

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)End-to-end deep learning-based framework for driver action recognition2022 International Conference on Multimedia Analysis and Pattern Recognition (MAPR)10.1109/MAPR56351.2022.9924944(1-6)Online publication date: Oct-2022
  • (2022)Multi-level Attention Fusion for Multimodal Driving Maneuver Recognition2022 IEEE International Symposium on Circuits and Systems (ISCAS)10.1109/ISCAS48785.2022.9937710(2609-2613)Online publication date: 28-May-2022
  • (2021)Soft Spatial Attention-Based Multimodal Driver Action Recognition Using Deep LearningIEEE Sensors Journal10.1109/JSEN.2020.301925821:2(1918-1925)Online publication date: 15-Jan-2021

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media