Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2996890.3007881acmotherconferencesArticle/Chapter ViewAbstractPublication PagesuccConference Proceedingsconference-collections
short-paper

Real-time target detection and recognition with deep convolutional networks for intelligent visual surveillance

Published: 06 December 2016 Publication History

Abstract

Moving target detection and tracking, recognition, behaviors analysis are the key issues in the intelligent visual surveillance system (IVSS). The challenge is how to process the real-time video stream in an effective way in case that we could find the interested objects for analysis. However, the traditional video surveillance technology often does not meet the needs of real-time key frame recognition for the on-line intelligent video monitoring system. In our paper, we adopt the state-of-the-art Faster R-CNN [7] that takes advantages of convolutional neural networks into our real-time target recognition system - Deep Intelligent Visual Surveillance (DIVS). The key aspects of our DIVS are consisted of four parts: (i) Getting the real-time video images from remote cameras; (ii) Processing the data with the deep learning framework caffe [23] built for Faster R-CNN; (iii) Storing the valuable data with MySQL; (iv) Data presentation on the website. Experiments based on our system validated the effectiveness, stability and accuracy of our proposed solutions.

References

[1]
K. Lenc and A. Vedaldi. R-CNN minus R. British Machine Vision Conference (BMVC), 2015.
[2]
A. Krizhevsky, I. Sutskever, and G. Hinton. "Imagenet classification with deep convolutional neural networks," in Neural Information Processing Systems (NIPS), 2012.
[3]
O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei. "ImageNet Large Scale Visual Recognition Challenge," in International Journal of Computer Vision (IJCV), 2015.
[4]
A. Andreopoulos and J. K. Tsotsos. 50 years of object recognition. 2013,117(8): 827--891
[5]
R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In CVPR, 2014.
[6]
R. Girshick. Fast R-CNN. arXiv:1504.08083, 2015.
[7]
S. Ren, K. He, R. Girshick, and J. Sun. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. NIPS 2015: 91--99
[8]
A. Coates, B. Huval, T. Wang, D. J. Wu and A. Y. Ng. Deep learning with COTS HPC systems.
[9]
K. He, X. Zhang, S. Ren and J. Sun. Spatial pyramid pooling in deep convolutional networks for visual recognition. In ECCV, 2014.
[10]
Grauman, K., Darrell, T.: The pyramid match kernel: Discriminative classification with sets of image features. In: ICCV (2005)
[11]
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR (2006)
[12]
J. R. Uijlings, K. E. vandeSande, T. Gevers and A. W. Smeulders. Selective search for object recognition. IJCV, 2013.
[13]
D. Erhan, C. Szegedy, A. Toshey and D. Angeloy. Scalable object detection using deep neural networks
[14]
P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus and Y. LeCun. Overfeat: Integrated recognition, localization and detection using convolutional networks. In ICLR, 2014.
[15]
J. Long, E. Shelhamer, and T. Darrell, "Fully convolutional networks for semantic segmentation," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
[16]
Zeiler M D, Fergus R. Visualizing and Understanding Convolutional Networks{J}. Lecture Notes in Computer Science, 2013.
[17]
Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel, "Backpropagation applied to handwritten zip code recognition," Neural computation, 1989.
[18]
https://www.cntk.ai/, 2016.
[19]
https://www.tensorflow.org/, 2015.
[20]
https://developer.nvidia.com/cuda-toolkit, 2016.
[21]
https://developer.nvidia.com/cudnn, 2016.
[22]
C. Szegedy, A. Toshev, and D. Erhan. Deep neural networks for object detection. In NIPS, 2013.
[23]
Y. Jia. Caffe: An open source convolutional architecture for fast feature embedding. http://caffe.berkeleyvision.org/, 2013.
[24]
M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman, "The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Results," 2007.

Cited By

View all
  • (2024)Intelligent Surveillance System Using Deep LearningProceedings of Data Analytics and Management10.1007/978-981-99-6547-2_31(405-416)Online publication date: 3-Jan-2024
  • (2023)Domain Adaptation: Challenges, Methods, Datasets, and ApplicationsIEEE Access10.1109/ACCESS.2023.323702511(6973-7020)Online publication date: 2023
  • (2023)Integrating fluid–solid coupling domain knowledge with deep learning models: An automatic and interpretable diagnostic system for the silting disease of drainage pipelinesTunnelling and Underground Space Technology10.1016/j.tust.2023.105386142(105386)Online publication date: Dec-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
UCC '16: Proceedings of the 9th International Conference on Utility and Cloud Computing
December 2016
549 pages
ISBN:9781450346160
DOI:10.1145/2996890
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 December 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. caffe
  2. convolutional neural networks
  3. faster r-cnn
  4. intelligent visual surveillance
  5. real-time
  6. target recognition

Qualifiers

  • Short-paper

Conference

UCC '16

Acceptance Rates

Overall Acceptance Rate 38 of 125 submissions, 30%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Intelligent Surveillance System Using Deep LearningProceedings of Data Analytics and Management10.1007/978-981-99-6547-2_31(405-416)Online publication date: 3-Jan-2024
  • (2023)Domain Adaptation: Challenges, Methods, Datasets, and ApplicationsIEEE Access10.1109/ACCESS.2023.323702511(6973-7020)Online publication date: 2023
  • (2023)Integrating fluid–solid coupling domain knowledge with deep learning models: An automatic and interpretable diagnostic system for the silting disease of drainage pipelinesTunnelling and Underground Space Technology10.1016/j.tust.2023.105386142(105386)Online publication date: Dec-2023
  • (2020)Transfer Learning and Deep Domain AdaptationAdvances and Applications in Deep Learning [Working Title]10.5772/intechopen.94072Online publication date: 29-Oct-2020
  • (2019)A Novel Low Processing Time System for Criminal Activities Detection Applied to Command and Control Citizen Security CentersInformation10.3390/info1012036510:12(365)Online publication date: 24-Nov-2019
  • (2019)Obstacle detection and tracking method for autonomous vehicle based on three-dimensional LiDARInternational Journal of Advanced Robotic Systems10.1177/172988141983158716:2Online publication date: 20-Mar-2019
  • (2018)The Study of Surface State Identification Based on BP_adaboost Algorithm2018 37th Chinese Control Conference (CCC)10.23919/ChiCC.2018.8483380(5865-5870)Online publication date: Jul-2018
  • (2018)Bird Eyes: A Cloud-Based Object Detection System for Customisable Surveillance2018 International Conference on Image and Vision Computing New Zealand (IVCNZ)10.1109/IVCNZ.2018.8634751(1-6)Online publication date: Nov-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media