Research Paper G19
Abstract – Over the past few decades, the world has witnessed a great deal of terrorist and criminal activity. Public surveillance systems have gained importance as a response to these activities. Various state governments have started to install cameras in their densely populated and busy cities in order to safeguard their citizens.

To cover a complete city with a surveillance system, thousands of cameras are needed, and hundreds of active security personnel are required to monitor their live video feeds. Most commonly, one security officer monitors around 6 to 8 cameras, checking whether any person is engaging in criminal activity or carrying a weapon (because most criminal activities involve weapons). This is a monotonous and tiring task, and one lapse in concentration can endanger many lives. An efficient way to monitor these video feeds and add an extra layer of security is therefore required.

This research paper deals with an optimised, efficient and fast way to detect commonly used weapons such as the AK47, hand revolver, pistol, knife and grenade in a live video feed and to notify security personnel immediately.

Keywords – Object Detection, Neural Network, Deep Learning

I. INTRODUCTION

A. COMPUTER VISION

Computer Vision is an interdisciplinary branch of Computer Science and Digital Image Processing that enables computers to interpret and understand visual data (images or videos).

B. DIGITAL IMAGE PROCESSING

Digital Image Processing (DIP) deals with the manipulation of digital data (images or video) with the help of a digital computer and image processing algorithms such as Canny edge detection, error diffusion and the Hough transform, to produce digital maps in which specific features are highlighted.

C. DEEP LEARNING

Deep learning is an attempt by computer scientists to mimic the behaviour of the human brain. It is a prominent field of Machine Learning that uses Artificial Neural Networks (ANNs) with multiple hidden layers to extract higher-level features from data.

D. OBJECT DETECTION

Object Detection is a field of Machine Learning associated with Computer Vision and Image Processing which deals with detecting instances of objects of a certain class (such as cars, buildings, humans or weapons) that share similar features. Object detection can be implemented using either ML-based or DL-based algorithms, depending on the complexity of the object class.

E. METHODS OF OBJECT DETECTION

Object Detection methods are generally categorised into either an ML-based approach
or a DL-based approach, depending on the complexity of the object class. For Machine Learning approaches, features must be defined beforehand using methods such as Haar cascades or SIFT, followed by a Support Vector Machine (SVM) for object classification. Deep Learning techniques use Artificial Neural Networks to perform end-to-end object detection without features being defined explicitly.

II. LITERATURE REVIEW

In 2001, Paul Viola and Michael Jones proposed the first robust, efficient and real-time Machine Learning based object detection framework in their paper “Rapid Object Detection using a Boosted Cascade of Simple Features”. This framework can be trained to detect a variety of objects using a large number of positive images (images that contain the object the classifier should identify) and negative images (images without the object that needs to be identified). However, this approach cannot be used to detect complex objects in different orientations and sizes.

The emergence and rapid development of Deep Learning overcomes these drawbacks through Artificial Neural Networks. With a Deep Learning based object detection framework, computers are able to learn more complex features.

Deep Learning based object detection frameworks are mainly of two types:

a. Region proposal based frameworks, which include models like RCNN, FRCNN and Faster RCNN.

b. Regression based frameworks, which include models like YOLO and SSD.

Region proposal based algorithms use a sliding window approach to extract features from the visual data. In 2014, Ross Girshick presented the RCNN model based on this algorithm, which obtained a mAP of 53.3%, an improvement of about 30% over earlier results on the PASCAL VOC dataset. In the follow-up Fast RCNN (FRCNN) model, the whole image is processed with a Convolutional Neural Network to produce a feature map, and a fixed-length feature vector is then extracted from each region proposal by a Region of Interest (RoI) pooling layer.

III. PROPOSED WORK

Flowchart

Scraping Images of Commonly used Weapons

In order to build a Deep Learning based object detection model, we scraped almost 5000 images of commonly used weapons such as the AK47, hand pistol, revolver, shotgun and combat knife in various sizes and orientations, which were later pre-processed to build the dataset. These images were gathered from different sources available on the internet and were then used for training with the help of software.
$$\text{Recall} = \frac{TP}{TP + FN}$$
where
TP = True Positives (an object is predicted as present and the prediction is correct)
FN = False Negatives (an object is present but the model failed to predict it)
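As a small illustration of the recall formula, the following sketch computes recall from TP and FN counts. The function name and the example counts are hypothetical and are not measurements from this work.

```python
def recall(true_positives: int, false_negatives: int) -> float:
    """Recall = TP / (TP + FN).

    Returns 0.0 when there are no ground-truth objects at all
    (TP + FN == 0), which is a convention assumed for this sketch.
    """
    total_actual = true_positives + false_negatives
    if total_actual == 0:
        return 0.0
    return true_positives / total_actual


# Hypothetical counts: 80 weapons detected correctly, 20 missed.
example_recall = recall(true_positives=80, false_negatives=20)
print(f"Recall: {example_recall:.2f}")
```

A recall close to 1 means that few weapons present in the frames are missed, which is the failure mode that matters most in a surveillance setting.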
The Average Precision (AP) is the area under the Precision vs Recall curve. Mean Average Precision (mAP) is the mean of the AP values over all object classes.
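The definitions of AP and mAP can be sketched as follows. This is a minimal illustration that approximates the area under the curve with the trapezoidal rule. Benchmarks such as PASCAL VOC use interpolated variants, so the exact evaluation procedure used here is an assumption. The function names, curve points and class names are hypothetical.

```python
def average_precision(recalls, precisions):
    """Approximate the area under a precision-recall curve.

    The (recall, precision) points are sorted by recall and the area
    is accumulated with the trapezoidal rule.
    """
    points = sorted(zip(recalls, precisions))
    area = 0.0
    for (r0, p0), (r1, p1) in zip(points, points[1:]):
        area += (r1 - r0) * (p0 + p1) / 2.0
    return area


def mean_average_precision(ap_per_class):
    """mAP is the arithmetic mean of the per-class AP values."""
    if not ap_per_class:
        raise ValueError("at least one class is required")
    return sum(ap_per_class.values()) / len(ap_per_class)


# Hypothetical precision-recall points for one class.
ap_pistol = average_precision([0.0, 0.5, 1.0], [1.0, 1.0, 0.0])

# Hypothetical per-class AP values.
m_ap = mean_average_precision({"pistol": ap_pistol, "knife": 0.25})
print(f"AP (pistol): {ap_pistol:.2f}, mAP: {m_ap:.2f}")
```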
Fig: Graph of GPU time vs Average Precision for YOLO model

Analysis of Average Precision (AP) and GPU time to process one frame (in ms)
Model                                          mAP Score   GPU Time (ms)
RCNN (Regional Convolutional Neural Network)   33.325      874
SSD (Single Shot Multibox Detector)            20.58       86.8
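To relate the GPU times in the table to live video, the per-frame time can be converted to frames per second. The sketch below does this for the two rows listed in the table. The dictionary and function names are hypothetical, and the calculation assumes frames are processed one after another with no other overhead.

```python
# GPU time per frame in milliseconds, copied from the table above.
GPU_TIME_MS = {"RCNN": 874.0, "SSD": 86.8}


def frames_per_second(ms_per_frame: float) -> float:
    """Convert a per-frame processing time in milliseconds to frames per second."""
    if ms_per_frame <= 0:
        raise ValueError("ms_per_frame must be positive")
    return 1000.0 / ms_per_frame


for model, ms in GPU_TIME_MS.items():
    print(f"{model}: {frames_per_second(ms):.2f} frames per second")
```

Under these figures RCNN handles a little over one frame per second and SSD roughly eleven and a half, which shows why GPU time matters as much as mAP for a live feed.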
V. CONCLUSION
... object was the fastest of the models, so it can be a better choice if a fast object detector is needed at the cost of accuracy. RCNN and YOLO gave similar mAP scores of about 33 and 34.2 respectively, which are the better accuracies for detecting the object. However, the trained YOLO model is comparatively faster than the RCNN model, making it an efficient and reliable object detection model.

... around the area, which will alert a nearby hospital to avoid any fatality.

D. Another possible application could be the detection of fire at any place, which upon detection can be reported directly to a fire department, ensuring that there is minimum damage around that area.

E. The system can also be used to monitor traffic, with the cameras detecting vehicles that break any law and reporting them to a traffic control department, helping it to resolve traffic issues.
F. One limitation of our project is that there is currently no way to detect a weapon that a criminal has hidden in a pocket or suitcase. We are exploring ways to overcome this problem and build a better and safer environment for citizens.
VII. REFERENCES