Object Detection - Deep Learning: Jamia Hamdard
Object Detection - Deep Learning: Jamia Hamdard
Object Detection - Deep Learning: Jamia Hamdard
Submitted by
PRASHANT SHARMA
ENROLL. NO. 2016-310-073
Bachelor of Technology
JAMIA HAMDARD
New Delhi-110062
(2020)
1
DECLARATION
(PRASHANT SHARMA)
Place: DELHI
ACKNOWLEDGEMENT
I have taken efforts in this project. However, it would not have been
possible without the kind support and help of my mentor. I would like to extend
my sincere thanks to my mentor.
Last but not least, we would like to express our deep sense of gratitude
and earnest thanksgiving to our dear parents for their moral support and
heartfelt cooperation in doing the main project.
(PRASHANT SHARMA)
Place: DELHI
Contents
1. ABSTRACT
2. INTRODUCTION
2.1 Problem Statement
2.2 Application
2.3 Challenges
2.4 Literature Survey
3. EXPERIMENTAL RESULTS
3.1 Dataset
3.2 Implementation Details
3.2.1 Pre-processing
3.2.2 Network
4. PROJECT PLAN
4.1 Feasibility study
4.1.1 Operation
4.1.2 Technical
4.1.3 Cost/Benefit Analysis
4.2 System Analysis
4.3 System Requirement
5. CONCLUSION
Efficient and accurate object detection has been an important topic in the
advancement of computer vision systems. With the advent of deep learning
techniques, the accuracy for object detection has increased drastically. The
project aims to incorporate state-of-the-art technique for object detection with
the goal of achieving high accuracy with a real-time performance. A major
challenge in many of the object detection systems is the dependency on other
computer vision techniques for helping the deep learning-based approach,
which leads to slow and non-optimal performance.
(a) Surveillance
(b) Autonomous vehicles
3.1 Dataset
Figure 3: Dataset
3.2 Implementation Details
Ground Truth
Prediction
Table 2
3.4 Quantitative Analysis
Hardware Requirements:
Computer with Intel processor.
2GB RAM
500GB hard disk drive
Software Requirements:
Windows 7 operating system.
Python 3
Juypter Notebook
Software Development:
Using Python
Python Requirements:
OpenCV
Imutils
Dlib
Skit-learn
Skit-image
Tensorflow
Keras
Mxnet
5. CONCLUSION
The feature extracted is sufficient for recognizing the object and marking
the location of the object. x the proposed classifier is able to recognize the
object in less computational cost.
Along with the local features, the width and height of the object
computed through projection method is used.
As a scope for future enhancement,
Features either the local or global used for recognition can be increased,
to increase the efficiency of the object recognition system.
The colour information in the image can be used for recognition of the
object. Colour based object recognition plays vital role in Robotics. Although
the visual tracking algorithm proposed here is robust in many of the conditions,
it can be made more robust by eliminating some of the limitations as listed
below:
In the Single Visual tracking, the size of the template remains fixed for
tracking. If the size of the object reduces with the time, the background
becomes more dominant than the object being tracked. In this case the
object may not be tracked.
Splitting and merging cannot be handled very well in all conditions using
the single camera due to the loss of information of a 3D object projection
in 2D images.
In this research work, the object Identification and Visual Tracking has
been done through the use of ordinary camera. The concept is well
extendable in applications like Intelligent Robots, Automatic Guided
Vehicles, Enhancement of Security Systems to detect the suspicious
behavior along with detection of weapons, identify the suspicious
movements of enemies on boarders with the help of night vision cameras
and many such applications.
[1] Ross Girshick, Je_ Donahue, Trevor Darrell, and Jitendra Malik. Rich
feature hierarchies for accurate object detection and semantic
segmentation. In The IEEE Conference on Computer Vision and Pattern
Recognition (CVPR), 2014.
[2] Ross Girshick. Fast R-CNN. In International Conference on Computer
Vision (ICCV),2015.
[3] Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. Faster R-CNN:
Towards real-time object detection with region proposal networks. In
Advances in Neural Information Processing Systems (NIPS), 2015.
[4] Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. You
only look once: Unified, real-time object detection. In The IEEE
Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
[5] Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott
Reed, Cheng-Yang Fu, and Alexander C. Berg. SSD: Single shot
multibox detector. In ECCV, 2016.
[6] Karen Simonyan and Andrew Zisserman. Very deep convolutional
networks for large-scale image recognition. arXiv preprint
arXiv:1409.1556, 2014
26