Object Detection Using TensorFlow

This document provides an overview of object detection with TensorFlow. It begins by defining object detection as identifying objects in images and localizing them with bounding boxes. Classical and deep learning approaches to object detection are discussed, including R-CNN, Fast R-CNN, YOLO, and Faster R-CNN. The TensorFlow Object Detection API is introduced for preparing data, training and evaluating models. CIFAR-10 is used as an example dataset. Key steps include creating TFRecords from annotated images and using the train.py and eval.py scripts. Applications of object detection discussed include facial recognition, self-driving cars, and security.

Uploaded by

Hari Vamshi

Object Detection with TensorFlow

o D. HARI VAMSHI
o V. RAJU
o U. LAXMAN
Agenda

Ø Intro
Ø What is Object Detection
Ø State of Object Detection
Ø TensorFlow Object Detection API
Ø Preparing Data
Ø Training & Evaluating Models
Ø Links
What is Object Detection
Object detection is the task of identifying objects in an image and drawing bounding boxes around them, i.e. localizing them. It is a very important problem in computer vision due to its numerous applications, from self-driving cars to security and tracking.
Object detection = Object Classification + Object Localization
Approaches

▪ Classical approach (Haar features) - the first real-time object detection framework (Viola-Jones)
▪ Deep learning approach - deep learning is a subset of machine learning in artificial intelligence (AI) that uses multi-layer neural networks to learn features directly from data, including unstructured data such as images. It is also known as deep neural learning or deep neural networks. A few of the approaches used are:
▪ OverFeat
▪ R-CNN
▪ Fast R-CNN
▪ YOLO
▪ Faster R-CNN
▪ SSD and R-FCN
Deep learning approach
OverFeat - published in 2013, a multi-scale sliding window algorithm using Convolutional Neural Networks (CNNs).

R-CNN - Regions with CNN features. Three-stage approach:
- Extract possible objects using a region proposal method (the most popular one being Selective Search).
- Extract features from each region using a CNN.
- Classify each region with SVMs.
Deep learning approach
Fast R-CNN - Similar to R-CNN, it uses Selective Search to generate object proposals. Instead of extracting features for each proposal independently and classifying them with SVMs, it applies the CNN once to the complete image, then uses Region of Interest (RoI) Pooling on the feature map, followed by a final feed-forward network for classification and bounding box regression.
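
To make the RoI Pooling step concrete, here is a minimal sketch in plain Python. It is an illustration only, not the Fast R-CNN implementation: the function name `roi_max_pool`, the (y0, x0, y1, x1) region layout with exclusive end indices, and the toy 4x4 feature map are all assumptions made for this example. The idea it shows is that a region of any size is divided into a fixed grid of bins and the maximum of each bin is kept, so every proposal yields an output of the same shape.

```python
def roi_max_pool(feature_map, roi, out_h, out_w):
    """Max-pool one region of a 2-D feature map into an out_h x out_w grid.

    roi is (y0, x0, y1, x1) in feature-map cells, end indices exclusive.
    This layout is an assumption chosen for the sketch.
    """
    y0, x0, y1, x1 = roi
    h, w = y1 - y0, x1 - x0
    pooled = []
    for i in range(out_h):
        # Bin i covers rows floor(i*h/out_h) .. ceil((i+1)*h/out_h), at least one row.
        r_start = y0 + (i * h) // out_h
        r_end = max(y0 + ((i + 1) * h + out_h - 1) // out_h, r_start + 1)
        row = []
        for j in range(out_w):
            c_start = x0 + (j * w) // out_w
            c_end = max(x0 + ((j + 1) * w + out_w - 1) // out_w, c_start + 1)
            # Keep only the strongest activation inside the bin.
            row.append(max(feature_map[r][c]
                           for r in range(r_start, r_end)
                           for c in range(c_start, c_end)))
        pooled.append(row)
    return pooled


# Toy feature map (hypothetical values).
feature_map = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]

whole = roi_max_pool(feature_map, (0, 0, 4, 4), 2, 2)   # 4x4 region -> 2x2
corner = roi_max_pool(feature_map, (1, 1, 4, 4), 2, 2)  # 3x3 region -> 2x2
print(whole, corner)
```

Both regions, although different in size, come out as 2x2 grids, which is what lets a single feed-forward head process every proposal.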

YOLO - You Only Look Once: a simple convolutional neural network approach that achieves both great results and high speed, making real-time object detection with deep networks possible for the first time.
Deep learning approach
Faster R-CNN - Faster R-CNN added a Region Proposal Network (RPN) in order to get rid of the Selective Search algorithm and make the model completely trainable end-to-end.

SSD and R-FCN
Finally, there are two notable papers: Single Shot Detector (SSD), which takes on YOLO by using convolutional feature maps of multiple sizes to achieve better results and speed, and Region-based Fully Convolutional Networks (R-FCN), which takes the architecture of Faster R-CNN but uses only convolutional networks.
Introduction
TensorFlow is a free and open-source software library for dataflow and differentiable programming across a range of tasks. It is a symbolic math library, and is also used for machine learning applications such as neural networks. We prepare the data and train the models with the help of the TensorFlow Object Detection API.
Creating a dataset
▪ We can either create a dataset of our own or take a predefined dataset and work on it with the TensorFlow package.

▪ The dataset considered in this project is CIFAR-10, which consists of 60,000 32x32 colour images classified into 10 classes. CIFAR-10 provides class labels only, so bounding-box annotations must be added before it can be used for detection.
Dataset

Ø The TensorFlow Object Detection API uses the TFRecord file format.
Ø Scripts are available to convert the PASCAL VOC and Oxford Pet formats.
Ø In other cases, an explanation of the format is available in the git repo.
Ø Input data to create a TFRecord - annotated images.

The dataset being considered in this module is CIFAR-10.
Creating TFRecord
▪ The TensorFlow Object Detection API repo contains a dataset_tools folder with scripts to convert common data structures into TFRecord.

▪ The images can be written into a TFRecord when the input files are JPEG or PNG images; each annotated image is stored as one record in TensorFlow.
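
As a rough illustration of what one such record holds, the sketch below builds the per-image feature dictionary in plain Python. The helper name `build_detection_features` and the input layout are hypothetical. The key names follow the convention used by the API's dataset_tools scripts as far as I recall it, so check them against the repo before relying on them. In a real pipeline each value would be wrapped in a `tf.train.Feature`, packed into a `tf.train.Example`, and written with a TFRecord writer; that step is left out so the sketch runs without TensorFlow.

```python
def build_detection_features(filename, width, height, encoded_image, boxes):
    """Collect the fields of one annotated image (hypothetical helper).

    boxes: list of (xmin, ymin, xmax, ymax, class_text, class_id) in pixels.
    Box coordinates are normalized to [0, 1] by the image size.
    """
    image_format = filename.rsplit(".", 1)[-1].lower()
    if image_format == "jpg":
        image_format = "jpeg"
    return {
        "image/height": height,
        "image/width": width,
        "image/filename": filename,
        "image/encoded": encoded_image,  # raw JPEG or PNG bytes
        "image/format": image_format,
        "image/object/bbox/xmin": [b[0] / width for b in boxes],
        "image/object/bbox/ymin": [b[1] / height for b in boxes],
        "image/object/bbox/xmax": [b[2] / width for b in boxes],
        "image/object/bbox/ymax": [b[3] / height for b in boxes],
        "image/object/class/text": [b[4] for b in boxes],
        "image/object/class/label": [b[5] for b in boxes],
    }


# Made-up example: a 200x100 image with one annotated box.
features = build_detection_features(
    "cat.jpg", 200, 100, b"<image bytes>", [(20, 10, 120, 60, "cat", 1)]
)
print(features["image/object/bbox/xmin"], features["image/format"])
```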
Max-norm - The maximum norm, also called max-norm, is a popular weight constraint because it is less aggressive than other norms such as the unit norm: it simply sets an upper bound on the norm of the weights.
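
The difference from the unit norm can be seen in a few lines of plain Python. This is a sketch under assumptions: the function name `apply_max_norm` is made up, and the constraint is applied to a single weight vector using its Euclidean norm. Keras offers a comparable constraint as `tf.keras.constraints.MaxNorm`, which is not used here so the example runs without TensorFlow.

```python
import math


def apply_max_norm(weights, max_value):
    """Rescale a weight vector only if its L2 norm exceeds max_value."""
    norm = math.sqrt(sum(w * w for w in weights))
    if norm <= max_value:
        # Within the bound: weights are left untouched. A unit-norm
        # constraint would rescale them to norm 1 in every case.
        return list(weights)
    scale = max_value / norm
    return [w * scale for w in weights]


untouched = apply_max_norm([3.0, 4.0], 10.0)  # norm 5 is under the bound
clipped = apply_max_norm([3.0, 4.0], 2.5)     # norm 5 is scaled down to 2.5
print(untouched, clipped)
```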

TRAINING DATA
One model for two tasks?

Object classification - the output is one number: the index of a class.
Object localization - the output is four numbers: the coordinates of the bounding box.

A single model can produce both as one target vector:
Po - whether an object exists
bx1, bx2, by1, by2 - bounding box coordinates
c1, c2, c3, …, cn - the object's class variables
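
The combined target can be sketched as a small Python helper. The function name `make_target`, the one-hot encoding of c1..cn, and the choice to zero out the remaining entries when no object is present are assumptions made for illustration; the ordering Po, bx1, bx2, by1, by2, c1..cn follows the slide.

```python
def make_target(num_classes, box=None, class_index=None):
    """Build [Po, bx1, bx2, by1, by2, c1, ..., cn] for one training image.

    box is (bx1, bx2, by1, by2); class_index is 0-based.
    With no object, Po is 0 and the other entries are set to 0 here
    (they would normally be ignored by the loss).
    """
    if box is None:
        return [0.0] * (5 + num_classes)
    bx1, bx2, by1, by2 = box
    one_hot = [1.0 if k == class_index else 0.0 for k in range(num_classes)]
    return [1.0, bx1, bx2, by1, by2] + one_hot


with_object = make_target(3, box=(0.2, 0.6, 0.1, 0.5), class_index=1)
background = make_target(3)
print(with_object)
print(background)
```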
Selecting a model
The TensorFlow OD API provides a collection of detection models pre-trained on the COCO dataset, the KITTI dataset, and the Open Images dataset.

- Model name - corresponds to the config file that was used to train the model.
- Speed - running time in ms per 600x600 image.
- mAP - mean average precision, which indicates how well the model performed on the COCO dataset.
- Output types - Boxes, and Masks if applicable.
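
To give a feel for what the mAP column measures, here is a simplified sketch in plain Python. It is not the COCO evaluation code: it assumes each detection has already been marked as a true or false positive (normally decided by overlap with a ground-truth box), uses un-interpolated average precision, and ignores the averaging over several overlap thresholds that the COCO metric performs. The function names and the sample scores are made up.

```python
def average_precision(scored_hits, num_ground_truth):
    """AP for one class.

    scored_hits: list of (confidence, is_true_positive) pairs.
    num_ground_truth: number of annotated objects of this class.
    """
    if num_ground_truth == 0:
        return 0.0
    ranked = sorted(scored_hits, key=lambda d: d[0], reverse=True)
    true_positives = 0
    precision_sum = 0.0
    for rank, (_, is_tp) in enumerate(ranked, start=1):
        if is_tp:
            true_positives += 1
            # Precision at the rank of each correctly detected object.
            precision_sum += true_positives / rank
    return precision_sum / num_ground_truth


def mean_average_precision(per_class):
    """per_class maps class name -> (scored_hits, num_ground_truth)."""
    aps = [average_precision(hits, n) for hits, n in per_class.values()]
    return sum(aps) / len(aps)


per_class = {
    "cat": ([(0.9, True), (0.8, False), (0.7, True)], 2),
    "dog": ([(0.6, True)], 1),
}
cat_ap = average_precision(*per_class["cat"])
map_score = mean_average_precision(per_class)
print(round(cat_ap, 4), round(map_score, 4))
```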
Training & Evaluating
# From the tensorflow/models/research directory
python object_detection/train.py \
    --logtostderr \
    --pipeline_config_path=/tensorflow/models/object_detection/samples/configs/ssd_mobilenet_v1_pets.config \
    --train_dir=${PATH_TO_ROOT_TRAIN_FOLDER}

# From the tensorflow/models/research directory
python object_detection/eval.py \
    --logtostderr \
    --pipeline_config_path=${PATH_TO_YOUR_PIPELINE_CONFIG} \
    --checkpoint_dir=${PATH_TO_TRAIN_DIR} \
    --eval_dir=${PATH_TO_EVAL_DIR}
Facial Recognition:
A deep learning facial recognition system called "DeepFace" has been developed by a group of researchers at Facebook; it identifies human faces in a digital image very effectively. Google uses its own facial recognition system in Google Photos, which automatically groups photos by the person in the image. Facial recognition relies on several facial features, such as the eyes, nose, mouth and eyebrows.
Self-Driving Cars:

Self-driving cars are the future, there is no doubt about that. The way they work is complex, however: they combine a variety of techniques to perceive their surroundings, including radar, laser light, GPS, odometry, and computer vision.
Advanced control systems interpret the sensory information to identify appropriate navigation paths as well as obstacles. Once the image sensor detects any sign of a living being in the car's path, the car stops automatically. This happens very quickly and is a big step towards driverless cars.
Security: Object detection plays a very important role in security, be it Apple's Face ID or the retina scans seen in sci-fi movies.
It is also used by governments to match security camera feeds against their existing databases in order to find criminals or to detect a robber's vehicle.
The applications are limitless.
Links
▪ https://towardsdatascience.com/how-to-train-your-own-object-detector-with-tensorflows-object-detector-api-bec72ecfe1d9
▪ https://www.kdnuggets.com/2017/10/deep-learning-object-detection-comprehensive-review.html
▪ http://www.machinelearninguru.com/deep_learning/tensorflow/basics/tfrecord/tfrecord.html
▪ https://www.coursera.org/learn/convolutional-neural-networks
▪ https://medium.com/comet-app/review-of-deep-learning-algorithms-for-object-detection-c1f3d437b852
▪ https://towardsdatascience.com/evolution-of-object-detection-and-localization-algorithms-e241021d8bad
▪ https://medium.freecodecamp.org/how-to-play-quidditch-using-the-tensorflow-
ANY QUERIES!
