Object Detection with TensorFlow: D. Hari Vamshi, V. Raju, U. Laxman
o D. HARI VAMSHI
o V. RAJU
o U. LAXMAN
Agenda
➢ Intro
➢ What is Object Detection
➢ State of Object Detection
➢ TensorFlow Object Detection API
➢ Preparing Data
➢ Training & Evaluating Models
➢ Links
What is Object Detection
▪ Classical approach (Haar features) - first real-time OD framework (Viola-Jones)
▪ Deep learning approach - now state of the art in OD
▪ OverFeat
▪ R-CNN
▪ Fast R-CNN
▪ YOLO
▪ Faster R-CNN
▪ SSD and R-FCN
Deep learning approach
SSD and R-FCN
Finally, there are two notable papers: Single Shot
Detector (SSD), which takes on YOLO by using convolutional
feature maps of multiple sizes to achieve better
results and speed, and Region-based Fully
Convolutional Networks (R-FCN), which takes the
architecture of Faster R-CNN but uses only
convolutional networks.
Introduction
• LABEL MAPS:
Each dataset is required to have a label map associated with it. This label map defines a mapping
from string class names to integer class IDs. The label map should be a StringIntLabelMap text
protobuf; sample label maps can be found in object_detection/data.
• DATASET REQUIREMENTS:
For every example in your dataset, you should have the following information:
o An RGB image encoded as JPEG or PNG.
o A list of bounding boxes for the image. Each bounding box should contain:
   o The bounding box coordinates (with the origin in the top left corner), defined by 4 floating point
     numbers [ymin, xmin, ymax, xmax]. Note that we store the normalized coordinates (x /
     width, y / height) in the TFRecord dataset.
   o The class of the object in the bounding box.
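As a rough illustration of the two requirements above, the sketch below writes a small label map in the text protobuf layout and converts a pixel-space box to normalized [ymin, xmin, ymax, xmax]. The class names, image size, box values, and helper names are hypothetical, and the regex parser is only a simplified stand-in for the API's own label map utilities so that the example runs without TensorFlow installed.

```python
import re

# Hypothetical label map in StringIntLabelMap text format.
# Ids start at 1 here; id 0 is conventionally left for the background class.
LABEL_MAP_TEXT = """
item {
  id: 1
  name: 'cat'
}
item {
  id: 2
  name: 'dog'
}
"""


def parse_label_map(text):
    """Return {class name: id} from label map text (simplified parser)."""
    pattern = re.compile(r"item\s*\{\s*id:\s*(\d+)\s*name:\s*'([^']+)'\s*\}")
    return {name: int(idx) for idx, name in pattern.findall(text)}


def normalize_box(ymin, xmin, ymax, xmax, height, width):
    """Convert pixel coordinates (origin at top left) to the [0, 1] range."""
    return [ymin / height, xmin / width, ymax / height, xmax / width]


label_map = parse_label_map(LABEL_MAP_TEXT)
# A hypothetical 640x480 image with one box given in pixels.
box = normalize_box(ymin=120, xmin=160, ymax=360, xmax=480, height=480, width=640)
print(label_map)  # {'cat': 1, 'dog': 2}
print(box)        # [0.25, 0.25, 0.75, 0.75]
```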
Creating TFRecord
➢ The TensorFlow Object Detection API repo contains a dataset_tools folder with scripts
to convert common data structures into TFRecord.
➢ If your data has a different structure, you need to follow the few steps
described on the next slide.
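For data in a custom structure, the conversion might look like the sketch below, which packs one annotated image into a tf.train.Example and writes it to a TFRecord file. The feature keys are the ones commonly used with the Object Detection API, but they should be checked against the installed version; the file name, image bytes, box, and class are hypothetical placeholders.

```python
import os
import tempfile

import tensorflow as tf


def bytes_feature(values):
    return tf.train.Feature(bytes_list=tf.train.BytesList(value=values))


def int64_feature(values):
    return tf.train.Feature(int64_list=tf.train.Int64List(value=values))


def float_feature(values):
    return tf.train.Feature(float_list=tf.train.FloatList(value=values))


def create_tf_example(encoded_image, filename, height, width, boxes, names, ids):
    """Build one Example; boxes are normalized [ymin, xmin, ymax, xmax] lists."""
    name_bytes = filename.encode("utf8")
    feature = {
        "image/height": int64_feature([height]),
        "image/width": int64_feature([width]),
        "image/filename": bytes_feature([name_bytes]),
        "image/source_id": bytes_feature([name_bytes]),
        "image/encoded": bytes_feature([encoded_image]),
        "image/format": bytes_feature([b"jpeg"]),
        "image/object/bbox/ymin": float_feature([b[0] for b in boxes]),
        "image/object/bbox/xmin": float_feature([b[1] for b in boxes]),
        "image/object/bbox/ymax": float_feature([b[2] for b in boxes]),
        "image/object/bbox/xmax": float_feature([b[3] for b in boxes]),
        "image/object/class/text": bytes_feature([n.encode("utf8") for n in names]),
        "image/object/class/label": int64_feature(ids),
    }
    return tf.train.Example(features=tf.train.Features(feature=feature))


# Hypothetical single-image dataset; the bytes are a placeholder, not a real JPEG.
example = create_tf_example(
    encoded_image=b"<jpeg bytes>", filename="image1.jpg", height=480, width=640,
    boxes=[[0.25, 0.25, 0.75, 0.75]], names=["cat"], ids=[1])

# Write to a temporary directory so the sketch leaves no files behind in the project.
record_path = os.path.join(tempfile.mkdtemp(), "train.record")
with tf.io.TFRecordWriter(record_path) as writer:
    writer.write(example.SerializeToString())
```

In a real conversion script, the call to create_tf_example would sit inside a loop over all annotated images, with one writer.write call per image.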
One model for two tasks?
▪ Object classification - the output is one number (the index of a class).
▪ Object localization - the output is four numbers (the coordinates of the bounding box).
Output vector of the combined model:
Po - whether an object exists
bx1, bx2, by1, by2 - bounding box coordinates
c1, c2, c3, …, cn - object's variables
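The combined output can be pictured as one target vector per image. The sketch below builds such a vector in the order [Po, bx1, bx2, by1, by2, c1, …, cn]; the one-hot class encoding, the all-zero vector for "no object", and the function name make_target are assumptions made for illustration, not something the slide specifies.

```python
def make_target(box, class_index, num_classes):
    """Build [Po, bx1, bx2, by1, by2, c1..cn]; box=None means no object."""
    if box is None:
        # Assumption: with no object present, the remaining entries are
        # filled with zeros (a loss would typically ignore them).
        return [0.0] * (5 + num_classes)
    # Assumption: the class variables c1..cn are a one-hot encoding.
    one_hot = [1.0 if i == class_index else 0.0 for i in range(num_classes)]
    return [1.0] + list(box) + one_hot


# Hypothetical example with three classes; the object belongs to class index 1.
with_object = make_target(box=(0.25, 0.75, 0.25, 0.75), class_index=1, num_classes=3)
no_object = make_target(box=None, class_index=None, num_classes=3)
print(with_object)  # [1.0, 0.25, 0.75, 0.25, 0.75, 0.0, 1.0, 0.0]
print(no_object)    # [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
```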
Training
Selecting a model
Links
▪ https://towardsdatascience.com/how-to-train-your-own-object-detector-with-tensorflows-object-detector-api-bec72ecfe1d9
▪ https://www.kdnuggets.com/2017/10/deep-learning-object-detection-comprehensive-review.html
▪ http://www.machinelearninguru.com/deep_learning/tensorflow/basics/tfrecord/tfrecord.html
▪ https://www.coursera.org/learn/convolutional-neural-networks
▪ https://medium.com/comet-app/review-of-deep-learning-algorithms-for-object-detection-c1f3d437b852
▪ https://towardsdatascience.com/evolution-of-object-detection-and-localization-algorithms-e241021d8bad
▪ https://medium.freecodecamp.org/how-to-play-quidditch-using-the-tensorflow-object-detection-api-b0742b99065d
ANY QUERIES?