Object Detection
TA: Young-geun Kim
Biostatistics Lab.,
Seoul National University
March-June, 2018
1 Introduction
2 R-CNN
3 YOLO
4 Evaluation
Introduction
Challenges
Challenges (Conti.)
Challenges (Conti.)
Approaches
R-CNN
Selective Search
Detection Network
Limitation of R-CNN
Fast R-CNN
RoI pooling connects the raw image to the final extracted feature map
before the FC layers.
The RoI feature vector then passes through two sibling FC layers.
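As a rough illustration of the idea (a toy sketch, not the actual Fast R-CNN implementation), RoI pooling can be viewed as adaptive max pooling: the region is split into a fixed grid of bins and each bin is max-pooled, so regions of any size yield a fixed-size output. Here the feature map is a plain 2-D list and the RoI is given in feature-map cell coordinates; the function name and grid size are illustrative.

```python
def roi_pool(feature, roi, out_h=2, out_w=2):
    """Max-pool a rectangular region of a 2-D feature map into a
    fixed out_h x out_w grid, whatever the region's size.

    feature: 2-D list of activations (toy stand-in for a conv feature map).
    roi: (x1, y1, x2, y2) in feature-map cell coordinates, exclusive end.
    """
    x1, y1, x2, y2 = roi
    h, w = y2 - y1, x2 - x1
    out = []
    for i in range(out_h):
        # Integer bin boundaries along the vertical axis.
        ys = y1 + i * h // out_h
        ye = y1 + (i + 1) * h // out_h
        row = []
        for j in range(out_w):
            # Integer bin boundaries along the horizontal axis.
            xs = x1 + j * w // out_w
            xe = x1 + (j + 1) * w // out_w
            # Max over all cells that fall into this bin.
            row.append(max(feature[y][x]
                           for y in range(ys, ye)
                           for x in range(xs, xe)))
        out.append(row)
    return out
```

Because the output grid is fixed, the FC layers that follow see the same input size for every proposal.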
Multi-task Loss
The bbox regression targets for class $k$ are
$t_x^k = (x^k - x_r)/w_r$, $t_y^k = (y^k - y_r)/h_r$, $t_w^k = \log(w^k/w_r)$, $t_h^k = \log(h^k/h_r)$.
The multi-task loss is
$L(p, u, t^u, v) = L_{cls}(p, u) + \lambda\,[u \ge 1]\, L_{loc}(t^u, v)$,
where $L_{cls}(p, u) = -\log p_u$ is the log loss for the true class $u$ and
$L_{loc}(t^u, v) = \sum_{i \in \{x, y, w, h\}} \mathrm{huber}(t_i^u - v_i)$.
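The parameterization and localization loss above can be sketched in a few lines of Python (a minimal illustration; here "huber" is the smooth-L1 loss with delta = 1 used by Fast R-CNN, and the function names are ours, not the paper's):

```python
import math

def bbox_targets(proposal, gt):
    """Regression targets t = (tx, ty, tw, th) for a proposal
    (xr, yr, wr, hr) against a ground-truth box (x, y, w, h),
    following the parameterization above: offsets normalized by
    the proposal size, log-scale width/height ratios."""
    xr, yr, wr, hr = proposal
    x, y, w, h = gt
    return ((x - xr) / wr, (y - yr) / hr,
            math.log(w / wr), math.log(h / hr))

def smooth_l1(d):
    """Smooth-L1 (Huber with delta = 1): quadratic near zero,
    linear in the tails, so outliers are penalized less than L2."""
    d = abs(d)
    return 0.5 * d * d if d < 1.0 else d - 0.5

def loc_loss(t, v):
    """L_loc: sum of smooth-L1 over the four box coordinates."""
    return sum(smooth_l1(ti - vi) for ti, vi in zip(t, v))
```

Note that a proposal identical to the ground truth yields the zero target vector, so the regressor learns corrections relative to the proposal.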
Faster R-CNN
3. For all regions classified as positive, refine them using the reg layer.
The RPN and fast R-CNN share the feature extractor. This shared
structure reduces test time, which is the origin of the name "Faster R-CNN".
The sharing is implemented by the following training sequence.
Phase                  Feature Extractor                Region Proposal
1. Train RPN           Initialized from ImageNet model  -
2. Train fast R-CNN    Initialized from ImageNet model  RPN from phase 1
3. Tune RPN            Frozen from phase 2              -
4. Tune fast R-CNN     Frozen from phase 2              RPN from phase 3
Table: Evaluation on VOC 2007 test set, adapted from Girshick, 2015
and Ren et al., 2015.
YOLO
You-Only-Look-Once
Terminology
Terminology (Conti.)
Terminology (Conti.)
All bounding boxes sharing a grid cell have the same conditional class
probability, formally $Pr(\text{Class}_i \mid \text{Object})$.
At test time, the class-specific confidence,
$Pr(\text{Class}_i) * IoU^{\text{truth}}_{\text{pred}}$, is
predicted by multiplying the predicted conditional class probability and the
objectness confidence.
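This product can be sketched directly (a minimal illustration; the function name is ours, and `objectness` stands for the box's predicted $Pr(\text{Object}) * IoU^{\text{truth}}_{\text{pred}}$):

```python
def class_confidence(cond_class_probs, objectness):
    """Class-specific confidence for one bounding box:
    Pr(Class_i | Object) * Pr(Object) * IoU = Pr(Class_i) * IoU.

    cond_class_probs: the grid cell's conditional class probabilities.
    objectness: the box's predicted objectness confidence.
    """
    return [p * objectness for p in cond_class_probs]
```

Because all boxes in a cell share the conditional class probabilities, only the objectness score differentiates them.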
Terminology (Conti.)
Architecture
Architecture (Conti.)
Loss
YOLO's loss function has five terms: the first two handle
bbox regression, the next two objectness classification,
and the last the class classification.
Here, $1_i$ and $1_{ij}$ are indicators of responsibility for the $i$th grid cell and
its $j$th bounding box, respectively.
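For reference, the loss as given in Redmon et al., 2016, with $S^2$ grid cells, $B$ boxes per cell, and weights $\lambda_{\text{coord}} = 5$, $\lambda_{\text{noobj}} = 0.5$:

```latex
\begin{aligned}
\mathcal{L} ={}& \lambda_{\text{coord}} \sum_{i=0}^{S^2} \sum_{j=0}^{B}
    \mathbb{1}_{ij}^{\text{obj}}
    \left[ (x_i - \hat{x}_i)^2 + (y_i - \hat{y}_i)^2 \right] \\
&+ \lambda_{\text{coord}} \sum_{i=0}^{S^2} \sum_{j=0}^{B}
    \mathbb{1}_{ij}^{\text{obj}}
    \left[ \left(\sqrt{w_i} - \sqrt{\hat{w}_i}\right)^2
         + \left(\sqrt{h_i} - \sqrt{\hat{h}_i}\right)^2 \right] \\
&+ \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\text{obj}}
    \left(C_i - \hat{C}_i\right)^2
 + \lambda_{\text{noobj}} \sum_{i=0}^{S^2} \sum_{j=0}^{B}
    \mathbb{1}_{ij}^{\text{noobj}} \left(C_i - \hat{C}_i\right)^2 \\
&+ \sum_{i=0}^{S^2} \mathbb{1}_{i}^{\text{obj}}
    \sum_{c \in \text{classes}} \left(p_i(c) - \hat{p}_i(c)\right)^2
\end{aligned}
```

The square roots on width and height make a fixed error count more in small boxes than in large ones.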
Seoul National University Deep Learning March-June, 2018 43 / 57
YOLO
Performance
Performance (Conti.)
Compared with fast R-CNN, YOLO has higher localization error but lower
background error.
Correct: correct class and IoU > .5
Loc: correct class, .1 < IoU < .5
Sim: similar class, IoU > .1
Other: wrong class, IoU > .1
Background: IoU < .1 for any object
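The five categories above can be expressed as a simple decision cascade (a sketch under our own naming; `similar_classes` is a hypothetical set of classes deemed similar to the true class, e.g. dog vs. cat):

```python
def categorize(pred_class, true_class, iou, similar_classes=()):
    """Assign one prediction to an error category, checked in order
    from strictest (Correct) to loosest (Background)."""
    if pred_class == true_class and iou > 0.5:
        return "Correct"
    if pred_class == true_class and 0.1 < iou < 0.5:
        return "Loc"          # right class, poorly localized
    if pred_class in similar_classes and iou > 0.1:
        return "Sim"          # confused with a similar class
    if iou > 0.1:
        return "Other"        # wrong class entirely
    return "Background"       # fired on background
```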
Evaluation
Figure: from
https://kr.mathworks.com/help/vision/ref/selectstrongestbbox.html
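The figure illustrates non-maximum suppression (what MATLAB's selectStrongestBbox performs). A minimal greedy NMS can be sketched as follows; the box format (x1, y1, x2, y2) and the threshold value are illustrative choices:

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def nms(boxes, scores, threshold=0.5):
    """Greedy non-maximum suppression: keep the highest-scoring box,
    drop every remaining box that overlaps it by more than
    `threshold`, and repeat on what is left."""
    order = sorted(range(len(boxes)), key=lambda i: -scores[i])
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        order = [i for i in order
                 if iou(boxes[best], boxes[i]) <= threshold]
    return keep
```

For example, two heavily overlapping detections of the same object collapse to the single stronger one, while a distant detection survives.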
Evaluation
Evaluation measures
Detecting all objects is easy: just classify every region as every object
class, so recall alone is not a useful measure.
What would be the value of TN? If the model is reasonable, TN
should be ∞, since there are innumerable background regions correctly left
undetected; hence precision and recall are used rather than accuracy.
Average Precision
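Assuming the VOC-style 11-point interpolation is intended here (the common convention for this era of detectors), AP can be computed as the mean, over recall levels 0.0, 0.1, ..., 1.0, of the maximum precision attained at recall at least that level:

```python
def average_precision(recalls, precisions):
    """11-point interpolated AP (VOC style).

    recalls, precisions: paired values along the precision-recall
    curve, e.g. obtained by sweeping the detection score threshold.
    """
    ap = 0.0
    for level in [i / 10 for i in range(11)]:
        # Interpolated precision: best precision at recall >= level.
        ps = [p for r, p in zip(recalls, precisions) if r >= level]
        ap += max(ps) if ps else 0.0
    return ap / 11
```

The interpolation smooths the characteristic zig-zag of raw precision-recall curves, so small score perturbations do not dominate the metric.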
References
Girshick, Ross, et al. "Rich feature hierarchies for accurate object
detection and semantic segmentation." Proceedings of the IEEE conference
on computer vision and pattern recognition (2014): 580-587.
Uijlings, Jasper RR, et al. "Selective search for object recognition."
International journal of computer vision 104.2 (2013): 154-171.
Felzenszwalb, Pedro F., and Daniel P. Huttenlocher. "Efficient
graph-based image segmentation." International journal of computer
vision 59.2 (2004): 167-181.
Felzenszwalb, Pedro F., et al. "Object detection with discriminatively
trained part-based models." IEEE transactions on pattern analysis and
machine intelligence 32.9 (2010): 1627-1645.
References (Conti.)
References (Conti.)
References (Conti.)
Boyd, Kendrick, Kevin H. Eng, and C. David Page. "Area under the
precision-recall curve: Point estimates and confidence intervals." Joint
European Conference on Machine Learning and Knowledge Discovery
in Databases. Springer, Berlin, Heidelberg, 2013.
Introduction to modern information retrieval