Convolutional Neural Network For Image Classification
Abstract— This paper describes a learning approach based on training convolutional neural networks (CNN) for a traffic sign classification system. In addition, it presents preliminary classification results of applying this CNN to learn features and classify RGB-D images. To determine the appropriate architecture, we explore the transfer learning technique called "fine tuning", which reuses layers trained on the ImageNet dataset in order to provide a solution for a four-class classification task on a new set of data.

Keywords— Convolutional neural network, Deep Learning, Transfer Learning, ImageNet.

I. INTRODUCTION
Image representation for classification tasks has often relied on feature extraction methods which have proven effective for different visual recognition tasks, [1]. The local binary patterns method is used for extracting texture features, and histograms of oriented gradients are applied for image processing. Usually these types of methods have been used to transform images and describe them for many tasks, [2]. Most of the applied features need to be identified by an expert and then manually coded as per the data type and domain. This process is difficult and expensive in terms of expertise and time.

As a solution, deep learning reduces the task of developing new feature extractors, [3], by automating the phase of extracting and learning features. The proposed traffic sign classification system is able to recognize the traffic sign images placed on the road and classify them by exploiting this technology.

There exist many different architectures of deep learning. The model presented in this paper is a classifier system developed using the convolutional neural network category, [4], which is the most efficient and useful type of deep neural network for this kind of data, [5]. CNNs applied to learn image representations on large-scale datasets for recognition tasks can therefore be exploited by transferring these learned representations to other tasks with a limited amount of training data.

To address this problem, we propose using the convolutional neural network AlexNet, trained on the large-scale ImageNet dataset, [6], [7], transferring its learned image representations and reusing them for a classification task with limited training data. The main idea is to design a method which reuses a part of the trained layers of AlexNet.

In the following, the problem statement is presented in section II. Section III introduces the method and the CNN architecture exploited. Initial experimental results using the appropriate CNN architecture, which demonstrate that the developed deep neural network achieves a satisfactory success rate, are described in the first part of section IV. In the second part, the effect of the MiniBatchSize parameter is discussed, [8].
II. PROBLEM STATEMENT

Usually, it is not easy for a driver to keep his eyes everywhere at once while driving. Concentrating on the road, checking it, watching oncoming traffic and what is behind him, all while trying to control his speed, can become difficult and tiring. To avoid road accidents, traffic signs need to be rigid, unique and clear for the driver.

With a traffic sign classification system, the risk arising from a potential hazard ahead can be vastly reduced. Moreover, automatically classifying traffic signs addresses a mandatory problem of self-driving cars.

The present paper aims to build a classifier system that can determine the type of the traffic sign displayed in an image, and that is robust to different real-life conditions such as poor lighting or obstructions, by designing an image processing algorithm. As an initial work, four types of traffic sign are used: non-stop signs, stop signs, green lights and red lights.

Digit image and face classification tasks, [9], exhibit limited variation in appearance and pose. These two domains are therefore close to our task, and the applied methods can be efficiently reused for the traffic sign classification task. Applications and research on image classification, transfer learning, and deep learning are the references to which our method relates; they are discussed below.

Recent methods for image classification tasks use the bag-of-features pipeline. SIFT descriptors, [10], are used for clustering. Spatial pooling, [11], histogram encoding, [12], and the more recent Fisher Vector encoding, [13], are used for feature collection. Although these representations have given acceptable results, it is not obvious whether they are optimal for the task, since they require a lot of time and effort from experts in the specific domain. This process is difficult and expensive in terms of expertise and time.

Deep learning, or deep neural networks, reduces the task of developing a new feature extractor for every visual recognition problem. This optimization is realized by automating the phase of learning the image's representation and by using graphics processing units (GPUs), [14], suited to the application's problem.
III. CONVOLUTIONAL NEURAL NETWORK

A. Architecture

Convolutional Networks (ConvNets) are currently the most efficient deep models for classifying image data. Their multi-stage architectures are inspired by biology. Through these models, invariant features are learned hierarchically and automatically, [15]. They first identify low-level features and then learn to recognize and combine these features to learn more complicated patterns.

These different levels of features come from different layers of the network. Each layer has a specific number of neurons and is presented in 3 dimensions: height, width and depth, [16].

To understand the convolutional neural network structure, [17], we can observe it as two distinct parts. At the input, images are presented as a matrix of pixels. This matrix has 2 dimensions for a grayscale image. Color is represented by a third dimension, of depth 3, for the fundamental colors (Red, Green, Blue), [18].

The first part of a CNN is the convolutive part. It functions as a feature extractor of images. In this part, an image is passed through a succession of filters, or convolution kernels, creating new images called convolution maps. Some intermediate filters reduce the resolution of the image by a local maximum operation. Three types of layers are involved:

• CONV layer: accepts a volume of size [W1×H1×D1], where W1 is the width, H1 the height and D1 the depth. The outputs of the neurons in this type of layer are calculated as the dot product between their weights and the local region they are connected to in the input volume. The obtained output volume [W2×H2×D2], called convolution maps, where W2 is the width, H2 the height and D2 the depth if we decide to use D2 filters or convolution kernels, [19], has size given by equations (1), (2), (3):

W2 = (W1 - F + 2P) / S + 1        (1)
H2 = (H1 - F + 2P) / S + 1        (2)
D2 = K                            (3)

with:
F : spatial extent of the filter;
K : number of filters;
P : zero padding (hyperparameter controlling the output volume);
S : stride (hyperparameter with which we slide the filter).

• RELU layer: applies an activation function such as max(0, x) to produce elementwise non-linearity. This operation does not affect or change the size of the volume, [20].

• POOL layer: inserted between successive CONV layers, it applies a downsampling operation along the spatial dimensions width and height. It uses the MAX operation to optimize the spatial size of the representation as well as reducing the number of parameters, [21]. A POOL layer produces a volume [W2×H2×D2] where W2, H2, D2 are given by equations (4), (5) and (6); equations (1)-(6) are illustrated by the sketch after this list:

W2 = (W1 - F) / S + 1             (4)
H2 = (H1 - F) / S + 1             (5)
D2 = D1                           (6)
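As a quick check of these formulas, the following minimal Python helper (ours, not from the paper) computes the output volume of CONV and POOL layers and reproduces the AlexNet first-layer arithmetic detailed in section IV.B:

    def conv_output_size(w1, h1, f, k, p, s):
        # Equations (1)-(3): W2 = (W1 - F + 2P)/S + 1, H2 likewise, D2 = K.
        return (w1 - f + 2 * p) // s + 1, (h1 - f + 2 * p) // s + 1, k

    def pool_output_size(w1, h1, d1, f, s):
        # Equations (4)-(6): W2 = (W1 - F)/S + 1, H2 likewise, D2 = D1.
        return (w1 - f) // s + 1, (h1 - f) // s + 1, d1

    # AlexNet's first CONV layer (section IV.B): a [227x227x3] input,
    # K=96 kernels of spatial extent F=11, padding P=0, stride S=4.
    print(conv_output_size(227, 227, f=11, k=96, p=0, s=4))  # (55, 55, 96)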
In the end, a feature extraction vector, or CNN code, concatenates the output information into a single vector.

This code is then connected to the input of a second part, consisting of fully connected layers (a multilayer perceptron), [22]. The role of this part is to combine the characteristics of the CNN code in order to classify the image. It determines the class scores, presented in an output volume of size [1×1×k]. The architecture of this part is a usual multilayer perceptron, and each of the k output neurons, connected to all the numbers of the previous layer, corresponds to a category of the classification.
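For illustration, such a second part could be written as follows; a minimal sketch with layer sizes borrowed from AlexNet's fully connected part, not a definitive implementation:

    import torch.nn as nn

    # MLP "second part": maps a CNN code of 9216 values to k class
    # scores, i.e. an output volume of size [1x1xk].
    k = 4  # one output neuron per category
    mlp_head = nn.Sequential(
        nn.Linear(9216, 4096), nn.ReLU(),
        nn.Linear(4096, 4096), nn.ReLU(),
        nn.Linear(4096, k),
    )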
B. CNN training

Creating a CNN is expensive in terms of expertise, equipment and the amount of needed data. The first step is to fix the architecture, by choosing the number of layers, their sizes and the matrix operations that connect them, [23]. The training then consists of optimizing the network's coefficients to minimize the output classification error, as in the sketch below.
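For illustration, a single such optimization step could look as follows in PyTorch; a minimal sketch, independent of the paper's own toolchain:

    import torch
    import torch.nn as nn
    from torchvision import models

    model = models.alexnet()  # an untrained AlexNet as a stand-in network
    criterion = nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

    def train_step(images, labels):
        optimizer.zero_grad()
        loss = criterion(model(images), labels)  # output classification error
        loss.backward()                          # backpropagate its gradient
        optimizer.step()                         # update the coefficients
        return loss.item()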
This training can take several weeks for the best CNNs, with many GPUs working on hundreds of thousands of annotated images. Research teams specialized in improving CNNs publish their technical innovations, so the complexity of creating a CNN can be avoided by adapting publicly available pre-trained networks. These techniques are called transfer learning, [24]; they consist of transferring knowledge from a related source to the target domain. A pre-trained neural network can be used in two ways:
• Automatic feature extractor of images: exploits only the convolutive part of a pre-trained network, using it as an automatic feature extractor of images to feed a classifier of our choice (see the sketch after this list). Only the convolutive part is kept. This part is called frozen, to express the absence of training. The network takes an image in the proper format and outputs the CNN code. Each image in the dataset is thus transformed into a feature vector, which is used to train a new classifier, [25]. This method has many practical interests:

  - The image is transformed into a small vector which extracts features that are usually very relevant. This reduces the size of the problem, [26].
  - Feature extraction is performed only once per image, so it can be performed quickly on a CPU. Machine learning libraries are usually sequential and also run on CPUs.
  - This method makes it possible to exploit the power of CNNs without investing in GPUs, [27].
• Fine tuning: an initialization of the target model, which is then retrained more finely to deal with the new classification problem, [28]. Here we use an architecture carefully optimized by specialists, and we take advantage of feature extraction capabilities learned on a large, high-quality dataset. Fine tuning on images consists of taking a visual system already well trained on a classification task and refining it on a similar task. The only necessary change to the network is the adaptation of the last layer, [29]. For training, it is possible to freeze the initial layers of the neural network and to adapt only the final layers to the new classification problem. Freezing all convolutional layers corresponds to the first method presented, with a pre-initialized multilayer perceptron as the final classifier.
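The first of these two uses, the frozen feature extractor, might look as follows; a minimal sketch assuming torchvision's pre-trained AlexNet (which expects 224x224 inputs), not the paper's own implementation:

    import torch
    from torchvision import models

    # Load a pretrained AlexNet (torchvision >= 0.13 weights API) and
    # freeze it: the "frozen" convolutive part receives no further training.
    alexnet = models.alexnet(weights="IMAGENET1K_V1")
    alexnet.eval()
    for param in alexnet.parameters():
        param.requires_grad = False

    def cnn_code(image_batch):
        # Return the CNN code for a batch of [3x224x224] images,
        # bypassing the 1000-way ImageNet classifier.
        with torch.no_grad():
            x = alexnet.features(image_batch)  # convolutive part
            x = alexnet.avgpool(x)
            return torch.flatten(x, 1)         # one 9216-value vector per image

    codes = cnn_code(torch.randn(4, 3, 224, 224))  # feed any classic classifier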
To learn features for our traffic sign classification task, we apply ConvNets combined with the fine tuning technique. We use the pre-trained convolutional neural network AlexNet, trained on 1000 possible categories over the large ImageNet dataset of the Large Scale Visual Recognition Challenge (ILSVRC-2012), [30], containing over 1.2 million images. Its results were remarkable, achieving a top-5 error of 15.3%.

IV. EXPERIMENTS

In this section, we first detail the architecture of the proposed model. Next, we present experimental results of our method, based on the transfer learning technique, on the traffic sign classification dataset. Finally, we discuss the effect of one of the hyperparameters of a deep neural network on our model.
A. Dataset

The traffic sign dataset contains more than 360 images in total, divided into different classes. To avoid using the testing data, we leave 180 images from the training set for validation; the 180 test images are spread among the four classes "stop sign", "non stop sign", "green light" and "red light". Both training and testing data are distributed over these categories. A hypothetical loading sketch is shown below.
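For concreteness, such a dataset could be loaded and split as follows; the folder layout and file organization here are hypothetical, not the paper's:

    import torch
    from torchvision import datasets, transforms

    # Hypothetical layout, one sub-folder per class:
    #   traffic_signs/stop sign/..., traffic_signs/non stop sign/...,
    #   traffic_signs/green light/..., traffic_signs/red light/...
    preprocess = transforms.Compose([
        transforms.Resize((227, 227)),  # AlexNet input size used in the paper
        transforms.ToTensor(),
    ])
    full_set = datasets.ImageFolder("traffic_signs", transform=preprocess)

    # Hold out 180 images for validation, as described above.
    n_val = 180
    train_set, val_set = torch.utils.data.random_split(
        full_set, [len(full_set) - n_val, n_val])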
B. CNN developed architecture

The network adapted in our method is the AlexNet deep neural network. AlexNet was among the first well-performing convolutional neural networks in the computer vision community. This CNN showed successful results when trained on the difficult ImageNet dataset. The training of the model was run on two GTX 580 GPUs for five to six days, [31], using the batch stochastic gradient descent algorithm.

The network is made up of 5 internal convolutional layers, C1, C2, C3, C4, C5, pooling layers, dropout layers, and 3 fully connected layers, FC6, FC7, FC8. It was used for classification with 1000 possible categories, [32].

The input of the architecture takes images of size [227×227×3] with a zero padding P=0. In the first convolutional layer, AlexNet applies 96 convolution kernels of size F=11, sliding them over the input volume with a stride S=4. The output volume has size [55×55×96], where height and width are W = H = (227-11)/4+1 = 55 and depth K=96. The total number of neurons in this layer is 55×55×96 = 290400.

Each of these 290400 neurons is connected to a local region of [11×11×3] in the input, and each of the 96 neurons is connected, with different weight values, to the same region of size [11×11×3] in the input volume. The rest of the successive layers and filters applied are presented in Fig. 1.
Fig. 1. AlexNet architecture, [33].
Since our classification task has only four categories, it is therefore necessary to change the last layer of AlexNet, replacing it by a layer of four neurons.
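In torchvision's AlexNet, for instance, this amounts to swapping the final fully connected module; a minimal PyTorch sketch, again independent of the paper's own toolchain:

    import torch.nn as nn
    from torchvision import models

    model = models.alexnet(weights="IMAGENET1K_V1")

    # Optionally freeze the initial (convolutional) layers, as in the
    # fine-tuning method of section III.B.
    for param in model.features.parameters():
        param.requires_grad = False

    # Replace the 1000-way final layer FC8 by a layer of four neurons,
    # one per class: stop sign, non-stop sign, green light, red light.
    model.classifier[6] = nn.Linear(4096, 4)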
References
[1] J. Redmon and A. Angelova, "Real-time grasp detection using convolutional neural networks," IEEE International Conference on Robotics and Automation, pp. 1316-1322, 2015.
[2] H. Chang, C. Zhong, J. Han, and J.-H. Mao, "Unsupervised transfer learning via multi-scale convolutional sparse coding for biomedical application," IEEE Transactions on Pattern Analysis and Machine Intelligence, 23 January 2017.
[3] X. Zhou, K. Yu, T. Zhang, and T. Huang, "Image classification using super-vector coding of local image descriptors," ECCV, 2010.
[4] K. E. A. van de Sande, T. Gevers, and C. G. M. Snoek, "Evaluating color descriptors for object and scene recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1582-1596, 2010.
[5] A. Howard, "Some improvements on deep convolutional neural network based image classification," ICLR, 2014.
[6] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, "ImageNet: A large-scale hierarchical image database," CVPR, 2009.
[7] T. Ahonen, A. Hadid, and M. Pietikäinen, "Face description with local binary patterns: Application to face recognition," Pattern Analysis and Machine Intelligence, pp. 2037-2041, 2006.
[8] K. Hornik, M. Stinchcombe, and H. White, "Multilayer feedforward networks are universal approximators," Neural Networks, pp. 359-366, 1989.
[9] G. Cybenko, "Approximation by superpositions of a sigmoidal function," Math. Contr. Signals Syst., pp. 303-314, 1989.
[10] P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun, "OverFeat: Integrated recognition, localization and detection using convolutional networks," arXiv:1312.6229, 2013.
[11] H. B. Burke, "Artificial neural networks for cancer research: Outcome prediction," Sem. Surg. Oncol., vol. 10, pp. 73-79, 1994.
[12] H. B. Burke, P. H. Goodman, D. B. Rosen, D. E. Henson, J. N. Weinstein, F. E. Harrell, J. R. Marks, D. P. Winchester, and D. G. Bostwick, "Artificial neural networks improve the accuracy of cancer survival prediction," Cancer, vol. 79, pp. 857-862, 1997.
[13] J. Lampinen, S. Smolander, and M. Korhonen, "Wood surface inspection system based on generic visual features," Industrial Applications of Neural Networks, F. F. Soulié and P. Gallinari, Eds., Singapore: World Scientific, pp. 35-42, 1998.
[14] T. Petsche, A. Marcantonio, C. Darken, S. J. Hanson, G. M. Huhn, and I. Santoso, "An autoassociator for on-line motor monitoring," Industrial Applications of Neural Networks, F. F. Soulié and P. Gallinari, Eds., Singapore: World Scientific, pp. 91-97, 1998.
[15] A. Sifaoui, A. Abdelkrim, and M. Benrejeb, "On RBF neural network classifier design for iris plants," The 37th International Conference on Computers and Industrial Engineering, pp. 113-118, Alexandria, October 2007.
[16] S. J. Pan and Q. Yang, "A survey on transfer learning," IEEE Transactions on Knowledge and Data Engineering, vol. 22, no. 10, October 2010.
[17] M. Juneja, A. Vedaldi, C. V. Jawahar, and A. Zisserman, "Blocks that shout: Distinctive parts for scene classification," CVPR, 2013.
[18] R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," CVPR, 2014.
[19] Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel, "Backpropagation applied to handwritten zip code recognition," Neural Computation, 1(4):541-551, 1989.
[20] Y. Boureau, F. Bach, Y. LeCun, and J. Ponce, "Learning mid-level features for recognition," CVPR, 2010.
[21] M. Parizeau, "Le perceptron multicouche et son algorithme de rétropropagation de l'erreur," Département de génie électrique et de génie informatique, Université Laval, 10 September 2014.
[22] A. Ahmed, K. Yu, W. Xu, Y. Gong, and E. Xing, "Training hierarchical feed-forward visual recognition models using transfer learning from pseudo-tasks," ECCV, 2008.
[23] D. Lowe, "Distinctive image features from scale-invariant keypoints," IJCV, 60(2):91-110, 2004.
[24] Y. LeCun, L. Bottou, and J. HuangFu, "Learning methods for generic object recognition with invariance to pose and lighting," CVPR, 2004.
[25] F. Perronnin, J. Sánchez, and T. Mensink, "Improving the Fisher kernel for large-scale image classification," ECCV, 2010.
[26] P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun, "OverFeat: Integrated recognition, localization and detection using convolutional networks," arXiv:1312.6229, 2013.
[27] J. Schmidhuber, "Multi-column deep neural networks for image classification," CVPR, 2012.
[28] M. A. Ranzato, C. Poultney, S. Chopra, and Y. LeCun, "Efficient learning of sparse representations with an energy-based model," Advances in Neural Information Processing Systems (NIPS), 2006.
[29] Y. LeCun, F.-J. Huang, and L. Bottou, "Learning methods for generic object recognition with invariance to pose and lighting," Computer Vision and Pattern Recognition, 2004.
[30] S. Behnke, "Hierarchical Neural Networks for Image Interpretation," Lecture Notes in Computer Science, Springer, 2003.
[31] Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, "Greedy layer-wise training of deep networks," Neural Information Processing Systems, 2007.