Deep Learning Unit 1

The document provides an overview of deep learning techniques. It defines deep learning as a machine learning method that learns features and tasks directly from data like images, text, or sound. It then lists and provides brief explanations of common deep learning architectures like convolutional neural networks, recurrent neural networks, and generative adversarial networks. The document also discusses key concepts in deep learning like activation functions, gradient descent, stochastic gradient descent, and mini-batch gradient descent.

Uploaded by

Aditya Pratap Singh


Deep Learning

Unit -1
DL
ML vs DL
So, what do you think: what is Deep Learning?
Deep learning is a machine learning technique that learns
features and tasks directly from the data, where the data may be
images, text or sound!

For further assistance, code and slide https://fahadhussaincs.blogspot.com/

YouTube Channel : https://www.youtube.com/fahadhussaintutorial
• Artificial Neural Network
• Convolutional neural network
• Recurrent neural network
• Autoencoder
• GAN (Generative Adversarial Network)
• Pre-trained models (CNN architectures and many more…)
Supervised Unsupervised Reinforcement Learning

Artificial Neural Network

Regression and Classification
What is Neural
Neural Working
Artificial
Normalize/Standardize
Neural Network
$\sum_{i=1}^{m} w_i x_i$

Applying Activation
Function
What is an Activation Function?
Activation functions are an extremely important feature of artificial neural networks.
They decide whether a neuron should be activated or not, that is, whether the
information the neuron is receiving is relevant to the task at hand or should
be ignored.

The activation function is the non-linear transformation that we apply to the input
signal. This transformed output is then sent to the next layer of neurons as input.
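As a minimal sketch of the step described above, the snippet below computes the weighted sum of the inputs and passes it through an activation function. The function names, the optional `bias` term, and the numeric values are assumptions made for illustration; they do not come from the slides.

```python
import math


def sigmoid(z):
    """Logistic sigmoid, used here only as an example activation."""
    return 1.0 / (1.0 + math.exp(-z))


def neuron_output(inputs, weights, activation, bias=0.0):
    """Weighted sum of the inputs (plus an assumed bias), then an activation."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return activation(z)


# Hypothetical inputs and weights, chosen only for illustration.
inputs = [0.5, -1.0, 2.0]
weights = [0.4, 0.3, -0.2]
out = neuron_output(inputs, weights, sigmoid, bias=0.1)
print(out)  # a value strictly between 0 and 1
```

The value returned by `neuron_output` is what would be passed on as input to the neurons of the next layer.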

• Linear Activation Function

• Non Linear Activation Function
Linear Function
The function is a straight line (linear). Therefore, the output of
the function is not confined to any range.

Non Linear Function
They make it easy for the model to generalize or adapt
to a variety of data and to differentiate between
outputs.
The nonlinear activation functions are mainly divided on
the basis of their range or curves:

1. Threshold
2. Sigmoid
3. Tanh
4. ReLU
5. Leaky ReLU
6. Softmax
Threshold Function?
Sigmoid Function?
The Sigmoid function curve looks like an S-shape.
This function reduces extreme values or outliers in data without removing them.
It converts independent variables of near infinite range into simple probabilities
between 0 and 1, and most of its output will be very close to 0 or 1.
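The squashing behaviour described above can be sketched with the standard logistic formula sigmoid(x) = 1 / (1 + e^(-x)). The slides do not give the formula, so treat this as an illustrative sketch; the two-branch form is simply one common way to avoid overflow for large negative inputs.

```python
import math


def sigmoid(x):
    """Logistic sigmoid: maps any real x to a value between 0 and 1."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    # For negative x, use an algebraically equivalent form so that
    # math.exp never receives a large positive argument.
    e = math.exp(x)
    return e / (1.0 + e)


for x in (-10.0, -1.0, 0.0, 1.0, 10.0):
    print(x, sigmoid(x))
```

Inputs far from zero land very close to 0 or 1, while inputs near zero stay near 0.5.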
Rectifier (ReLU) Function?
ReLU is the most widely used activation function while designing networks today. First
things first, the ReLU function is non linear, which means we can easily backpropagate the
errors and have multiple layers of neurons being activated by the ReLU function.
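A minimal sketch of ReLU and its gradient, using the usual definition f(x) = max(0, x). The slides do not write the formula out, and the gradient value at exactly x = 0 is a convention chosen here for illustration.

```python
def relu(x):
    """ReLU: passes positive inputs through and clips the rest to zero."""
    return max(0.0, x)


def relu_grad(x):
    """Gradient of ReLU; the value at x == 0 is set to 0 by convention here."""
    return 1.0 if x > 0 else 0.0


for x in (-2.0, 0.0, 3.0):
    print(x, relu(x), relu_grad(x))
```

The gradient is 1 for positive inputs, so errors pass backward unchanged through active neurons.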
Leaky ReLU Function?
The Leaky ReLU function is an improved version of the ReLU function. As we saw,
for the ReLU function the gradient is 0 for x<0, which makes the neurons die for activations in
that region. Leaky ReLU is defined to address this problem. Instead of defining the ReLU
function as 0 for x less than 0, we define it as a small linear component of x, namely a·x.

What we have done here is simply replace the horizontal line with a non-zero, non-horizontal line. Here
a is a small value like 0.01 or so.
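The definition above can be sketched as follows, with the slope `a` defaulting to the 0.01 mentioned in the text. The function names are chosen for illustration.

```python
def leaky_relu(x, a=0.01):
    """Leaky ReLU: x for positive inputs, a small linear component a*x otherwise."""
    return x if x > 0 else a * x


def leaky_relu_grad(x, a=0.01):
    """Gradient of Leaky ReLU: never exactly zero, so the neuron keeps learning."""
    return 1.0 if x > 0 else a


for x in (-2.0, 0.5):
    print(x, leaky_relu(x), leaky_relu_grad(x))
```

Because the gradient for negative inputs is `a` rather than 0, weight updates still flow through neurons whose input is negative.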
Tanh Function?
Pronounced “tanch,” tanh is a hyperbolic trigonometric function.
Just as the tangent represents a ratio between the opposite and adjacent sides of a right triangle,
tanh represents the ratio of the hyperbolic sine to the hyperbolic cosine: tanh(x) = sinh(x) /
cosh(x).
Unlike the Sigmoid function, the normalized range of tanh is -1 to 1. The advantage of tanh is
that it can deal more easily with negative numbers.
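The ratio tanh(x) = sinh(x) / cosh(x) can be checked directly against Python's built-in `math.tanh`. This is a small illustrative sketch; for large |x| the naive ratio would overflow, so the built-in is preferable in practice.

```python
import math


def tanh_ratio(x):
    """tanh computed literally as sinh(x) / cosh(x); fine for moderate x."""
    return math.sinh(x) / math.cosh(x)


for x in (-2.0, -0.5, 0.0, 0.5, 2.0):
    # The two columns should agree up to floating-point rounding.
    print(x, tanh_ratio(x), math.tanh(x))
```

Negative inputs map to negative outputs and positive inputs to positive outputs, all within the range -1 to 1.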
Softmax Function (for Multi-class Classification)?
The softmax function calculates the probability distribution of an event over ‘n’ different events. In general
terms, this function calculates the probability of each target class over all possible target classes. The
calculated probabilities are then used to determine the target class for the given inputs.

The main advantage of using softmax is the range of the output probabilities. Each probability lies between 0 and 1, and the sum of all
the probabilities is equal to one. If the softmax function is used in a multi-class classification model, it returns the
probability of each class, and the target class will have the highest probability.

The formula computes the exponential (e-power) of the given input value and the sum of the exponential values of
all the values in the inputs. The ratio of the exponential of the input value to the sum of the exponential
values is the output of the softmax function.
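The formula described above can be sketched in a few lines. The input scores are hypothetical, and subtracting the maximum score before exponentiating is a common numerical-stability step that does not change the result; it is an addition not mentioned in the slides.

```python
import math


def softmax(scores):
    """Exponentiate each score and divide by the sum of all exponentials."""
    m = max(scores)  # shift for numerical stability; the ratios are unchanged
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical class scores, for illustration only.
probs = softmax([2.0, 1.0, 0.1])
print(probs, sum(probs))
predicted_class = probs.index(max(probs))
print(predicted_class)
```

The class with the largest input score receives the highest probability, and the probabilities sum to one.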
Activation Function Example
How Neural Networks Work and Backpropagation in Deep Learning
How Neural Networks Work with Many Neurons
What is Gradient Descent (BGD)
Gradient Descent is an optimization technique that is used to improve deep learning
and neural network-based models by minimizing the cost function.
Gradient Descent works together with backpropagation during training: backpropagation
computes the gradient of the cost function J(w) with respect to the weights w, and
gradient descent repeatedly moves w a small step in the opposite direction of that
gradient, updating until it reaches a minimum of J(w) (the global minimum when J is convex).

More precisely,
gradient descent is an algorithm used to iterate through different
combinations of weights in an efficient way, to find the combination of
weights that has the minimum error.
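A minimal sketch of the update loop, using the standard rule w ← w − learning_rate · dJ/dw on a one-dimensional toy cost J(w) = (w − 3)². The cost function, learning rate, and step count are assumptions chosen for illustration, not values from the slides.

```python
def gradient_descent(grad, w0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, starting from w0."""
    w = w0
    for _ in range(steps):
        w = w - learning_rate * grad(w)
    return w


def cost(w):
    """Toy convex cost with its minimum at w = 3."""
    return (w - 3.0) ** 2


def cost_grad(w):
    """Derivative of the toy cost."""
    return 2.0 * (w - 3.0)


w_star = gradient_descent(cost_grad, w0=0.0)
print(w_star, cost(w_star))
```

Each step shrinks the distance to the minimum by a constant factor here, so the weight settles very close to 3 without trying every possible value.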
Brute force algorithm

Curse of dimensionality

A brute-force algorithm is a programming style that does not include any shortcuts to
improve performance, but instead relies on sheer computing power to try all possibilities until the
solution to a problem is found. A classic example is the traveling salesman problem (TSP).
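To make the cost of trying all possibilities concrete, here is a small brute-force sketch for the traveling salesman problem. The four-city distance matrix is hypothetical. With n cities the loop visits (n − 1)! tours, which is why this approach stops being practical very quickly.

```python
from itertools import permutations


def tour_length(order, dist):
    """Total length of a closed tour that returns to its starting city."""
    n = len(order)
    return sum(dist[order[i]][order[(i + 1) % n]] for i in range(n))


def brute_force_tsp(dist):
    """Try every tour that starts at city 0 and keep the shortest one."""
    n = len(dist)
    best_order, best_len = None, float("inf")
    for perm in permutations(range(1, n)):
        order = (0,) + perm
        length = tour_length(order, dist)
        if length < best_len:
            best_order, best_len = order, length
    return best_order, best_len


# Hypothetical symmetric distances between four cities.
dist = [
    [0, 2, 9, 10],
    [2, 0, 6, 4],
    [9, 6, 0, 3],
    [10, 4, 3, 0],
]
best_order, best_len = brute_force_tsp(dist)
print(best_order, best_len)
```

Searching over combinations of network weights by brute force would face the same explosion in the number of candidates, which motivates gradient descent.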
What is Gradient Descent
Useful link

https://towardsdatascience.com/understanding-the-mathematics-behind-gradient-descent-dde5dc9be06e
Stochastic gradient descent
The word ‘stochastic’ describes a system or a process that is linked with a random
probability. Hence, in Stochastic Gradient Descent, a few samples are selected
randomly instead of the whole data set for each iteration. In Gradient Descent,
the term “batch” denotes the total number of samples from a
dataset that is used for calculating the gradient for each iteration. In typical
Gradient Descent optimization, like Batch Gradient Descent, the batch is taken to
be the whole dataset. Using the whole dataset is really useful for
getting to the minima in a less noisy or less random manner, but it becomes a problem
when our datasets get really huge, because every single update then requires a pass over all the samples.
Stochastic gradient descent (often abbreviated SGD) is an iterative method for
optimizing an objective function with suitable smoothness properties (e.g. differentiable
or subdifferentiable). ~Convex Loss function~
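A minimal sketch of stochastic gradient descent on a toy one-weight linear model y ≈ w·x with squared error. This version updates on one randomly ordered sample at a time. The data, learning rate, and epoch count are assumptions chosen for illustration.

```python
import random


def sgd_linear(xs, ys, learning_rate=0.01, epochs=50, seed=0):
    """Fit y ≈ w * x, updating w after every single sample."""
    rng = random.Random(seed)
    w = 0.0
    indices = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(indices)  # visit the samples in a fresh random order
        for i in indices:
            error = w * xs[i] - ys[i]
            # Gradient of the squared error (w*x - y)^2 for this one sample.
            w -= learning_rate * 2.0 * error * xs[i]
    return w


# Hypothetical noise-free data generated by y = 2x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]
w_sgd = sgd_linear(xs, ys)
print(w_sgd)
```

Each update touches only one sample, so its cost does not grow with the size of the dataset. The price is a noisier path toward the minimum.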
Mini Batch gradient descent
Mini-batch gradient descent is a variation of the gradient descent algorithm
that splits the training dataset into small batches that are used to calculate
model error and update model coefficients.
Implementations may sum the gradient over the mini-batch or average it;
combining several samples reduces the variance of the gradient estimate
compared with using a single sample.

Mini-batch gradient descent seeks to find a balance between the robustness
of stochastic gradient descent and the efficiency of batch gradient descent. It
is the most common implementation of gradient descent used in the field of
deep learning.
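The sketch below shows mini-batch gradient descent on a toy one-weight linear model y ≈ w·x, averaging the gradient over each batch. The data and hyperparameters are assumptions chosen for illustration. Setting `batch_size` to 1 gives per-sample stochastic updates, and setting it to the dataset size gives batch gradient descent.

```python
import random


def minibatch_gd(xs, ys, batch_size=2, learning_rate=0.01, epochs=100, seed=0):
    """Fit y ≈ w * x using the average gradient of each mini-batch."""
    rng = random.Random(seed)
    w = 0.0
    indices = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(indices)
        for start in range(0, len(indices), batch_size):
            batch = indices[start:start + batch_size]
            # Average the per-sample squared-error gradients over the batch.
            grad = sum(2.0 * (w * xs[i] - ys[i]) * xs[i] for i in batch) / len(batch)
            w -= learning_rate * grad
    return w


# Hypothetical noise-free data generated by y = 2x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]
w_mb = minibatch_gd(xs, ys, batch_size=2)
print(w_mb)
```

The same function covers all three variants, which makes the trade-off explicit: larger batches give smoother but more expensive updates.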

BGD SGD MBGD

Thank You
