Deep learning exp 2.3 MU
Aim: Apply a learning algorithm (e.g., SGD, Adagrad, Adadelta, RMSprop, or Adam) to learn the
parameters of a supervised single-layer feed-forward neural network.
Theory:
What is an Optimizer?
An optimizer is a function or algorithm that modifies the neural network’s attributes, such as
weights and learning rate. As a result, it helps to reduce overall loss and improve accuracy.
Choosing the right weights for the model is difficult because a deep-learning model typically
has millions of parameters, which makes selecting an appropriate optimization algorithm for
your application all the more important.
Understanding optimization algorithms is crucial for diving deep into deep learning.
Before you continue, there are a few terms you should be familiar with.
Epoch: The number of times the algorithm runs through the entire training dataset.
Batch: The number of training samples used for a single update of the model parameters (the
batch size).
Learning Rate: A parameter that controls how much the model weights are adjusted at each
update.
Cost Function/Loss Function: Used to calculate the cost, i.e. the difference between the
predicted and the actual value.
Weights/Bias: A model's learnable parameters, which control the signal between two neurons.
Momentum: A very popular technique used along with SGD. Instead of relying solely on the
gradient of the current step to guide the search, momentum also takes the gradients of
previous steps into account to determine the direction and size of the update.
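As a minimal, hypothetical Keras sketch (the layer sizes and hyperparameter values here are
illustrative only and are not taken from the experiment code below), these terms map directly
onto familiar arguments:
import keras
from keras import layers

# A single-layer feed-forward network: 784 inputs -> 10-way softmax output
model = keras.Sequential([keras.Input(shape=(784,)),
                          layers.Dense(10, activation='softmax')])
model.compile(
    optimizer=keras.optimizers.SGD(learning_rate=0.01, momentum=0.9),  # learning rate, momentum
    loss='categorical_crossentropy',                                   # cost/loss function
    metrics=['accuracy'])
# model.fit(x, y, batch_size=64, epochs=10)  # batch size and number of epochs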
Adam derives its name from adaptive moment estimation. This optimization algorithm is an
extension of stochastic gradient descent that updates network weights during training. It is a
hybrid of the "gradient descent with momentum" and "RMSprop" algorithms.
It is an adaptive learning-rate method that computes individual learning rates for different
parameters.
Adam can be used instead of the classical stochastic gradient descent procedure to update
network weights iteratively based on the training data.
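As a minimal sketch (the hyperparameter values shown are the common Keras defaults, listed
here only for illustration), the optimizer can be created explicitly and later passed to
model.compile:
from keras.optimizers import Adam

# Adam with its standard hyperparameters:
#   learning_rate  -> step size (alpha)
#   beta_1, beta_2 -> decay rates of the two moving averages (momentum / RMSprop parts)
#   epsilon        -> small constant that avoids division by zero
adam = Adam(learning_rate=0.001, beta_1=0.9, beta_2=0.999, epsilon=1e-7)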
The Adam optimizer combines two gradient descent methods:
Momentum: This method speeds up gradient descent by using an "exponentially weighted
average" of the gradients; averaging the gradients helps the algorithm converge towards the
minima faster.
RMSprop: Root mean square propagation is an adaptive learning-rate method that improves on
AdaGrad. It uses an "exponential moving average" of squared gradients rather than the
cumulative sum of squared gradients that AdaGrad uses. The notation used for the update rules
sketched below is:
• Wt = weights at time t
• Wt+1 = weights at time t+1
• αt = learning rate at time t
• ∂L/∂Wt = gradient of the loss function with respect to the weights at time t
• Vt = sum of the squares of past gradients, i.e. sum((∂L/∂Wt−1)²) (initially Vt = 0)
• β = moving-average parameter (constant, typically 0.9)
• ϵ = a small positive constant (e.g. 10⁻⁸) that prevents division by zero
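The standard textbook forms of the two update rules, with β1 and β2 as the respective
moving-average decay rates, are:
\[ m_t = \beta_1 m_{t-1} + (1-\beta_1)\,\frac{\partial L}{\partial W_t}, \qquad W_{t+1} = W_t - \alpha_t\, m_t \quad \text{(momentum)} \]
\[ v_t = \beta_2 v_{t-1} + (1-\beta_2)\left(\frac{\partial L}{\partial W_t}\right)^{2}, \qquad W_{t+1} = W_t - \frac{\alpha_t}{\sqrt{v_t + \epsilon}}\,\frac{\partial L}{\partial W_t} \quad \text{(RMSprop)} \]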
Adam Optimizer takes the strengths or positive characteristics of the previous two methods and
builds on them to provide a more optimized gradient descent.
Here, the rate of gradient descent is controlled so that there is minimal oscillation when the
search approaches the global minimum, while the steps remain large enough to get past
local-minima hurdles along the way. Combining the features of the two methods above therefore
lets the algorithm reach the global minimum efficiently.
Combining the formulas used in the two methods above, we get the following update rule:
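A standard statement of the Adam update, combining the momentum and RMSprop estimates with the
usual bias corrections, is:
\[ m_t = \beta_1 m_{t-1} + (1-\beta_1)\,\frac{\partial L}{\partial W_t}, \qquad v_t = \beta_2 v_{t-1} + (1-\beta_2)\left(\frac{\partial L}{\partial W_t}\right)^{2} \]
\[ \hat{m}_t = \frac{m_t}{1-\beta_1^{\,t}}, \qquad \hat{v}_t = \frac{v_t}{1-\beta_2^{\,t}}, \qquad W_{t+1} = W_t - \alpha_t\,\frac{\hat{m}_t}{\sqrt{\hat{v}_t} + \epsilon} \]
with β1 ≈ 0.9, β2 ≈ 0.999 and ϵ ≈ 10⁻⁸ as typical values.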
Code:
import keras
from keras.datasets import mnist
from keras.models import Sequential
from keras.layers import Dense, Dropout, Flatten
from keras.layers import Conv2D, MaxPooling2D

# Load the MNIST handwritten-digit dataset
(x_train, y_train), (x_test, y_test) = mnist.load_data()
print(x_train.shape, y_train.shape)

# Reshape to (samples, 28, 28, 1) so the convolutional layer receives single-channel images
x_train = x_train.reshape(x_train.shape[0], 28, 28, 1)
x_test = x_test.reshape(x_test.shape[0], 28, 28, 1)
input_shape = (28, 28, 1)

batch_size = 64
num_classes = 10
epochs = 10

# One-hot encode the labels
y_train = keras.utils.to_categorical(y_train, num_classes)
y_test = keras.utils.to_categorical(y_test, num_classes)

# Scale pixel values to [0, 1]
x_train = x_train.astype('float32') / 255
x_test = x_test.astype('float32') / 255

def build_model(optimizer):
    model = Sequential()
    model.add(Conv2D(32, kernel_size=(3, 3), activation='relu', input_shape=input_shape))
    model.add(MaxPooling2D(pool_size=(2, 2)))
    model.add(Dropout(0.25))
    model.add(Flatten())
    model.add(Dense(256, activation='relu'))
    model.add(Dropout(0.5))
    model.add(Dense(num_classes, activation='softmax'))
    model.compile(loss=keras.losses.categorical_crossentropy,
                  optimizer=optimizer,
                  metrics=['accuracy'])
    return model

# Train the same architecture with each optimizer and keep its training history
optimizers = ['Adadelta', 'Adagrad', 'Adam', 'RMSprop', 'SGD']
histories = {}
for name in optimizers:
    model = build_model(name)
    histories[name] = model.fit(x_train, y_train, batch_size=batch_size,
                                epochs=epochs, verbose=1,
                                validation_data=(x_test, y_test))
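As an optional follow-up (a sketch only; it assumes the histories dictionary filled in the loop
above and that the installed Keras version logs validation accuracy under 'val_accuracy' or
'val_acc'), the optimizers can be compared on the final validation accuracy they reach:
# Compare the final validation accuracy reached with each optimizer
for name, hist in histories.items():
    val_acc = hist.history.get('val_accuracy', hist.history.get('val_acc'))
    print(f"{name}: final validation accuracy = {val_acc[-1]:.4f}")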
Output:
Conclusion: Hence, we were able to apply the above learning algorithms (Adadelta, Adagrad,
Adam, RMSprop, and SGD) to learn the parameters of a supervised single-layer feed-forward
neural network.