Deep Learning

Lecture-2
Dr. Abdul Jaleel
Associate Professor
Machine Learning: A New Programming Paradigm
Linear Regression: y = mx + c

But how do we determine the exact values of m and c?


Linear Regression: y = mx + c

There are too many possible lines. Which one is best suited?
Mean Squared Error
Residuals, Error, or Loss

Cost Function

A cost function may be used to compare different hypothetical lines.

There are lots of regression lines, each having some cost. Which one is the best? The one with the minimum MSE cost.
Loss Functions

 Squared Loss: Loss = (y − y_p)²

 Mean Squared Error (MSE): MSE = (1/n) Σᵢ (yᵢ − y_pᵢ)²

 Absolute Loss: Loss = |y − y_p|

 Mean Absolute Error (MAE): MAE = (1/n) Σᵢ |yᵢ − y_pᵢ|
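As a quick illustration (added here, not from the slides), these loss functions could be implemented in plain Python with NumPy roughly as follows:

import numpy as np

def squared_loss(y, y_pred):
    # Per-sample squared loss: (y - y_pred)^2
    return (y - y_pred) ** 2

def mse(y, y_pred):
    # Mean Squared Error over all samples
    return np.mean((y - y_pred) ** 2)

def absolute_loss(y, y_pred):
    # Per-sample absolute loss: |y - y_pred|
    return np.abs(y - y_pred)

def mae(y, y_pred):
    # Mean Absolute Error over all samples
    return np.mean(np.abs(y - y_pred))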
Convex Optimization and Gradient Descent Approach
A real-valued function defined on an n-dimensional interval is called convex if the
line segment between any two points on the graph of the function lies above or on the
graph.
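In symbols (a standard formulation added for clarity, not printed on the slide): f is convex if, for any two points x₁ and x₂ in its domain and any λ ∈ [0, 1],

f(λx₁ + (1 − λ)x₂) ≤ λ f(x₁) + (1 − λ) f(x₂).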
Convex Optimization for a set of 21 data points

Regression Line: y_p = wx + 0
Convex Optimization

Loss function plot of the 21 data points for
Wj = {−1, −0.5, 0, 0.5, 1, 1.5, 2, 2.5, …, 5}

Loss = (1/n) Σᵢ (yᵢ − w·xᵢ)²
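As an illustrative sketch (the 21 data points themselves are not reproduced in this export, so the values below are assumed to lie on y = 2x), the loss for each candidate weight could be computed like this:

import numpy as np

# Hypothetical stand-in for the 21 data points shown on the slide.
x = np.linspace(0, 10, 21)
y = 2 * x

# Sweep the candidate weights and record the MSE of y_p = w*x + 0 for each.
weights = np.arange(-1, 5.5, 0.5)
for w in weights:
    loss = np.mean((y - w * x) ** 2)
    print(f"w = {w:4.1f}  ->  MSE = {loss:8.2f}")

# Plotting MSE against w traces out a convex (bowl-shaped) curve with its minimum at w = 2.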
Convex Optimization

From the range of weight values plotted in the left-side graph, let's estimate the loss for a weight value of zero.
Convex Optimization

y = 0x + 0

The Mean Squared Error loss for weight value zero is calculated from the difference (distance) between the red and green lines plotted on the right side.
Convex Optimization

y = 0.5x + 0
Convex Optimization

y = 1x + 0

Next, let's estimate the loss for a weight value of one.
Convex Optimization

y = 1.5x + 0

Weight value 1.5 decreases the MSE.
Convex Optimization

y = 2x + 0

For weight value 2, the predicted line best fits the data points.
Convex Optimization with bias

y_p = wx + b
Convex Optimization with bias

Loss = (1/n) Σᵢ (yᵢ − (w·xᵢ + b))²
Convex Optimization with bias

Loss = (1/n) Σᵢ (yᵢ − (w·xᵢ + b))²

The loss function's surface plot over (w, b) is converted into a contour plot.
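A rough sketch of how such a surface (and its contour plot) could be computed; the data points and the true line below are assumptions for illustration only:

import numpy as np

# Hypothetical data assumed to lie on some true line y = w_true*x + b_true.
w_true, b_true = 2.0, -1.0
x = np.linspace(0, 10, 21)
y = w_true * x + b_true

# Evaluate the MSE loss on a grid of (w, b) values.
w_grid = np.linspace(-1, 5, 61)
b_grid = np.linspace(-4, 2, 61)
W, B = np.meshgrid(w_grid, b_grid)

# Broadcast: predictions for every (w, b) pair against every data point.
loss = np.mean((y[None, None, :] - (W[..., None] * x + B[..., None])) ** 2, axis=-1)

# `loss` can be rendered as a 3-D surface (plot_surface) or as a contour plot (contourf);
# its single minimum sits at (w, b) = (w_true, b_true).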
Convex Optimization with bias

y_p = wx + b

Convex Optimization with bias

y = 0x − 1 (y_p = wx + b)

Convex Optimization with bias

y = 1x − 1 (y_p = wx + b)

Convex Optimization with bias

y = 2x − 1 (y_p = wx + b)

Convex Optimization with bias

y = 2x + 0 (y_p = wx + b)

Convex Optimization with bias

y = 2x + 1 (y_p = wx + b)
Gradient Descent Approach
Gradient Descent
Slope and Derivative
Result: the derivative of x² is 2x
Derivative vs. Partial Derivative
Partial Derivative
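For the MSE loss used above, the partial derivatives with respect to w and b (a standard derivation, stated here for completeness) are:

∂L/∂w = −(2/n) Σᵢ xᵢ (yᵢ − (w·xᵢ + b))
∂L/∂b = −(2/n) Σᵢ (yᵢ − (w·xᵢ + b))

Gradient descent then repeatedly updates w := w − α·∂L/∂w and b := b − α·∂L/∂b, where α is the learning rate.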
Gradient Descent Approach
Deep Learning
Lecture-3
Dr. Abdul Jaleel
Associate Professor
Gradient Descent

H(x) = Pred_y

 Let's apply gradient descent to coefficient learning: find the values of a function's parameters that minimize the cost function as far as possible.

We have almost reached the best-fit line.
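A sketch of what that loop might look like in plain Python (the data, learning rate, and epoch count below are assumptions, not taken from the lecture):

import numpy as np

def gradient_descent(x, y, lr=0.01, epochs=5000):
    # Learn w and b of y_p = w*x + b by minimizing MSE with gradient descent.
    w, b = 0.0, 0.0
    n = len(x)
    for _ in range(epochs):
        y_pred = w * x + b
        # Partial derivatives of the MSE loss with respect to w and b.
        dw = -(2 / n) * np.sum(x * (y - y_pred))
        db = -(2 / n) * np.sum(y - y_pred)
        # Step against the gradient, scaled by the learning rate.
        w -= lr * dw
        b -= lr * db
    return w, b

# Hypothetical data lying on y = 2x - 1.
x = np.linspace(0, 10, 21)
y = 2 * x - 1
w, b = gradient_descent(x, y)
print(w, b)   # converges close to w = 2, b = -1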
- In neural networks, we apply logistic regression on top of the linear regression best-fit line whose parameters were learned by gradient descent.

- The sigmoid function works as an activation function for the neuron to classify the outcome.
Why do we need a Sigmoid / Logit Function instead of a Step Function for Neuron Activation?
The Linear Equation

Non-Linear Activation Function
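As a small illustration (the feature values, weights, and bias below are hypothetical), the neuron first computes the linear equation and then passes the result through the sigmoid activation:

import math

def sigmoid(z):
    # Squashes any real value into (0, 1), so the output can be read as a probability.
    return 1 / (1 + math.exp(-z))

# Hypothetical inputs, weights, and bias.
w1, w2, b = 1.0, 1.0, -0.5
x1, x2 = 0.3, 0.8

z = w1 * x1 + w2 * x2 + b      # the linear equation
y_pred = sigmoid(z)            # the non-linear activation
print(y_pred)                  # about 0.65, interpreted as the probability of class 1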
How it works for Row 1
Predicted and actual outcome for Row 1: the error is calculated with the log loss function instead of MSE.
Predicted and actual outcome for Row 2: the error is calculated with the log loss function.
Predicted and actual outcome for Row 13: the error is calculated with the log loss function.
https://towardsdatascience.com/why-not-mse-as-a-loss-function-for-logistic-regression-589816b5e03c
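A minimal sketch of how the log loss for a single row could be computed (the predicted and actual values below are assumed, not taken from the slides); unlike MSE, this loss keeps the logistic regression cost function convex:

import math

def log_loss_single(y_true, y_pred, eps=1e-15):
    # Binary cross-entropy for one sample; eps keeps y_pred away from exactly 0 or 1.
    y_pred = min(max(y_pred, eps), 1 - eps)
    return -(y_true * math.log(y_pred) + (1 - y_true) * math.log(1 - y_pred))

# Hypothetical Row 1: actual class 1, predicted probability 0.65.
print(log_loss_single(1, 0.65))   # about 0.43 -- small error
# Hypothetical row where the model is confidently wrong.
print(log_loss_single(1, 0.05))   # about 3.0  -- large error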
The loss is high for w1 = 1, w2 = 1, so we need to apply gradient descent.
Implementation of activation functions in Python
Implementation of loss functions in Python
Now we start implementing gradient descent in plain Python. Again, the goal is to come up with the same w1, w2, and bias that the Keras model calculated. We want to show how Keras/TensorFlow would have computed these values internally using gradient descent.

First, write a couple of helper routines, such as sigmoid and log_loss:
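A plausible version of these helpers (a sketch; the slide's original code is not reproduced in this export):

import numpy as np

def sigmoid_numpy(x):
    # Element-wise sigmoid over a NumPy array of weighted sums.
    return 1 / (1 + np.exp(-x))

def log_loss(y_true, y_pred):
    # Binary cross-entropy; epsilon keeps log() away from 0 and 1.
    eps = 1e-15
    y_pred = np.clip(y_pred, eps, 1 - eps)
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))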


Now comes the time to implement our own custom neural network class.
This shows that, in the end, we were able to come up with the same values of w1, w2, and bias using a plain Python implementation of the gradient descent function.
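The slide's own class is not included in this export; a minimal sketch of such a class (a single sigmoid neuron with two input features, trained by batch gradient descent on log loss, reusing the sigmoid_numpy helper above) might look like this:

class MyNN:
    def __init__(self):
        # Start from the same initial guesses as in the earlier example: w1 = w2 = 1, bias = 0.
        self.w1 = 1.0
        self.w2 = 1.0
        self.bias = 0.0

    def fit(self, x1, x2, y_true, epochs, lr=0.5):
        for _ in range(epochs):
            weighted_sum = self.w1 * x1 + self.w2 * x2 + self.bias
            y_pred = sigmoid_numpy(weighted_sum)
            # Gradients of the log loss with respect to each parameter of a sigmoid neuron.
            w1_grad = np.mean(x1 * (y_pred - y_true))
            w2_grad = np.mean(x2 * (y_pred - y_true))
            bias_grad = np.mean(y_pred - y_true)
            self.w1 -= lr * w1_grad
            self.w2 -= lr * w2_grad
            self.bias -= lr * bias_grad
        return self.w1, self.w2, self.bias

    def predict(self, x1, x2):
        return sigmoid_numpy(self.w1 * x1 + self.w2 * x2 + self.bias)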

You can compare the predictions from our own custom model and the TensorFlow model. You will notice that the predictions are almost the same.
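A usage sketch with made-up data (the lecture's actual dataset and the trained Keras model are not included here); it relies on the MyNN class and helpers sketched above:

import numpy as np

# Hypothetical two-feature dataset with a 0/1 label.
rng = np.random.default_rng(0)
x1 = rng.random(200)
x2 = rng.random(200)
y = (x1 + x2 > 1.0).astype(float)

model = MyNN()
print(model.fit(x1, x2, y, epochs=5000, lr=0.5))   # learned w1, w2, bias

custom_preds = model.predict(x1, x2)
# Comparing custom_preds with the Keras model's predict() output on the same data
# should show nearly identical probabilities.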


LINKS
 https://www.analyticsvidhya.com/blog/2021/08/understanding-linear-regression-with-mathematical-insights/
 https://youtu.be/xq7aULLsCtw
 https://youtu.be/1-OGRohmH2s
 https://towardsdatascience.com/why-not-mse-as-a-loss-function-for-logistic-regression-589816b5e03c
 https://www.baeldung.com/cs/cost-function-logistic-regression-logarithmic-expr#:~:text=Mean%20Squared%20Error%2C%20commonly%20used,function%20is%20however%20always%20convex
