Logistic Regression: Gradient Descent Example

The document describes a step-by-step process for updating weights and bias in a binary classification model using a dataset with four samples. It includes calculations for the forward pass, binary cross-entropy cost, gradients, and updates for weights and bias across multiple samples. After one epoch, the final updated parameters are a weight of 0.0077 and a bias of 0.0561.

Uploaded by

manchestermilf1
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Logistic Regression_ Gradient Descent_ Example

The document describes a step-by-step process for updating weights and bias in a binary classification model using a dataset with four samples. It includes calculations for the forward pass, binary cross-entropy cost, gradients, and updates for weights and bias across multiple samples. After one epoch, the final updated parameters are a weight of 0.0077 and a bias of 0.0561.

Uploaded by

manchestermilf1
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Example (1)

Assume
• We have one data point, with feature 𝒙 = 𝟎.𝟓
• Target label 𝒚 = 𝟏.
• Initial weights 𝒘 = 𝟎. 𝟐
• Initial bias 𝒃 = 𝟎. 𝟏.
• Learning rate 𝜶 = 𝟎. 𝟏.

---------------------------------------------------------------------------------------------------------------------
Step 1: Forward Pass
1 Calculate the linear combination 𝑧 :
𝑧 = 𝑤 ⋅ 𝑥 + 𝑏 = 0.2 ⋅ 0.5 + 0.1 = 0.2
2 Apply the sigmoid function 𝜎(𝑧) to get the prediction 𝑦ˆ :
𝑦ˆ = 𝜎(𝑧) = 1/(1 + 𝑒^(−𝑧)) = 1/(1 + 𝑒^(−0.2)) ≈ 0.5498
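
A minimal Python sketch of this forward pass (the helper name sigmoid and the variable names are illustrative, not from the original):

import math

def sigmoid(z):
    # Plain sigmoid; fine for the small z values used in this example.
    return 1.0 / (1.0 + math.exp(-z))

w, b = 0.2, 0.1      # initial weight and bias
x, y = 0.5, 1        # the single data point and its label

z = w * x + b        # linear combination: 0.2 * 0.5 + 0.1 = 0.2
y_hat = sigmoid(z)   # prediction: sigma(0.2) ≈ 0.5498
print(round(z, 4), round(y_hat, 4))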

Step 2: Compute the Cost (Binary Cross-Entropy)


The Binary Cross-Entropy (BCE) cost function for one data point is:
BCE = −(𝑦 ⋅ log⁡(𝑦ˆ) + (1 − 𝑦) ⋅ log⁡(1 − 𝑦ˆ))
Plugging in 𝑦 = 1 and 𝑦ˆ ≈ 0.5498 :
BCE ≈ −(1 ⋅ log⁡(0.5498) + (1 − 1) ⋅ log⁡(1 − 0.5498)) = −log⁡(0.5498) ≈ 0.5981
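
The same cost in a short Python sketch, reusing the (rounded) prediction from Step 1 (names are illustrative):

import math

y, y_hat = 1, 0.5498   # label and prediction from Step 1

# Binary cross-entropy for a single example, natural log as above
bce = -(y * math.log(y_hat) + (1 - y) * math.log(1 - y_hat))
print(round(bce, 4))   # ≈ 0.598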

Step 3: Compute Gradients


To update the weights, we need the gradients of the BCE cost with respect to 𝑤 and 𝑏.
1 Gradient with respect to 𝑤 :
∂BCE/∂𝑤 = (𝑦ˆ − 𝑦) ⋅ 𝑥 = (0.5498 − 1) ⋅ 0.5 = −0.2251
2 Gradient with respect to 𝑏 :
∂BCE/∂𝑏 = 𝑦ˆ − 𝑦 = 0.5498 − 1 = −0.4502
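
The two gradients as a short Python sketch, with values carried over from the previous steps (names are illustrative):

x, y, y_hat = 0.5, 1, 0.5498   # feature, label, prediction

# dBCE/dw = (y_hat - y) * x and dBCE/db = (y_hat - y)
grad_w = (y_hat - y) * x       # ≈ -0.2251
grad_b = y_hat - y             # ≈ -0.4502
print(round(grad_w, 4), round(grad_b, 4))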

Step 4: Update Weights and Bias


Using the learning rate 𝛼 = 0.1, we update 𝑤 and 𝑏 as follows:
1 Update 𝑤 :
𝑤 = 𝑤 − 𝛼 ⋅ ∂BCE/∂𝑤 = 0.2 − 0.1 ⋅ (−0.2251) = 0.2 + 0.0225 = 0.2225
2 Update 𝑏 :
𝑏 = 𝑏 − 𝛼 ⋅ ∂BCE/∂𝑏 = 0.1 − 0.1 ⋅ (−0.4502) = 0.1 + 0.0450 = 0.1450
Summary of Updated Parameters
After one iteration, the updated weights and bias are:
• 𝑤 = 0.2225
• 𝑏 = 0.1450
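
Putting the whole iteration together, a sketch of one gradient-descent step for this example (illustrative names, not a reference implementation):

import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

w, b, alpha = 0.2, 0.1, 0.1    # initial parameters and learning rate
x, y = 0.5, 1                  # the single data point

y_hat = sigmoid(w * x + b)     # forward pass
grad_w = (y_hat - y) * x       # gradient w.r.t. w
grad_b = y_hat - y             # gradient w.r.t. b
w -= alpha * grad_w            # 0.2 + 0.0225
b -= alpha * grad_b            # 0.1 + 0.0450
print(round(w, 4), round(b, 4))   # ≈ 0.2225, 0.1450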
Example (2)

The dataset with four samples:


Sample 𝑥 𝑦
1 0.5 1
2 1.5 0
3 2.0 1
4 3.0 0

Initial Conditions:
• Initial weight 𝑤 = 0.2
• Initial bias 𝑏 = 0.1
• Learning rate 𝛼 = 0.1
Goal:
We'll update the weights for each sample and go through one epoch of training.
---------------------------------------------------------------------------------------------------------------------

Step 1: Forward Pass, Prediction, and Cost Calculation


For each sample, we'll calculate the prediction 𝑦ˆ and the Binary Cross-Entropy cost.
Sample 1:
1 Calculate the linear combination 𝑧 :
𝑧 = 𝑤 ⋅ 𝑥 + 𝑏 = 0.2 ⋅ 0.5 + 0.1 = 0.2
2 Apply the sigmoid function to get 𝑦ˆ :
𝑦ˆ = 𝜎(𝑧) = 1/(1 + 𝑒^(−0.2)) ≈ 0.5498
3 Compute the BCE Cost:
BCE = −(𝑦 ⋅ log⁡(𝑦ˆ) + (1 − 𝑦) ⋅ log⁡(1 − 𝑦ˆ))
With 𝑦 = 1 and 𝑦ˆ ≈ 0.5498 :
BCE ≈ −log⁡(0.5498) ≈ 0.5981
Sample 2:
1 Calculate 𝑧 :
𝑧 = 𝑤 ⋅ 𝑥 + 𝑏 = 0.2 ⋅ 1.5 + 0.1 = 0.4
2 Apply the sigmoid to get 𝑦ˆ :
𝑦ˆ = 𝜎(𝑧) = 1/(1 + 𝑒^(−0.4)) ≈ 0.5987
3 Compute the BCE Cost: With 𝑦 = 0 and 𝑦ˆ ≈ 0.5987 :
BCE ≈ −log⁡(1 − 0.5987) ≈ 0.9130

Sample 3:
1 Calculate 𝑧 :
𝑧 = 𝑤 ⋅ 𝑥 + 𝑏 = 0.2 ⋅ 2.0 + 0.1 = 0.5
2 Apply the sigmoid to get 𝑦ˆ :
𝑦ˆ = 𝜎(𝑧) = 1/(1 + 𝑒^(−0.5)) ≈ 0.6225
3 Compute the BCE Cost: With 𝑦 = 1 and 𝑦ˆ ≈ 0.6225 :
BCE ≈ −log⁡(0.6225) ≈ 0.4741
Sample 4:
1 Calculate 𝑧 :
𝑧 = 𝑤 ⋅ 𝑥 + 𝑏 = 0.2 ⋅ 3.0 + 0.1 = 0.7
2 Apply the sigmoid to get 𝑦ˆ :
𝑦ˆ = 𝜎(𝑧) = 1/(1 + 𝑒^(−0.7)) ≈ 0.6682
3 Compute the BCE Cost: With 𝑦 = 0 and 𝑦ˆ ≈ 0.6682 :
BCE ≈ −log⁡(1 − 0.6682) ≈ 1.1032
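
The four forward passes and costs above can be reproduced with a short loop; as in this step, every prediction uses the initial 𝑤 = 0.2 and 𝑏 = 0.1 (names are illustrative):

import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

w, b = 0.2, 0.1
data = [(0.5, 1), (1.5, 0), (2.0, 1), (3.0, 0)]   # (x, y) pairs from the table

for i, (x, y) in enumerate(data, start=1):
    z = w * x + b                                  # linear combination
    y_hat = sigmoid(z)                             # prediction
    bce = -(y * math.log(y_hat) + (1 - y) * math.log(1 - y_hat))
    print(f"Sample {i}: z={z:.1f}, y_hat={y_hat:.4f}, BCE={bce:.4f}")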

Step 2: Compute Gradients for Each Sample


Now we'll compute the gradients of the BCE cost with respect to 𝑤 and 𝑏 for each sample.
Sample 1:
1 Gradient with respect to 𝑤 :
∂BCE/∂𝑤 = (𝑦ˆ − 𝑦) ⋅ 𝑥 = (0.5498 − 1) ⋅ 0.5 = −0.2251
2 Gradient with respect to 𝑏 :
∂BCE/∂𝑏 = 𝑦ˆ − 𝑦 = 0.5498 − 1 = −0.4502
Sample 2:
1 Gradient with respect to 𝑤 :
∂BCE/∂𝑤 = (𝑦ˆ − 𝑦) ⋅ 𝑥 = (0.5987 − 0) ⋅ 1.5 = 0.8980
2 Gradient with respect to 𝑏 :
∂BCE/∂𝑏 = 𝑦ˆ − 𝑦 = 0.5987 − 0 = 0.5987
Sample 3:
1 Gradient with respect to 𝑤 :
∂BCE/∂𝑤 = (𝑦ˆ − 𝑦) ⋅ 𝑥 = (0.6225 − 1) ⋅ 2.0 = −0.755
2 Gradient with respect to 𝑏 :
∂BCE/∂𝑏 = 𝑦ˆ − 𝑦 = 0.6225 − 1 = −0.3775
Sample 4:
1 Gradient with respect to 𝑤 :
∂BCE/∂𝑤 = (𝑦ˆ − 𝑦) ⋅ 𝑥 = (0.6682 − 0) ⋅ 3.0 = 2.0046
2 Gradient with respect to 𝑏 :
∂BCE/∂𝑏 = 𝑦ˆ − 𝑦 = 0.6682 − 0 = 0.6682
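
A corresponding sketch for the per-sample gradients, reusing the (rounded) predictions from Step 1 (names are illustrative):

preds = [0.5498, 0.5987, 0.6225, 0.6682]          # predictions from Step 1
data  = [(0.5, 1), (1.5, 0), (2.0, 1), (3.0, 0)]  # (x, y) pairs

for i, ((x, y), y_hat) in enumerate(zip(data, preds), start=1):
    grad_w = (y_hat - y) * x   # dBCE/dw
    grad_b = y_hat - y         # dBCE/db
    print(f"Sample {i}: grad_w={grad_w:.4f}, grad_b={grad_b:.4f}")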

Step 3: Update Weights and Bias


Using the gradients and learning rate, we update 𝑤 and 𝑏 for each sample.
After Sample 1 Update:
1 Update 𝑤 :
𝑤 = 𝑤 − 𝛼 ⋅ ∂BCE/∂𝑤 = 0.2 − 0.1 ⋅ (−0.2251) = 0.2 + 0.0225 = 0.2225
2 Update 𝑏 :
𝑏 = 𝑏 − 𝛼 ⋅ ∂BCE/∂𝑏 = 0.1 − 0.1 ⋅ (−0.4502) = 0.1 + 0.0450 = 0.1450
After Sample 2 Update:
1 Update 𝑤 :
𝑤 = 0.2225 − 0.1 ⋅ 0.8980 = 0.2225 − 0.0898 = 0.1327
2 Update 𝑏 :
𝑏 = 0.1450 − 0.1 ⋅ 0.5987 = 0.1450 − 0.0599 = 0.0851

After Sample 3 Update:


1 Update 𝑤 :
𝑤 = 0.1327 − 0.1 ⋅ (−0.755) = 0.1327 + 0.0755 = 0.2082
2 Update 𝑏 :
𝑏 = 0.0851 − 0.1 ⋅ (−0.3775) = 0.0851 + 0.03775 = 0.1229
After Sample 4 Update:
1 Update 𝑤 :
𝑤 = 0.2082 − 0.1 ⋅ 2.0046 = 0.2082 − 0.2005 = 0.0077
2 Update 𝑏 :
𝑏 = 0.1229 − 0.1 ⋅ 0.6682 = 0.1229 − 0.0668 = 0.0561
Summary of Updated Parameters
After one epoch, the updated weights and bias are:
• 𝑤 = 0.0077
• 𝑏 = 0.0561
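
A sketch of the whole epoch, mirroring the procedure above: predictions and gradients are evaluated at the initial parameters (Steps 1–2) and the four updates are then applied one after another (Step 3). Names are illustrative.

import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

alpha = 0.1
w0, b0 = 0.2, 0.1
data = [(0.5, 1), (1.5, 0), (2.0, 1), (3.0, 0)]

# Steps 1-2: gradients for every sample, evaluated at the initial parameters
grads = [((sigmoid(w0 * x + b0) - y) * x, sigmoid(w0 * x + b0) - y)
         for x, y in data]

# Step 3: apply the four updates in turn
w, b = w0, b0
for grad_w, grad_b in grads:
    w -= alpha * grad_w
    b -= alpha * grad_b
    print(f"w={w:.4f}, b={b:.4f}")
# Final values: w ≈ 0.0078, b ≈ 0.0561 (the 0.0077 in the text comes from
# rounding each intermediate update to four decimals).

Note that in a standard per-sample (stochastic) pass, the prediction would be recomputed with the already-updated 𝑤 and 𝑏 before each sample, which gives slightly different numbers.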
