
Machine learning Algorithms

Linear regression with one variable


 Model Representation

 Cost Function

 Gradient Descent
Housing Prices

[Figure: scatter plot of housing prices — price ($) in 1000's on the dependent (y) axis, house size on the independent (x) axis]

Supervised Learning: "right answers" or labeled data are given.
Regression: predict a continuous-valued output (price).

Training set of housing prices (Portland, OR):

Size in feet² (x)     Price ($) in 1000's (y)
2104                  460
1416                  232
1534                  315
852                   178
...                   ...
(m training examples)

Notation:
m = number of training examples
x's = "input" variable / features
y's = "output" variable / "target" variable
(x, y) = one training example (one row)
(x^(i), y^(i)) = the i-th training example

Example: x^(1) = 2104, y^(2) = 232, x^(4) = 852
Training set → Learning algorithm → h;   x → h → estimated y

The job of a learning algorithm is to output a function, usually denoted by a
lowercase h, where h stands for hypothesis.

The job of the hypothesis function is to take the value of x and try to output
the estimated value of y. So h is a function that maps from x's to y's.
How do we represent h ?

Linear Equations

y = f(x) = θ0 + θ1·x

θ1 = slope = change in Y (ΔY) / change in X (ΔX)
θ0 = Y-intercept
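As a concrete illustration of this representation, here is a minimal Python sketch of the one-variable hypothesis; the function name and the parameter values are illustrative, not from the slides.

```python
def h(x, theta0, theta1):
    """Hypothesis for linear regression with one variable: h_theta(x) = theta0 + theta1 * x."""
    return theta0 + theta1 * x

# Illustrative parameters: intercept 50, slope 0.1 maps a 2104 sq-ft house
# to a predicted price of 260.4 (in $1000's)
print(h(2104, 50, 0.1))
```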
Types of Regression Models

[Figure: four example scatter plots — Positive Linear Relationship, Negative Linear Relationship, Relationship NOT Linear, No Relationship]

The cost function lets us figure out how to fit the best possible straight line
to our data.
Training Set

Size in feet² (x)     Price ($) in 1000's (y)
2104                  460
1416                  232
1534                  315
852                   178
...                   ...

Hypothesis: hθ(x) = θ0 + θ1·x

How to choose the θi's?
Scatter plot

• 1. Plot of all (Xi, Yi) pairs
• 2. Suggests how well the model will fit

[Figure: scatter plot of the (x, y) pairs]
Thinking Challenge

How would you draw a line through the points?
How do you determine which line ‘fits best’?

[Figure: the same scatter plot repeated with different candidate lines — one with the intercept changed but the slope unchanged, and one with the slope changed]
Least Squares

• 1. ‘Best fit’ means the difference between the actual Y values and the
predicted Y values is a minimum. So square the errors!

    Σ_{i=1}^{m} (Yi − h(xi))² = Σ_{i=1}^{m} ε̂i²

• 2. Least squares minimizes the Sum of the Squared Errors (SSE).
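As an illustration of the squared-error idea, the sketch below computes the sum of squared errors (SSE) of a candidate line on the housing data above; the function name and the candidate parameters are assumptions for this example.

```python
def sse(xs, ys, theta0, theta1):
    """Sum of squared differences between actual y values and predicted y values."""
    return sum((y - (theta0 + theta1 * x)) ** 2 for x, y in zip(xs, ys))

sizes  = [2104, 1416, 1534, 852]   # x: size in feet^2
prices = [460, 232, 315, 178]      # y: price in $1000's

# SSE for an illustrative candidate line h(x) = 0 + 0.2*x
print(sse(sizes, prices, 0.0, 0.2))
```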
Least Squares Graphically

LS minimizes Σ_{i=1}^{4} ε̂i² = ε̂1² + ε̂2² + ε̂3² + ε̂4²

[Figure: four data points around the fitted line hθ(xi) = θ0 + θ1·xi, with the vertical residuals ε̂1 … ε̂4 marked]
Least Squared Errors Linear Regression

Minimize the squared differences between the predictions on the training set and
the actual values.

Idea: choose θ0, θ1 so that hθ(x) is close to y for our training examples (x, y).

Minimize J(θ0, θ1) = (1/2m) Σ_{i=1}^{m} (hθ(x^(i)) − y^(i))²
Cost function visualization

Consider a simple case of the hypothesis by setting θ0 = 0, so h becomes
hθ(x) = θ1·x. Each value of θ1 corresponds to a different hypothesis, since it is
the slope of the line: each value gives a different line passing through the
origin (the y-intercept θ0 is nulled out).

Using the three training points (1,1), (2,2), (3,3) implied by the J(0.5)
calculation:

At θ1 = 2,    J(2)   = (1/(2·3)) (1² + 2² + 3²)     ≈ 2.33
At θ1 = 1,    J(1)   = 0
At θ1 = 0.5,  J(0.5) = (1/(2·3)) (0.5² + 1² + 1.5²) ≈ 0.58

Plotting points like this further gives the following graph for the cost function,
which depends on the parameter θ1; each value of θ1 corresponds to a different
hypothesis.

[Figure: plot of J(θ1) against θ1, a bowl-shaped curve with its minimum at θ1 = 1]
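Assuming the three training points (1,1), (2,2), (3,3) implied by the J(0.5) arithmetic, this sketch reproduces the cost values above; the function and variable names are mine.

```python
def J(theta1, xs, ys):
    """Cost for the simplified hypothesis h(x) = theta1 * x (theta0 = 0)."""
    m = len(xs)
    return sum((theta1 * x - y) ** 2 for x, y in zip(xs, ys)) / (2 * m)

xs, ys = [1, 2, 3], [1, 2, 3]
for t1 in (2.0, 1.0, 0.5):
    print(f"J({t1}) = {J(t1, xs, ys):.2f}")
# Prints approximately: J(2.0) = 2.33, J(1.0) = 0.00, J(0.5) = 0.58
```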
Cost function visualization

What is the optimal value of θ1, i.e. the one that minimizes J(θ1)?

It is clear that the best value is θ1 = 1, since J(1) = 0, which is the minimum.

How do we find the best value for θ1 in general?

Plotting? Not practical, especially in high dimensions.

The solutions:
1. Analytical solution: not applicable for large datasets (a sketch of the
   one-variable closed form is shown below).
2. Numerical solution, e.g. gradient descent.
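For contrast with the numerical route, here is a sketch of the analytical solution for the one-variable case, using the standard closed-form least-squares formulas; this closed form is textbook material, not something shown on these slides.

```python
def fit_closed_form(xs, ys):
    """Closed-form least-squares fit for one variable:
    theta1 = sum((x - x_mean)*(y - y_mean)) / sum((x - x_mean)^2)
    theta0 = y_mean - theta1 * x_mean
    """
    m = len(xs)
    x_mean = sum(xs) / m
    y_mean = sum(ys) / m
    theta1 = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
              / sum((x - x_mean) ** 2 for x in xs))
    theta0 = y_mean - theta1 * x_mean
    return theta0, theta1

# Illustrative use on the housing data
print(fit_closed_form([2104, 1416, 1534, 852], [460, 232, 315, 178]))
```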
Hypothesis:      hθ(x) = θ0 + θ1·x

Parameters:      θ0, θ1

Cost Function:   J(θ0, θ1) = (1/2m) Σ_{i=1}^{m} (hθ(x^(i)) − y^(i))²

Goal:            minimize J(θ0, θ1) over θ0, θ1
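A minimal Python sketch of this cost function; the variable names and the example parameter values are illustrative.

```python
def cost(theta0, theta1, xs, ys):
    """J(theta0, theta1) = 1/(2m) * sum over i of (h_theta(x_i) - y_i)^2."""
    m = len(xs)
    return sum((theta0 + theta1 * x - y) ** 2 for x, y in zip(xs, ys)) / (2 * m)

# Example on the housing data with an illustrative candidate (theta0, theta1)
print(cost(0.0, 0.2, [2104, 1416, 1534, 852], [460, 232, 315, 178]))
```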
Gradient Descent

An iterative solution, used not only in linear regression; it is actually used
all over the place in machine learning.

 Objective: minimize any function (here, the cost function J)

Have some function J(θ0, θ1)
Want min over θ0, θ1 of J(θ0, θ1)

Outline:
• Start with some θ0, θ1
• Keep changing θ0, θ1 to reduce J(θ0, θ1) until we hopefully end up at a minimum

Imagine that this is the landscape of a grassy park, and you want to get to the
lowest point in the park as rapidly as possible.

[Figure: surface plot of J(θ0, θ1); red means high, blue means low. From one
starting point, gradient descent walks downhill to a local minimum; with a
different starting point, it can end up at a different local minimum.]
Gradient Descent Algorithm

repeat until convergence {
    θj := θj − α · ∂/∂θj J(θ0, θ1)      (for j = 0 and j = 1)
}

• Where
  o := is the assignment operator
  o α is the learning rate, which basically defines how big the steps are
    during the descent
  o ∂/∂θj J(θ0, θ1) is the partial derivative term
  o j = 0, 1 represents the feature index number

Also, the parameters should be updated simultaneously, i.e. (as sketched in code
below),

    temp0 := θ0 − α · ∂/∂θ0 J(θ0, θ1)
    temp1 := θ1 − α · ∂/∂θ1 J(θ0, θ1)
    θ0 := temp0
    θ1 := temp1

[Figure: contour plot of J(θ0, θ1) shown alongside the update rules]
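The sketch below illustrates the simultaneous-update pattern in Python. The partial derivatives are estimated numerically here purely as a stand-in; the analytic derivatives for linear regression are derived on the following slides. The names and the example cost are illustrative.

```python
def numerical_partial(J, thetas, j, eps=1e-6):
    """Central-difference estimate of dJ/dtheta_j (stand-in for the analytic derivative)."""
    plus, minus = list(thetas), list(thetas)
    plus[j] += eps
    minus[j] -= eps
    return (J(*plus) - J(*minus)) / (2 * eps)

def gradient_step(J, theta0, theta1, alpha):
    """One simultaneous update: compute both temporaries first, then assign."""
    temp0 = theta0 - alpha * numerical_partial(J, (theta0, theta1), 0)
    temp1 = theta1 - alpha * numerical_partial(J, (theta0, theta1), 1)
    return temp0, temp1   # theta0 := temp0, theta1 := temp1

# Example with a simple bowl-shaped cost J(t0, t1) = t0^2 + t1^2
print(gradient_step(lambda t0, t1: t0**2 + t1**2, 1.0, 2.0, alpha=0.1))
```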
Intuition with a single parameter:   θ1 := θ1 − α · d/dθ1 J(θ1)

When the slope d/dθ1 J(θ1) is positive:  θ1 := θ1 − α·(+ve), so θ1 decreases,
moving toward the minimum.

When the slope is negative:  θ1 := θ1 − α·(−ve), so θ1 increases, again moving
toward the minimum.

[Figure: two J(θ1) curves, one with the current θ1 to the right of the minimum
(positive slope) and one with θ1 to the left (negative slope)]
Gradient descent algorithm applied to the Linear Regression Model

repeat until convergence {
    θj := θj − α · ∂/∂θj J(θ0, θ1)      (for j = 0 and j = 1)
}

∂/∂θj J(θ0, θ1) = ∂/∂θj [ (1/2m) Σ_{i=1}^{m} (hθ(x^(i)) − y^(i))² ]
                = ∂/∂θj [ (1/2m) Σ_{i=1}^{m} (θ0 + θ1·x^(i) − y^(i))² ]

j = 0:   ∂/∂θ0 J(θ0, θ1) = (1/m) Σ_{i=1}^{m} (hθ(x^(i)) − y^(i))
j = 1:   ∂/∂θ1 J(θ0, θ1) = (1/m) Σ_{i=1}^{m} (hθ(x^(i)) − y^(i)) · x^(i)
Gradient descent algorithm

repeat until convergence {
    θ0 := θ0 − α · (1/m) Σ_{i=1}^{m} (hθ(x^(i)) − y^(i))
    θ1 := θ1 − α · (1/m) Σ_{i=1}^{m} (hθ(x^(i)) − y^(i)) · x^(i)
}
"Batch" Gradient Descent

"Batch": Each step of gradient descent


uses all the training examples.
repeat until convergence {
m
9o : Oo

1
am w {i )
£ (M* ) y )
-

m
9\ 91 i
am E { ho { x {l)
)- y ) - x
{i)
^
}
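Putting the pieces together, here is a sketch of batch gradient descent for one-variable linear regression, run on the small (1,1), (2,2), (3,3) example used earlier; the learning rate and iteration count are illustrative choices, not values from the slides.

```python
def batch_gradient_descent(xs, ys, alpha=0.1, iterations=1000):
    """Each step uses all m training examples ("batch"), with a simultaneous update."""
    m = len(xs)
    theta0, theta1 = 0.0, 0.0
    for _ in range(iterations):
        errors = [theta0 + theta1 * x - y for x, y in zip(xs, ys)]
        temp0 = theta0 - alpha * sum(errors) / m
        temp1 = theta1 - alpha * sum(e * x for e, x in zip(errors, xs)) / m
        theta0, theta1 = temp0, temp1
    return theta0, theta1

# On the (1,1), (2,2), (3,3) example the fit approaches theta0 = 0, theta1 = 1
print(batch_gradient_descent([1, 2, 3], [1, 2, 3]))
```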
Example after implementing some iterations of gradient descent
[Figure: Iterations 1 through 11 — a sequence of plots of the fitted line over the
same data, showing the line gradually adjusting to fit the points as gradient
descent proceeds]
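To reproduce this kind of iteration-by-iteration picture, one can record the parameters after every step and redraw the line each time, roughly as in the sketch below (the recording logic and all names are mine, built on the batch update above).

```python
def gradient_descent_with_history(xs, ys, alpha=0.1, iterations=11):
    """Run gradient descent and keep (theta0, theta1) after every iteration for plotting."""
    m = len(xs)
    theta0, theta1 = 0.0, 0.0
    history = []
    for _ in range(iterations):
        errors = [theta0 + theta1 * x - y for x, y in zip(xs, ys)]
        theta0, theta1 = (theta0 - alpha * sum(errors) / m,
                          theta1 - alpha * sum(e * x for e, x in zip(errors, xs)) / m)
        history.append((theta0, theta1))
    return history

for i, (t0, t1) in enumerate(gradient_descent_with_history([1, 2, 3], [1, 2, 3]), start=1):
    print(f"Iteration {i}: h(x) = {t0:.3f} + {t1:.3f} x")
```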
Thanks
