02 Simple Regression
Topics
Simple regression
– For simplicity, say k=1. This is the situation where y depends on
only one explanatory variable (x).
Terminology for Simple Regression
The variable y is known as the dependent variable, the regressand, or the explained variable. The variable x is known as the independent variable, the regressor, or the explanatory variable.
Simple Regression: An Example
Suppose that we have data on the excess returns on a fund manager's
portfolio ("fund XXX") together with the excess returns on a market index.
We have some intuition that the beta on this fund is positive, and we
therefore want to determine whether there appears to be a relationship
between x (the market's excess return) and y (the fund's excess return)
given the data that we have. The first stage would be to form a scatter
plot of the two variables.
Graph (Scatter Diagram)
[Scatter plot: excess return on fund XXX (vertical axis, 0 to 45) against
excess return on market portfolio (horizontal axis, 0 to 25).]
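As a rough illustration, the Python sketch below produces this kind of scatter diagram. The data are simulated; the true slope of 1.5 and the noise level are arbitrary assumptions, not values from the example.

```python
import numpy as np
import matplotlib.pyplot as plt

# Simulated (hypothetical) excess returns; any positive true beta
# would produce the upward-sloping cloud the slide describes.
rng = np.random.default_rng(42)
market_excess = rng.uniform(0, 25, size=30)                    # x: excess return on market portfolio
fund_excess = 1.5 * market_excess + rng.normal(0, 4, size=30)  # y: excess return on fund XXX

plt.scatter(market_excess, fund_excess)
plt.xlabel("Excess return on market portfolio")
plt.ylabel("Excess return on fund XXX")
plt.title("Scatter diagram")
plt.show()
```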
Finding a Line of Best Fit
Choose intercept and slope so that the (vertical) distances from the
data points to the fitted line are minimised (so that the line fits the data
as closely as possible):
[Diagram: a fitted regression line through the scatter of data points, with
the vertical distance from a data point to the line marked.]
– yt denotes the actual data point t
– ŷt denotes the fitted value from the regression line
– ût denotes the residual, yt − ŷt
Finding a Line of Best Fit
The most common method used to fit a line to the data is known as
ordinary least squares (OLS): take each vertical distance, square it, and
minimise the total sum of the squares (hence "least squares").

With five observations, we want
$$\min \; \hat{u}_1^2 + \hat{u}_2^2 + \hat{u}_3^2 + \hat{u}_4^2 + \hat{u}_5^2, \quad \text{i.e. minimise} \; \sum_{t=1}^{5} \hat{u}_t^2.$$
This is known as the residual sum of squares (RSS).

The model is $y_t = \alpha + \beta x_t + u_t$, and the residual is $\hat{u}_t = y_t - \hat{y}_t$.
So minimising $\sum_t (y_t - \hat{y}_t)^2$ is equivalent to minimising
$\sum_t \hat{u}_t^2$ with respect to $\hat{\alpha}$ and $\hat{\beta}$.
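A minimal sketch of the objective being minimised, using a small made-up sample of five observations (the numbers are illustrative only):

```python
import numpy as np

# Hypothetical sample of five (x, y) observations, matching the
# five-term residual sum written out above.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

def rss(alpha_hat, beta_hat):
    """Residual sum of squares for a candidate fitted line."""
    y_fitted = alpha_hat + beta_hat * x   # fitted values y-hat_t
    u_hat = y - y_fitted                  # residuals u-hat_t = y_t - y-hat_t
    return np.sum(u_hat ** 2)

# OLS chooses the (alpha_hat, beta_hat) pair that minimises this quantity.
print(rss(0.0, 2.0))
print(rss(0.2, 1.9))
```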
Deriving the OLS Estimator
But $\hat{y}_t = \hat{\alpha} + \hat{\beta} x_t$, so let
$$L = \sum_t (y_t - \hat{y}_t)^2 = \sum_t (y_t - \hat{\alpha} - \hat{\beta} x_t)^2$$
We want to minimise $L$ with respect to $\hat{\alpha}$ and $\hat{\beta}$, so we
differentiate and set the derivatives to zero:
$$\frac{\partial L}{\partial \hat{\alpha}} = -2 \sum_t (y_t - \hat{\alpha} - \hat{\beta} x_t) = 0 \qquad (1)$$
$$\frac{\partial L}{\partial \hat{\beta}} = -2 \sum_t x_t (y_t - \hat{\alpha} - \hat{\beta} x_t) = 0 \qquad (2)$$
From (1), $\sum_t y_t - T\hat{\alpha} - \hat{\beta} \sum_t x_t = 0$.
But $\sum_t y_t = T\bar{y}$ and $\sum_t x_t = T\bar{x}$, so
$T\bar{y} - T\hat{\alpha} - \hat{\beta} T\bar{x} = 0$, i.e. $\hat{\alpha} = \bar{y} - \hat{\beta}\bar{x}$.
Deriving the OLS Estimator (cont’d)
From (2), $\sum_t x_t y_t - \hat{\alpha} \sum_t x_t - \hat{\beta} \sum_t x_t^2 = 0$.
Substituting $\hat{\alpha} = \bar{y} - \hat{\beta}\bar{x}$ and $\sum_t x_t = T\bar{x}$ gives
$$\sum_t x_t y_t - T\bar{x}\bar{y} + \hat{\beta} T\bar{x}^2 - \hat{\beta} \sum_t x_t^2 = 0$$
Deriving the OLS Estimator (cont’d)
Rearranging for $\hat{\beta}$,
$$\hat{\beta} \left( T\bar{x}^2 - \sum_t x_t^2 \right) = T\bar{x}\bar{y} - \sum_t x_t y_t$$
So overall we have
$$\hat{\beta} = \frac{\sum_t x_t y_t - T\bar{x}\bar{y}}{\sum_t x_t^2 - T\bar{x}^2} \qquad \text{and} \qquad \hat{\alpha} = \bar{y} - \hat{\beta}\bar{x}$$
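Translating the closed-form estimators directly into code, on the same hypothetical five-observation sample as before; numpy.polyfit serves only as an independent cross-check:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
T = len(y)

# beta_hat = (sum x_t y_t - T*xbar*ybar) / (sum x_t^2 - T*xbar^2)
beta_hat = (np.sum(x * y) - T * x.mean() * y.mean()) / (np.sum(x ** 2) - T * x.mean() ** 2)
# alpha_hat = ybar - beta_hat * xbar
alpha_hat = y.mean() - beta_hat * x.mean()

print(alpha_hat, beta_hat)
print(np.polyfit(x, y, 1))  # returns [slope, intercept]; should match (beta_hat, alpha_hat)
```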
The Assumptions Underlying the Classical Linear Regression Model
The PRF (population regression function) is $y_t = \beta_1 + \beta_2 x_t + u_t$
(written here with intercept $\beta_1$ and slope $\beta_2$ in place of $\alpha$ and $\beta$).
We observe data for xt, but since yt also depends on ut, we must be
specific about how the ut are generated.
We usually make the following set of assumptions about the ut's (the
unobservable error terms); a simulation sketch follows the list:
1. $E(u_t) = 0$: the errors have zero mean.
2. $\mathrm{Var}(u_t) = \sigma^2 < \infty$: the variance of the errors is constant and finite over all values of $x_t$ (homoskedasticity).
3. $\mathrm{Cov}(u_i, u_j) = 0$ for $i \neq j$: the errors are statistically independent of one another (zero autocorrelation).
4. $\mathrm{Cov}(u_t, x_t) = 0$: no relationship between the error and the corresponding $x$ variate.
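To make the assumptions concrete, here is a minimal simulation sketch that generates data satisfying them; the parameter values ($\beta_1 = 1$, $\beta_2 = 2$, $\sigma = 0.5$) are arbitrary choices for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
T = 100
x = rng.uniform(0, 10, size=T)    # regressor
u = rng.normal(0.0, 0.5, size=T)  # E(u_t)=0, Var(u_t)=0.25 constant and finite,
                                  # independent draws, generated independently of x
y = 1.0 + 2.0 * x + u             # PRF: y_t = beta_1 + beta_2 x_t + u_t
```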
Homoskedasticity vs. Heteroskedasticity
[Two panels contrasting homoskedastic errors (constant spread across xt)
with heteroskedastic errors (spread varying with xt).]
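A short sketch of the difference, again with simulated errors; the variance function $0.5 x_t$ used in the heteroskedastic case is just one arbitrary example of variance changing with $x_t$:

```python
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(1.0, 10.0, 200)

u_homo = rng.normal(0.0, 1.0, size=x.size)  # Var(u_t) = 1 for every t (assumption 2 holds)
u_hetero = rng.normal(0.0, 0.5 * x)         # std dev grows with x_t, so Var(u_t) is not constant
```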
Properties of the OLS Estimator
The least squares estimators $\hat{\alpha}$ and $\hat{\beta}$ are consistent. That is, the
estimates will converge to their true values as the sample size
increases to infinity. We need the assumptions $E(x_t u_t) = 0$ and
$\mathrm{Var}(u_t) = \sigma^2 < \infty$ to prove this. Consistency implies that, for any $\delta > 0$,
$$\lim_{T \to \infty} \Pr\left[\, \left| \hat{\beta} - \beta \right| > \delta \,\right] = 0$$
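A quick Monte Carlo sketch of what consistency looks like in practice; the true values ($\alpha = 1$, $\beta = 2$, unit error variance) are arbitrary assumptions:

```python
import numpy as np

rng = np.random.default_rng(2)
beta_true = 2.0

for T in (10, 100, 1_000, 10_000):
    x = rng.uniform(0, 10, size=T)
    y = 1.0 + beta_true * x + rng.normal(0.0, 1.0, size=T)
    # Closed-form OLS slope estimator from the derivation above
    beta_hat = (np.sum(x * y) - T * x.mean() * y.mean()) / (np.sum(x ** 2) - T * x.mean() ** 2)
    print(T, abs(beta_hat - beta_true))  # estimation error typically shrinks as T grows
```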
Estimator or Estimate?
An estimator is the formula used to calculate the coefficients; an
estimate is the actual numerical value obtained from a particular sample.
The estimators ($\hat{\alpha}$ and $\hat{\beta}$) are given by
$$\hat{\beta} = \frac{\sum_t x_t y_t - N\bar{x}\bar{y}}{\sum_t x_t^2 - N\bar{x}^2} \qquad \text{and} \qquad \hat{\alpha} = \bar{y} - \hat{\beta}\bar{x}$$
where the sample size is now denoted $N$ (previously $T$).

A natural estimator of the error variance $\sigma^2$ would be $\frac{1}{N} \sum_t \hat{u}_t^2$.
But this is a biased estimator of $\sigma^2$. An unbiased estimator is
$$s^2 = \frac{\sum_t \hat{u}_t^2}{N - 2}$$
where $\sum_t \hat{u}_t^2$ is the residual sum of squares and $N$ is the sample
size.
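Putting the two variance estimators side by side on the same hypothetical sample as earlier; the division by $N - 2$ reflects the two coefficients ($\hat{\alpha}$ and $\hat{\beta}$) estimated from the data:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
N = len(y)

beta_hat = (np.sum(x * y) - N * x.mean() * y.mean()) / (np.sum(x ** 2) - N * x.mean() ** 2)
alpha_hat = y.mean() - beta_hat * x.mean()

rss = np.sum((y - (alpha_hat + beta_hat * x)) ** 2)  # residual sum of squares
print(rss / N)        # biased: ignores the 2 estimated coefficients
print(rss / (N - 2))  # unbiased estimator s^2 of sigma^2
```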
Questions?