Chapter5-Multiple Linear Regression
Chapter5-Multiple Linear Regression
Introduction
Up to now, we have been dealing with regression relationships in which two
variables, dependent and one independent variables were involved. Multiple
linear regression analysis is merely an extension of simple linear regression.
The difference is that more than one independent variable is involved in the
relationship.
The multiple linear regression model is;
The term linear is used because the model is a linear function of the unknown
parameters, β 0 s.
Example Yield (Y) depend on many variables for example amount of rainfall
and amount of fertilizer. Our model will be of the form;
yi = β0 + β1 X1i + β2 X2i + i
Estimation of Parameters
There are many ways of estimating the parameters in a regression model.
However in this course we shall focus attention on the Least Squares method.
There are two to apply the LS method
1
However, with multiple linear regression, the matrix approach seem to be
more appropriate.
Y = Xβ + , where
y1 1 x1,1 . . . xp−1,1 β0
y2
1 x1,2 . . . xp−1,2
β1
. . . . .
Y = , X = , β = and
.
. . .
.
. . . . .
yn 1 x1,n . . . xp−1,n βp−1
1
2
.
=
.
.
n
Consequently, the random vector has expectation,
E [Y] = Xβ
Ŷ = Xβ̂
β̂ = (XT X)−1 XT Y
2
The estimate of the variance of β̂i is given by
(B) We reject H0 if
β̂i − b
t= q < tα (n − p)
V ar(β̂i )
(C) We reject H0 if
β̂i − b
t= q > tα (n − p)
V ar(β̂i )
SST = YT Y − nȳ 2
SSR = β̂ T XT Y − nȳ 2
3
and
SSE = YT Y − β̂ T XT Y
Once β has been estimated, the sum of squares can be easily computed.
Mean squares
The sum of squares divided by its degrees of freedom is called mean squares.
The two important mean squares are the Regression Mean Squares (MSR)
and Error Mean Squares (MSE) and these are given by
SSR
M SR =
p−1
SSE
M SE =
n−p
To test for the significance of the regression, our hypotheses are of the
form
H0 : β1 = β2 = ... = βp−1 = 0
H1 : βi 6= 0 for at least one i at α significance level
4
Test Statistic: F-ratio
Exercise 5.1
Consider the following data set, where Y is the dependent variable and X1
and X2 are the regressors
(c) Construct the ANOVA table and test for the significance of the regres-
sion line using α = 0.05.