Chap3 - Multiple Regression
Chap3 - Multiple Regression
Multiple Regression
Outline
1. Multiple Regression Equation
2. The Three-Variable Model: Notation and
Assumptions
3. OLS Estimation for the three-variable
model
4. Properties of OLS estimators
5. Goodness of fit –R2 and adjusted R2
6. More on Functional Form
7. Hypothesis Testing in Multiple
Regression
1. Multiple regression equation
• Y = One dependent variable (criterion)
• X = Two or more independent variables (predictor
variables).
• ui the stochastic disturbance term
• is the intercept
• measures the change in Y with respect to Xk,
holding other factors fixed.
Motivation for multiple regression
– Incorporate more explanatory factors into the
model
– Explicitly hold fixed other factors that otherwise
would be in u
– Allow for more flexible functional forms
• Example: Wage equation
Now measures effect of education explicitly holding experience fixed
Other factors
Other factors
Log of CEO salary Log sales Quadratic function of CEO tenure with firm
• An unbiased estimator of :
• If the t statistic exceeds the critical value, then you can reject the
null hypothesis; otherwise, you do not reject it
Example- Stata output
• Model: wage = f(educ,exper, tenure )
Example- Stata output
• Model: wage = f(educ,exper, tenure )
Example- Stata output
• We have
t = -4.958
Reject H0
7.3. Testing the Equality of Two Regression Coefficients
Method 2: F-test
• If the F statistics exceeds the critical value then you can reject
the null hypothesis; otherwise, you do not reject it.
7.3. Testing the Equality of Two Regression Coefficients
Method 3
• Example: Return to education at 2 year vs. at 4 year colleges
Test against .
. test exper=tenure
( 1) exper - tenure = 0
F( 1, 522) = 24.58
Prob > F = 0.0000
We reject the hypothesis that the two effects are
equal.
7.4. Restricted Least Squares: Testing Linear Equality
Restrictions
Batting average Home runs per year Runs batted in per year
against
Degrees of freedom in
the unrestricted model
• Discussion
– The three variables are „jointly significant“
– They were not significant when tested individually
• Test hypothesis that, after controlling for cigs, parity, and faminc, parents’
education has no effect on birth weight
7.5. Testing for Structural or Parameter Stability of
Regression Models: The Chow Test
• Now we have three possible regressions:
• Time period 1970–1981: Yt = λ1 + λ2Xt + u1t (1)
Time period 1982–1995: Yt = γ1 + γ2Xt + u2t (2)
Time period 1970–1995: Yt = α1 + α2Xt + ut (3)
• there is no difference between the two time periods. The mechanics
of the Chow test are as follows:
1. Estimate regression (3), obtain RSS3 with df = (n1 + n2 − k)
We call RSS3 the restricted residual sum of squares (RSSR) because it is obtained by
imposing the restrictions that λ1 = γ1 and λ2 = γ2, that is, the subperiod regressions
are not different.
2. Estimate Eq. (1) and obtain its residual sum of squares, RSS1, with df
= (n1 − k).
3. Estimate Eq. (2) and obtain its residual sum of squares, RSS2, with df
= (n2 − k).
7.5. Testing for Structural or Parameter Stability of
Regression Models: The Chow Test
4. The unrestricted residual sum of squares (RSSUR), that is,
RSSUR = RSS1 + RSS2 with df = (n1 + n2 − 2k)
5. F ratio: