Chapter 14, Multiple Regression Using Dummy Variables
The estimated multiple regression equation with two explanatory variables is

Ŷ = b0 + b1X1 + b2X2

where Ŷ is the estimated (or predicted) value of Y, b0 is the estimated intercept, and b1 and b2 are the estimated slope coefficients.

Graph of a Two-Variable Model
[Three-dimensional plot: the fitted plane Ŷ = b0 + b1X1 + b2X2, with Y on the vertical axis and X1 and X2 on the horizontal axes]
Example:

Simple Regression Results

                 Coefficients   Standard Error   t Stat
Intercept (b0)   165.0333581    16.50316094      10.000106
Lotsize (b1)     6.931792143    2.203156234      3.1463008

F-Value 9.89
Adjusted R Square 0.108
Standard Error 36.34

Multiple Regression Results

                 Coefficients   Standard Error   t Stat
Intercept        59.32299284    20.20765695      2.935669
Lotsize          3.580936283    1.794731507      1.995249
Rooms            18.25064446    2.681400117      6.806386

F-Value 31.23
Adjusted R Square 0.453
Standard Error 28.47

Check the size and significance level of the coefficients, the F-value, the R-Square, etc. You will see what the net effects are.
Using the Equation to Make Predictions

Predict the appraised value at average lot size (7.24) and average number of rooms (7.12):

App. Val. = 59.32 + 3.58(7.24) + 18.25(7.12) = 215.18, or $215,180

What is the total effect of a 2,000 sf increase in lot size and 2 additional rooms? (Lot size is measured in thousands of square feet, so a 2,000 sf increase is 2 units.)

Increase in app. value = (3.58)(2) + (18.25)(2) = 43.66, or $43,660
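As a quick check of the arithmetic above, a short Python sketch using the rounded coefficients from the table (lot size assumed to be in thousands of square feet):

```python
# Rounded coefficients from the multiple regression table
b0, b_lot, b_rooms = 59.32, 3.58, 18.25

def appraised_value(lotsize, rooms):
    """Predicted appraised value in $1,000s (lotsize in 1,000s of sf)."""
    return b0 + b_lot * lotsize + b_rooms * rooms

# Prediction at the averages: lot size 7.24, rooms 7.12
pred = appraised_value(7.24, 7.12)
print(round(pred, 2))                 # 215.18 -> $215,180

# Total effect of +2,000 sf of lot (= 2 units) and +2 rooms
effect = b_lot * 2 + b_rooms * 2
print(round(effect, 2))               # 43.66 -> $43,660
```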
Coefficient of Multiple Determination, r² and Adjusted r²

r² reports the proportion of total variation in Y explained by all X variables taken together (the model):

r²_Y.12..k = SSR / SST = regression sum of squares / total sum of squares

Adjusted r²

r² never decreases when a new X variable is added to the model
This can be a disadvantage when comparing models
What is the net effect of adding a new variable?
We lose a degree of freedom when a new X variable is added
Did the new X variable add enough explanatory power to offset
the loss of one degree of freedom?
Shows the proportion of variation in Y explained by all X variables, adjusted for the number of X variables used:

r²_adj = 1 − (1 − r²_Y.12..k) × (n − 1) / (n − k − 1)

(where n = sample size, k = number of independent variables)

Penalizes excessive use of unimportant independent variables
Smaller than r²
Useful in comparing models
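The adjustment can be sketched in Python; the sample values below (r² = 0.50, n = 52, k = 2) are made-up illustrations, not numbers from the example above:

```python
def adjusted_r2(r2, n, k):
    """r2_adj = 1 - (1 - r2) * (n - 1) / (n - k - 1)."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

# e.g. r2 = 0.50 with n = 52 observations and k = 2 predictors
print(round(adjusted_r2(0.50, 52, 2), 4))   # slightly below 0.50
```

Note that the adjusted value is always below r² (for k ≥ 1), which is how the formula penalizes adding weak predictors.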
Multiple Regression Assumptions
Assumptions:
The errors are normally distributed
Errors have a constant variance
The model errors are independent
Errors (residuals) from the regression model: e_i = (Y_i − Ŷ_i)

These residual plots are used in multiple regression:
Residuals vs. Ŷ_i
Residuals vs. X_1i
Residuals vs. X_2i
Residuals vs. time (if time series data)
Two-variable model

[Three-dimensional plot: the fitted plane Ŷ = b0 + b1X1 + b2X2 over the x_1i and x_2i axes, a sample observation Y_i, its predicted value Ŷ_i on the plane, and the residual e_i = (Y_i − Ŷ_i)]

The best-fit equation, Ŷ, is found by minimizing the sum of squared errors, Σe².
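A minimal sketch of the least-squares idea, using a tiny made-up data set and NumPy's least-squares solver: any coefficients other than the fitted ones give a larger sum of squared errors.

```python
import numpy as np

# Small illustrative data set (not from the slides)
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 5.0])
y  = np.array([6.1, 6.9, 12.2, 11.8, 17.1])

X = np.column_stack([np.ones_like(x1), x1, x2])   # columns: 1, X1, X2
b, *_ = np.linalg.lstsq(X, y, rcond=None)         # fitted b0, b1, b2

def sse(coeffs):
    e = y - X @ coeffs                            # residuals e_i = Y_i - Yhat_i
    return float(e @ e)

print(sse(b))                                 # minimal sum of squared errors
print(sse(b + np.array([0.1, 0.0, 0.0])))     # perturbing any coefficient raises SSE
```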
Are Individual Variables Significant?

Use t-tests of individual variable slopes. The test shows whether there is a linear relationship between the variable X_i and Y.

Hypotheses:
H_0: β_i = 0 (no linear relationship)
H_1: β_i ≠ 0 (linear relationship does exist between X_i and Y)

Test statistic (with n − k − 1 degrees of freedom):

t_(n−k−1) = (b_i − 0) / S_(b_i)

Confidence interval for the population slope β_i:

b_i ± t_(n−k−1) · S_(b_i)
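For example, using the Rooms row of the multiple regression table above, the t statistic is just the coefficient divided by its standard error:

```python
# Rooms row of the multiple regression table
b_rooms, se_rooms = 18.25064446, 2.681400117

t_stat = (b_rooms - 0) / se_rooms
print(round(t_stat, 6))   # matches the table's t Stat of 6.806386
```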
Using Dummy Variables
A dummy variable is a categorical
explanatory variable with two levels:
yes or no, on or off, male or female
coded as 0 or 1
Regression intercepts are different if the
variable is significant
Assumes equal slopes for other variables
If more than two levels, the number of
dummy variables needed is (number of
levels - 1)
Different Intercepts, Same Slope

[Graph: Y (sales) vs. X1; two parallel lines with slope b1, one with intercept b0 + b2 (Fire Place) and one with intercept b0 (No Fire Place)]

Fire Place (X2 = 1):    Ŷ = b0 + b1X1 + b2(1) = (b0 + b2) + b1X1
No Fire Place (X2 = 0): Ŷ = b0 + b1X1 + b2(0) = b0 + b1X1

If H_0: β_2 = 0 is rejected, then Fire Place has a significant effect on Values
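A small sketch of the intercept shift (the coefficients here are illustrative placeholders, not fitted values):

```python
# Illustrative coefficients; b2 multiplies the fireplace dummy X2
b0, b1, b2 = 50.0, 3.5, 12.0

def predict(x1, fireplace):
    """X2 = 1 if the house has a fireplace, else 0."""
    return b0 + b1 * x1 + b2 * (1 if fireplace else 0)

# Same slope b1 for both groups; the gap between the two lines
# equals b2 at every value of X1
print(predict(7.0, True) - predict(7.0, False))   # 12.0 = b2
print(predict(9.0, True) - predict(9.0, False))   # 12.0 = b2
```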
Interaction Between Explanatory
Variables
Hypothesizes interaction between pairs of X variables
Response to one X variable may vary at different levels of
another X variable
Contains two-way cross product terms
Effect of Interaction

Ŷ = b0 + b1X1 + b2X2 + b3X3
  = b0 + b1X1 + b2X2 + b3(X1X2)

Without the interaction term, the effect of X1 on Y is measured by β1
With the interaction term, the effect of X1 on Y is measured by β1 + β3X2
The effect changes as X2 changes
Example: Suppose X2 is a dummy variable and the estimated regression equation is

Ŷ = 1 + 2X1 + 3X2 + 4X1X2

Slopes are different if the effect of X1 on Y depends on the X2 value:

X2 = 1: Ŷ = 1 + 2X1 + 3(1) + 4X1(1) = 4 + 6X1
X2 = 0: Ŷ = 1 + 2X1 + 3(0) + 4X1(0) = 1 + 2X1

[Graph: the two lines plotted against X1 over the range 0 to 1.5]
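The example equation can be checked directly: the slope on X1 is 2 + 4X2, so it differs across the dummy's two levels.

```python
# Estimated equation from the example: Yhat = 1 + 2*X1 + 3*X2 + 4*X1*X2
def yhat(x1, x2):
    return 1 + 2 * x1 + 3 * x2 + 4 * x1 * x2

def slope_x1(x2):
    """Effect of X1 on Y: beta1 + beta3 * X2 = 2 + 4 * X2."""
    return 2 + 4 * x2

print(slope_x1(0), slope_x1(1))   # 2 when X2 = 0, 6 when X2 = 1
print(yhat(0, 1), yhat(1, 1))     # X2 = 1 line is 4 + 6*X1 -> 4, 10
```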