Lecture 9 Simple Regression
Lecture 9 Simple Regression
A Decision-Making Approach
7th Edition
Lecture 9
Introduction to Linear Regression
y y
x x
y y
x x
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-4
Scatter Plot Examples
(continued)
Strong relationships Weak relationships
y y
x x
y y
x x
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-5
Scatter Plot Examples
(continued)
No relationship
x
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-6
Correlation Coefficient
(continued)
x x x
r = -1 r = -.6 r=0
y y
x x
r = +.3
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. r = +1 Chap 14-9
Calculating the
Correlation Coefficient
Sample correlation coefficient:
r
( x x )( y y )
[ ( x x ) ][ ( y y ) ]
2 2
Tree n xy x y
Height, r
y 70 [n( x 2 ) ( x)2 ][n( y 2 ) ( y)2 ]
60
8(3142) (73)(321)
50
40
[8(713) (73)2 ][8(14111) (321) 2 ]
30
0.886
20
10
0
r = 0.886 → relatively strong positive
0 2 4 6 8 10 12 14
linear association between x and y
Trunk Diameter, x
Correlation between
Tree Height and Trunk Diameter
y β0 β1x ε
Variable
y y β0 β1x ε
Observed Value
of y for xi
εi Slope = β1
Predicted Value Random Error
of y for xi
for this x value
Intercept = β0
xi x
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-19
Estimated Regression Model
The sample regression line provides an estimate of
the population regression line
ŷ i b0 b1x variable
e 2
(y ŷ) 2
(y (b 0 b1x))
2
x 2
( x ) 2
and n
or
b0 y b1x
n xy x y
b1
n x 2 ( x ) 2
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-22
Interpretation of the
Slope and the Intercept
ANOVA
df SS MS F Significance F
Regression 1 18934.9348 18934.9348 11.0848 0.01039
Residual 8 13665.5652 1708.1957
Total 9 32600.5000
350
Slope
300
250
= 0.10977
200
150
100
50
Intercept 0
= 98.248 0 500 1000 1500 2000 2500 3000
Square Feet
Xi x
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-35
Coefficient of Determination, R2
The coefficient of determination is the portion
of the total variation in the dependent variable
that is explained by variation in the
independent variable
SSR
R 2 where 0 R 1
2
SST
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-36
Coefficient of Determination, R2
(continued)
Coefficient of determination
SSR sum of squares explained by regression
R 2
SST total sum of squares
R r 2 2
where:
R2 = Coefficient of determination
r = Simple correlation coefficient
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-37
Examples of Approximate
R2 Values
y
R2 = 1
x
R = +1
2
y
0 < R2 < 1
x
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-39
Examples of Approximate
R2 Values
(continued)
R2 = 0
y
No linear relationship
between x and y:
98.25 0.1098(2000)
317.85
The predicted price for a house with 2000
square feet is 317.85($1,000s) = $317,850
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 14-43
Types of Estimation