Linear Regression Example Data
Linear Regression Example Data
Square Feet
(X)
245
1400
312
1600
279
1700
308
1875
199
1100
219
1550
405
2350
324
2450
319
1425
255
1700
Statistics for Managers Using
Microsoft Excel, 5e 2008
Prentice-Hall, Inc.
Chap 13-1
Chap 13-2
Chap 13-3
Regression Statistics
Multiple R
0.76211
R Square
0.58082
Adjusted R Square
0.52842
Standard Error
41.33032
Observations
10
ANOVA
Regression
Residual
Statistics
for Managers
Using Microsoft Excel, 5e
TotalPrentice-Hall,
2008
Inc.
df
SS
MS
18934.9348
18934.9348
13665.5652
1708.1957
Chap 13-4
32600.5000
Significance F
11.0848
0.01039
Slope
= 0.10977
Intercept
= 98.248
Chap 13-5
Chap 13-6
Chap 13-7
Chap 13-8
Do not try to
extrapolate beyond
the range of
observed Xs
Statistics for Managers
Using Microsoft Excel, 5e
2008 Prentice-Hall,
Inc.
Chap 13-9
Measures of Variation
Total variation is made up of two par
ts:
SST
Total Sum of
Squares
SSR
Regression Sum of
Squares
SST ( Yi Y )2
SSE
Error Sum of
Squares
)2
SSR ( Yi Y )2 SSE ( Yi Y
i
where:
=Y
Mean value of the dependent variable
Y
= Predicted value of Y for the given Xi value
Chap 13-10
Coefficient of Determination, r
SST
total sum of squares
2
0 r 1
2
Chap 13-11
0.58082
SST 32600.5000
2
Regression Statistics
Multiple R
0.76211
R Square
0.58082
Adjusted R Square
0.52842
Standard Error
41.33032
Observations
10
ANOVA
df
SS
MS
Regression
18934.9348
18934.9348
Residual
13665.5652
1708.1957
32600.5000
Chap 13-12
Significance F
11.0848
0.01039
SYX
SSE
n2
Where
SSE = error sum of squares
n = sample size
Statistics for Managers
Using Microsoft Excel, 5e
2008 Prentice-Hall,
Inc.
Chap 13-13
(
Y
Y
)
i i
i 1
n2
0.76211
R Square
0.58082
Adjusted R Square
0.52842
Standard Error
S YX 41.33032
41.33032
Observations
10
ANOVA
Regression
Residual for Managers
Statistics
Using Microsoft Excel, 5e
Total
2008 Prentice-Hall,
Inc.
df
SS
MS
18934.9348
18934.9348
13665.5652
1708.1957
Chap 13-14
32600.5000
Significance F
11.0848
0.01039
small s YX
large s YX
Chap 13-15
Sb1
d.f.Chap
n13-16
2
1 = hypothesized slope
Sb1 = standard
error of the slope
Square Feet
(x)
245
1400
312
1600
279
1700
308
1875
199
1100
219
1550
405
2350
324
2450
319
1425
255
1700
Chap 13-17
H 1 : 1 0
Intercept
Square Feet
Coefficients
b1
Standard Error
Sb1
t Stat
98.24833
58.03348
1.69296
0.12892
0.10977
0.03297
3.32938
0.01039
b1 1 0.10977 0
t
3.32938
t
Sb1
0.03297
Statistics for Managers
Using Microsoft Excel, 5e
2008 Prentice-Hall,
Inc.
P-value
Chap 13-18
H0: 1 = 0
H1: 1 0
d.f. = 10- 2
=8
/2=.02
5
Reject H0
Decision: Reject H0
/2=.025
Do not reject
H0
0
-t/2
2.3060
Reject H
0
t/2
2.306 3.32
0
9
Chap 13-19
Intercept
H0: 1 = 0
H1: 1 0
Square Feet
Coefficients
Standard Error
t Stat
98.24833
58.03348
1.69296
0.12892
0.10977
0.03297
3.32938
0.01039
P-value
Chap 13-20
MSR
F
MSE
SSR
MSR
k
SSE
MSE
n k 1
Chap 13-21
0.76211
R Square
0.58082
Adjusted R Square
0.52842
Standard Error
41.33032
Observations
MSR 18934.9348
F
11.0848
MSE 1708.1957
With 1 and 8
degrees of
freedom
P-value
for the FTest
10
ANOVA
Regression
Statistics for Managers
Using
Microsoft Excel, 5e
Residual
2008 Prentice-Hall,
Inc.
df
SS
MS
18934.9348
18934.9348
13665.5652
Chap 13-22 1708.1957
F
11.0848
Significance F
0.01039
H 0 : 1 = 0
MSR
F
11.08
MSE
H 1 : 1 0
= .05
df2 = 8
df1= 1
Decision:
Reject H0 at =
0.05
Conclusion:
Critical Value:
F = 5.32
=.
05
Do not
Reject H0
reject
Statistics
for
Managers
H0
F =
.05 5e
Using Microsoft Excel,
2008 Prentice-Hall,
Inc.
5.32
b1 t n2Sb1
d.f. = n
-2
Intercept
Square Feet
Coefficients
Standard Error
t Stat
P-value
Lower 95%
Upper 95%
98.24833
58.03348
1.69296
0.12892
-35.57720
232.07386
0.10977
0.03297
3.32938
0.01039
0.03374
0.18580
Chap 13-24
Intercept
Square Feet
Standard Error
t Stat
P-value
Lower 95%
Upper 95%
98.24833
58.03348
1.69296
0.12892
-35.57720
232.07386
0.10977
0.03297
3.32938
0.01039
0.03374
0.18580
Chap 13-25
Confidence
Interval for
the mean of
Y, given Xi
Y = b0+b1Xi
Chap 13-26
Xi
1 (X i X) 2 1
(X i X) 2
hi
n
SSX
n (X i X) 2
Chap 13-27
Find the 95% confidence interval for the mean price of 2,000
square-foot houses
1
(Xi X)
Y t n-2S YX
317.85 37.12
2
n (Xi X)
Chap 13-29
Y t n-1S YX
1
(Xi X)2
1
317.85 102.28
2
n (Xi X)
Chap 13-30
H0 : = 0
H1 : 0
Test statistic
(with n 2rdegrees
of freedom)
-
where
1 r
n2
2
r r 2 if b1 0
r r 2 if b1 0
Chap 13-31
(correlation exists)
=.05 , df = 10 - 2 = 8
t
Statistics for Managers
Using Microsoft Excel, 5e
2008 Prentice-Hall,
Inc.
r
1 r2
n2
.762 0
1 .762 2
10 2
Chap 13-32
3.329
d.f. = 10- 2 =
8
/2=.025
Reject H0
-t/2
2.3060
Statistics for Managers
Using Microsoft Excel, 5e
2008 Prentice-Hall,
Inc.
/2=.025
Do not reject
H
00
Conclusion:
There is evidence
of a linear
Reject H
t/2
association at the
2.3060
3.329 5% level of
significance
0
Chap 13-33
Residual Analysis
ei Yi Yi
The residual for observation i, e i, is the difference betwe
en its observed and predicted value
Check the assumptions of regression by examining the re
siduals
Examine for Linearity assumption
Evaluate Independence assumption
Evaluate Normal distribution assumption
Examine Equal variance for all levels of X
Statistics
Graphical
Analysis of Residuals
for Managers
Using Microsoft Excel, 5e
Chap 13-34
Can
plot residuals vs. X
2008
Prentice-Hall,
Inc.
x
Not
Linear
residuals
residuals
x
Linear
Chap 13-35
residuals
residuals
Independent
residuals
Not Independent
Chap 13-36
Chap 13-37
residuals
residuals
Unequal variance
x
Equal variance
Chap 13-38
Residuals
251.92316
-6.923162
273.87671
38.12329
284.85348
-5.853484
304.06284
3.937162
218.99284
-19.99284
268.38832
-49.38832
356.20251
48.79749
367.17929
-43.17929
254.6674
64.33264
10
284.85348
-29.85348
Chap 13-39
Measuring Autocorrelation:
The Durbin-Watson Statistic
Used when data are collected over time
to detect if autocorrelation is present
Autocorrelation exists if residuals in on
e time period are related to residuals in
another period
Chap 13-40
Autocorrelation
Autocorrelation is correlation of the errors (r
esiduals) over time
statistically independent
Statistics for Managers
Using Microsoft Excel, 5e
2008 Prentice-Hall,
Inc.
Chap 13-41
Chap 13-42
Chap 13-43