Linear Regression
Scatter Plot Examples
[Figure: scatter plots of y against x in two panels, "Weak relationships" and "Strong relationships"]
[Figure: scatter plot of y against x, "No relationship at all"]
Correlation Coefficient
The correlation coefficient r is unit free.
[Figure: five scatter plots of y against x, illustrating r = -1, r = -0.6, r = 0, r = +0.3, and r = +1]
Calculating the Correlation Coefficient
Sample correlation coefficient:
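$$ r = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sqrt{\left[\sum (x - \bar{x})^2\right]\left[\sum (y - \bar{y})^2\right]}} = \frac{SS_{xy}}{\sqrt{SS_{xx}\, SS_{yy}}} $$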
where:
r = Sample correlation coefficient
n = Sample size
x = Value of the ‘independent’ variable
y = Value of the ‘dependent’ variable
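As a minimal sketch, the definition r = SS_xy / sqrt(SS_xx SS_yy) can be written directly in Python. The function name `sample_correlation` and the small data set are illustrative assumptions, not part of the slides.

```python
import math

def sample_correlation(x, y):
    """Sample correlation coefficient r = SS_xy / sqrt(SS_xx * SS_yy)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Sums of squares and cross-products of the deviations from the means
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    ss_xx = sum((xi - x_bar) ** 2 for xi in x)
    ss_yy = sum((yi - y_bar) ** 2 for yi in y)
    return ss_xy / math.sqrt(ss_xx * ss_yy)

# Illustrative data (an assumption): y increases exactly linearly with x,
# so r should come out at +1 up to floating-point rounding.
print(sample_correlation([1, 2, 3, 4], [2, 4, 6, 8]))
```

Because the deviations are divided by their own scale, multiplying x or y by a positive constant leaves r unchanged, which is what "unit free" means in practice.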
Example
x = child height (cm), y = child weight (kg)

   x      y      xy      x²     y²
  35      8     280    1225     64
  49      9     441    2401     81
  27      7     189     729     49
  33      6     198    1089     36
  60     13     780    3600    169
  21      7     147     441     49
  45     11     495    2025    121
  51     12     612    2601    144
Σ=321   Σ=73  Σ=3142 Σ=14111  Σ=713
Calculation Example
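$$ r = \frac{n\sum xy - \sum x \sum y}{\sqrt{\left[n\left(\sum x^2\right) - \left(\sum x\right)^2\right]\left[n\left(\sum y^2\right) - \left(\sum y\right)^2\right]}} $$

$$ r = \frac{8(3142) - (321)(73)}{\sqrt{\left[8(14111) - (321)^2\right]\left[8(713) - (73)^2\right]}} = 0.886 $$

[Figure: scatter plot of the eight data points, with child height on the vertical axis (0 to 70) and a horizontal axis running from 0 to 14]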
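The computational formula r = (nΣxy - ΣxΣy) / sqrt([nΣx² - (Σx)²][nΣy² - (Σy)²]) can be checked against the height and weight data from the example. This is a small sketch using only the standard library; the variable names are illustrative.

```python
import math

# Data from the example: x = child height (cm), y = child weight (kg)
x = [35, 49, 27, 33, 60, 21, 45, 51]
y = [8, 9, 7, 6, 13, 7, 11, 12]

n = len(x)
sum_x, sum_y = sum(x), sum(y)
sum_xy = sum(a * b for a, b in zip(x, y))
sum_x2 = sum(a * a for a in x)
sum_y2 = sum(b * b for b in y)

# Computational form of the sample correlation coefficient
numerator = n * sum_xy - sum_x * sum_y
denominator = math.sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
r = numerator / denominator

print(sum_x, sum_y, sum_xy, sum_x2, sum_y2)  # 321 73 3142 14111 713
print(round(r, 3))  # 0.886
```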
$$ t = \frac{r}{\sqrt{\dfrac{1 - r^2}{n - 2}}} = \frac{0.886}{\sqrt{\dfrac{1 - 0.886^2}{8 - 2}}} = 4.68 $$
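The test statistic t = r / sqrt((1 - r²) / (n - 2)) for the example (r = 0.886, n = 8) can be reproduced with a few lines of Python. The function name `correlation_t_statistic` is an illustrative assumption; the statistic is compared against a t distribution with n - 2 degrees of freedom.

```python
import math

def correlation_t_statistic(r, n):
    """t statistic for testing whether a correlation differs from zero."""
    return r / math.sqrt((1 - r ** 2) / (n - 2))

# Values from the example: r = 0.886 from n = 8 children
t = correlation_t_statistic(0.886, 8)
print(round(t, 2))  # 4.68
```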
Introduction to Regression Analysis
Regression analysis is used to:
- Predict the value of a dependent variable based on the value of at least one independent variable
- Explain the impact of changes in an independent variable on the dependent variable
$$ y = \beta_0 + \beta_1 x + \varepsilon $$

where β0 + β1x is the linear component and ε is the random error component.
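To make the two components concrete, the following sketch generates one observation as a linear component plus a random error. The normally distributed error with mean zero, the function name `simulate_y`, and all parameter values are assumptions chosen for illustration; the slides do not specify them.

```python
import random

def simulate_y(x, beta0, beta1, sigma, rng):
    """Return one observation y = beta0 + beta1 * x + error."""
    linear_component = beta0 + beta1 * x
    # Assumption: the random error is normal with mean 0 and std. dev. sigma
    random_error = rng.gauss(0.0, sigma)
    return linear_component + random_error

rng = random.Random(42)  # seeded so the run is repeatable
# Hypothetical population parameters: intercept 2.0, slope 0.2, error sd 1.5
for x in (20, 40, 60):
    print(x, round(simulate_y(x, 2.0, 0.2, 1.5, rng), 2))
```

Setting `sigma` to zero removes the random error, so every simulated y falls exactly on the line β0 + β1x.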
Population Linear Regression
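$$ y = \beta_0 + \beta_1 x + \varepsilon $$

[Figure: the population regression line plotted as y against x. At a given xi, the observed value of y differs from the predicted value of y on the line by the random error εi for this x value. The line has slope β1 and intercept β0.]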
Estimated Regression Model
The sample regression line provides an estimate of the
population regression line
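$$ \hat{y}_i = b_0 + b_1 x $$

where b0 and b1 are the sample estimates of β0 and β1, and x is the independent variable.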
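The slides do not show how b0 and b1 are obtained. A common choice, assumed here, is ordinary least squares, where b1 = SS_xy / SS_xx and b0 = ȳ - b1·x̄. The sketch below applies it to the child height and weight data from the correlation example; the function name `fit_line` is illustrative.

```python
def fit_line(x, y):
    """Least-squares estimates (b0, b1) for the line y-hat = b0 + b1 * x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    ss_xx = sum((xi - x_bar) ** 2 for xi in x)
    b1 = ss_xy / ss_xx          # estimated slope
    b0 = y_bar - b1 * x_bar     # estimated intercept
    return b0, b1

# Data from the correlation example: height (cm) as x, weight (kg) as y
height = [35, 49, 27, 33, 60, 21, 45, 51]
weight = [8, 9, 7, 6, 13, 7, 11, 12]

b0, b1 = fit_line(height, weight)
print(f"b0 = {b0:.3f}, b1 = {b1:.3f}")

# Predicted weight for a hypothetical child with x = 40
print(round(b0 + b1 * 40, 2))
```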
Multiple regression
Variables: %fat (dependent), age, sex
Coefficients

                  Unstandardized         Standardized                    95% Confidence
                  Coefficients           Coefficients                    Interval for B
Model             B         Std. Error   Beta           t       Sig.    Lower Bound   Upper Bound
1  (Constant)     15.625    3.283                       4.760   .000     8.728        22.522
   sex            16.594    3.670        .729           4.521   .000     8.883        24.305
2  (Constant)      6.209    5.331                       1.165   .260    -5.039        17.457
   sex            10.130    4.517        .445           2.243   .039      .600        19.659
   age              .309     .145        .424           2.136   .047      .004          .614
a. Dependent Variable: %age of body fat relative to body
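In a coefficients table of this kind, each t value is the ratio of the unstandardized coefficient B to its standard error. The sketch below recomputes the ratios for Model 2 from the printed values; because the table entries are rounded to three decimals, small differences in the last digit are expected (for age, 0.309 / 0.145 gives about 2.131 against the tabulated 2.136). The names `model_2` and `t_ratio` are illustrative.

```python
# (B, Std. Error) pairs copied from Model 2 of the coefficients table
model_2 = {
    "(Constant)": (6.209, 5.331),
    "sex": (10.130, 4.517),
    "age": (0.309, 0.145),
}

def t_ratio(b, std_error):
    """t statistic for a single regression coefficient."""
    return b / std_error

t_ratios = {name: t_ratio(b, se) for name, (b, se) in model_2.items()}
for name, t in t_ratios.items():
    print(f"{name}: t = {t:.3f}")
```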
Write the model for the output.
Interpret the findings.
Thank you!!