Statistics: Introduction To Regression
Statistics: Introduction To Regression
Three segments
Overview
Calculation of regression coefficients
Assumptions
Statistics One
Lecture 7
Introduction to Regression
1
Regression: Overview
Important concepts & topics
Simple regression vs. multiple regression
Regression equation
Regression model
Lecture 7 ~ Segment 1
Regression: Overview
10/12/13
Regression: Overview
Regression: Overview
IMPACT example
IMPACT provides data on 6 variables
Verbal memory
Visual memory
Visual motor speed
Reaction time
Impulse control
Symptom score
7
10/12/13
10
IMPACT example
For this example, assume:
Symptom Score is the outcome variable
Simple regression example:
Predict Symptom Score from just one variable
12
10/12/13
Regression equation
Regression equation
Y = m + bX + e
Y = B0 + B1X1 + e
Y is a linear function of X
m = intercept
b = slope
e = error (residual)
Y is a linear function of X1
B0 = intercept = regression constant
B1 = slope = regression coefficient
e = error (residual)
13
14
Model R and R2
IMPACT example
Y = B0 + B1X1 + e
R = rY
The correlation between the predicted scores
and the observed scores
R2
In R, function lm
16
10/12/13
IMPACT example
IMPACT example
= 20.48 + 1.43(X)
r = .40
R2 = 16%
17
18
Regression model
19
20
10/12/13
IMPACT example
IMPACT example
Y = B0 + B1X1 + B2X2 + e
R2 = 22%
In R, function lm
21
IMPACT example
22
Model R and R2
R = multiple correlation coefficient
R = rY
The correlation between the predicted scores
and the observed scores
R2
23
24
10/12/13
IMPACT example
Segment summary
Important concepts & topics
R2 = 22%
rY = .47
25
END SEGMENT
26
Lecture 7 ~ Segment 2
Calculation of regression coefficients
27
28
10/12/13
Estimation of coefficients
Estimation of coefficients
Regression equation:
Y = B0 + B1X1 + e
= B0 + B1X1
(Y ) = e (residual)
29
Estimation of coefficients
30
IMPACT example
31
32
10/12/13
Estimation of coefficients
Estimation of coefficients
SS.Y
SS.X
SS.Y
SS.X
33
34
Estimation of coefficients
Estimation of coefficients
SP.XY
SP.XY = SS.MODEL
SS.Y
SS.Y
SP.XY
SS.MODEL
SS.X
SS.X
35
36
10/12/13
Estimation of coefficients
Estimation of coefficients
SS.Y
SS.RESIDUAL
SS.MODEL
SS.X
37
38
Estimation of coefficients
Segment summary
Important concepts
SDy = SDx = 1
B = r x (SDy/ SDx)
= r
39
40
10
10/12/13
END SEGMENT
Lecture 7 ~ Segment 3
Assumptions
41
42
Assumptions
Assumptions
Reliability of X and Y
Validity of X and Y
Random and representative sampling
43
44
11
10/12/13
Assumptions
Anscombes quartet
45
46
Anscombes quartet
Anscombes quartet
= 3.00 + 0.50(X1)
e = (Y )
47
48
12
10/12/13
Anscombes quartet
Anscombes quartet
49
50
Segment summary
Assumptions when interpreting r
Normal distributions for Y
Linear relationship between X and Y
Homoscedasticity
Examine residuals to evaluate assumptions
END SEGMENT
51
52
13
10/12/13
END LECTURE 7
53
14