8 Student's T Test
8 Student's T Test
8 Student's T Test
=
l l
t
Inference :
There is no evidence that the sample is taken
from the population with mean weight of 24 gm
-
1
.
9
6
0
Area = .025
Area =.005
Z
-
2
.
5
7
5
Area = .025
Area = .005
1
.
9
6
2
.
5
7
5
Determining the p-Value
.9
5
t
0
f(t)
-1.96
1.96
.025
red area = rejection region for 2-sided test
Given below are the 24 hrs total energy
expenditure (MJ/day) in groups of lean and
obese women. Examine whether the obese
womens mean energy expenditure is
significantly higher ?.
Lean
6.1 7.0 7.5
7.5 5.5 7.6
7.9 8.1 8.1
8.1 8.4 10.2
10.9
T-test for difference in means
Obese
8.8 9.2 9.2
9.7 9.7 10.0
11.5 11.8 12.8
Two sample t-test
Difference
between means
Sample size
Variability
of data
t-test
t
+
+
T-test for difference in means
Null Hypothesis
Obese womens mean energy expenditure is
equal to the lean womens energy expenditure.
Test statistics :
t =
x
1
x
2
o
1
n
1
+
1
n
2
o =
(n
1
1)s
1
2
+ (n
2
1)s
2
2
n
1
+ n
2
2
x
x
o
1, 2 - means of sample 1 and sample 2
1, 2 sd of sample 1 and sample 2
n1 , n2 number of study subjects in sample 1 and
sample 2
o
t(n1+n2-2)
T-test for difference in means
Data Summary
lean Obese
N 13 9
8.10 10.30
S 1.38 1.25
82 . 3
13
25 . 1
9
32 . 1
3 . 10 1 . 8
2 2
=
+
=
l l
t
Inference : The cal t (3.82) is higher than tab t at
0.05, 20. ie 2.086 . This implies that there is a
evidence that the mean energy expenditure in obese
group is significantly (p<0.05) higher than that of
lean group
tab t 9+13-2 =20 df = t
0.05,20
=2.086
x
Example
Suppose we want to test the
effectiveness of a program designed
to increase scores on the quantitative
section of the Graduate Record Exam
(GRE). We test the program on a
group of 8 students. Prior to entering
the program, each student takes a
practice quantitative GRE; after
completing the program, each student
takes another practice exam. Based
on their performance, was the
program effective?
Each subject contributes 2 scores:
repeated measures design
Student Before Program After Program
1 520 555
2 490 510
3 600 585
4 620 645
5 580 630
6 560 550
7 610 645
8 480 520
Can represent each student with a single
score: the difference (D) between the
scores
Student
Before Program After Program
D
1 520 555 35
2 490 510 20
3 600 585 -15
4 620 645 25
5 580 630 50
6 560 550 -10
7 610 645 35
8 480 520 40
Approach: test the effectiveness of
program by testing significance of D
Alternative hypothesis: program is
effective scores after program will
be higher than scores before
program average D will be greater
than zero
H
0
:
D
0
H
1
:
D
> 0
Student
Before
Program
After
Program
D
D
2
1 520 555 35 1225
2 490 510 20 400
3 600 585 -15 225
4 620 645 25 625
5 580 630 50 2500
6 560 550 -10 100
7 610 645 35 1225
8 480 520 40 1600
D = 180 D
2
= 7900
So, need to know D and D
2
:
Recall that for single samples:
error standard
mean - score
=
=
X
obt
s
X
t
For related samples:
D
D
obt
s
D
t
=
where:
N
s
s
D
D
=
and
( )
1
2
2
N
N
D
D
s
D
( )
( )
45 . 23
1 8
8
180
7900
1
2
2
2
=
N
N
D
D
s
D
5 . 22
8
180
= = =
N
D
D
Standard deviation of D:
Mean of D:
Standard error:
2908 . 8
8
45 . 23
= = =
N
s
s
D
D
D
D
obt
s
D
t
=
Under H
0
,
D
= 0, so:
714 . 2
2908 . 8
5 . 22
= = =
D
obt
s
D
t
From Table B.2: for = 0.05, one-tailed, with df = 7,
t
crit
= 1.895
2.714 > 1.895 reject H
0
The program is effective.
t-Value
t is a measure of:
How difficult is it to believe the null hypothesis?
High t
Difficult to believe the null hypothesis -
accept that there is a real difference.
Low t
Easy to believe the null hypothesis -
have not proved any difference.
In Conclusion !
Student s t-test will be used:
--- When Sample size is small
and for the following situations:
(1) to compare the single sample mean
with the population mean
(2) to compare the sample means of
two indpendent samples
(3) to compare the sample means of
paired samples