As Las 5
As Las 5
As Las 5
Supposed we are interested in the effect of three teaching approaches (conventional, modular, blended
learning) to the performance of the students in Mathematics. If there is no difference between the different
approaches, then we would expect all of them to be approximately equal. Otherwise, we would expect
the mean performances to differ. Here, we introduce a situation that utilizes a test that is used to analyze
the data from more than two populations. These tests are used to deal with treatment (e.g. teaching
approaches) effects, including tests that take into account other factors that may affect the response (e.g.
Mathematics performance). The hypothesis that the population means are equal is considered equivalent
to the hypothesis that there is no difference in treatment effects. The analytical method we will use in such
problems is called the analysis of variance (ANOVA). Initial development of this method could be credited
to Sir Ronald A. Fisher who introduced this technique for the analysis of agricultural field experiments which
at present is also useful when dealing with social science and educational researches.
Two-way ANOVA
Now, consider a situation where it is of interest to study the effect of two factors, A and B, on some
response. For instance, in addition to the teaching approaches as factor affecting the Mathematics
performance of the students, we may also consider the time when their classes are schedule (i.e.
morning or afternoon). In this case, we will use a randomized block design, or the two-way analysis of
variance. It considers the case of 𝑛 replications of the treatment combinations (we term it here blocks)
determined by 𝑎 levels of factor 𝐴 (e.g. schedule) and 𝑏 levels of factor 𝐵 (e.g. teaching approach). This
means that there will be 𝑎𝑏 blocks in all. For each of these cases, it is important not only to determine if
the two factors have an influence on the response (e.g. Mathematics performance), but also if there is a
significant interaction between the two factors, hence, there are three pairs of hypotheses.
The two-way ANOVA table has the following format with the corresponding formula for the value that
falls on each of its cells.
𝑆𝑆𝑏 𝑀𝑆𝑏
Factor B 𝑆𝑆𝑏 = 𝑎𝑛 ∑(𝑦̅𝑏 − 𝑦̅)2 𝑑𝑓𝑏 = 𝑏 − 1 𝑀𝑆𝑏 =
𝑑𝑓𝑏
𝐹𝑏 =
𝑀𝑆𝑤
𝑆𝑆𝑎𝑏 𝑀𝑆𝑎𝑏
Interaction 𝑆𝑆𝑎𝑏 = 𝑆𝑆𝑡 − 𝑆𝑆𝑎 − 𝑆𝑆𝑏 − 𝑆𝑆𝑤 𝑑𝑓𝑎𝑏 = (𝑑𝑓𝑎 )(𝑑𝑓𝑏 ) 𝑀𝑆𝑎𝑏 =
𝑑𝑓𝑎𝑏
𝐹𝑎𝑏 =
𝑀𝑆𝑤
𝑆𝑆𝑤
Within 𝑆𝑆𝑤 = ∑(𝑦𝑎𝑏 − 𝑦̅𝑎𝑏 )2 𝑑𝑓𝑤 = 𝑎𝑏(𝑛 − 1) 𝑀𝑆𝑤 =
𝑑𝑓𝑤
Example 2.2.2. The samples presented in the table below represent test scores from six classes of
Statistics taught using three different approaches (conventional, modular and blended
learning) set at different schedules (morning and afternoon) and are independently
obtained. Assume that the populations are normal with equal variances. At α = 0.05 level of
significance, test for equality of population means.
Notice that there six blocks with 4 replications each (so 𝑛 = 4). There are 2 types of schedule implying
that there are 2 levels of Factor A (so 𝑎 = 2). There are 3 types of teaching approach used implying
that there are 3 levels of Factor B (so 𝑏 = 3).
Factor B
Teaching Approach
Factor A
Conventional Modular Blended
Schedule
10 7 4
12 9 5
Morning
11 8 6
9 12 5
12 13 6
13 15 6
Afternoon
10 12 4
13 12 4
2. 𝐻𝑜 ′ ′: There is no significant difference between the test scores when grouped according to teaching
approach used.
𝐻𝑎′′ : There is significant difference between the test scores when grouped according to teaching
approach used.
3. 𝐻𝑜 ′′′ : There is no significant interaction between the schedule and teaching approach used.
𝐻𝑎′′′ : There is significant interaction between the schedule and teaching approach used.
b. Factor B
𝑆𝑆𝑏 = 𝑎𝑛 ∑(𝑦̅𝑏 − 𝑦̅)2
Similar to (a), this formula means that we are going to
i. Subtract the total mean from each of the column means
ii. Square each of the differences
iii. Summate the result from (ii)
iv. Multiply the result from (iii) with the product 𝑎𝑛
𝑦̅𝑏 − 𝑦̅ 𝑎 = 2; 𝑛 = 4
11.250 – 9 .083 = (2.167)2 = 4.696
11.000 – 9 .083 = (1.917) 2 = 3.675
5.000 – 9 .083 = (– 4.083)2 = 16.671
Sum = 25.042 x 2 x 4 = 200.33
Therefore, 𝑆𝑆𝑏 = 200.33
c. Within or Error
𝑆𝑆𝑤 = ∑(𝑦𝑎𝑏 − 𝑦̅𝑎𝑏 )2
This formula implies that we are going to
i. Get the difference between each entry data in a block and their
corresponding block means (This is the reason why I color-coded each of
the blocks with different colors and colored also the block means with the
same color as the block it represents.)
ii. Square each of the differences
iii. Summate the result from (ii)
𝑦𝑎𝑏 − 𝑦̅𝑎𝑏 𝑦𝑎𝑏 − 𝑦̅𝑎𝑏 𝑦𝑎𝑏 − 𝑦̅𝑎𝑏
Sum = 37
Therefore, 𝑆𝑆𝑤 = 37
d. Total
𝑆𝑆𝑡 = ∑(𝑦𝑎𝑏 − 𝑦̅)2
This formula implies that we are going to
i. Subtract the total mean from each of the data
ii. Square each of the differences
iii. Summate the result from (ii)
Sum = 273.83
Therefore, 𝑆𝑆𝑡 = 273.83
e. Interaction
The Sum of Squares Interaction is
𝑆𝑆𝑡 = 𝑆𝑆𝑎 + 𝑆𝑆𝑏 + 𝑆𝑆𝑎𝑏 + 𝑆𝑆𝑤
By manipulating the equation to solve for 𝑆𝑆𝑎𝑏 given 𝑆𝑆𝑡 from (d), 𝑆𝑆𝑎 from (a), 𝑆𝑆𝑏
from (b) and 𝑆𝑆𝑤 from (c), then we have
𝑆𝑆𝑎𝑏 = 𝑆𝑆𝑡 − 𝑆𝑆𝑎 − 𝑆𝑆𝑏 − 𝑆𝑆𝑤
Therefore, 𝑆𝑆𝑎𝑏 = 273.83 − 20.16 − 200.33 − 37 = 16.33
e. Total
𝑑𝑓𝑡 = 𝑎𝑏𝑛 − 1 = (2)(3)(4) − 1 = 24 − 1 = 23
Step 6: Of course, to decide whether to accept or reject the null hypothesis, there is a need to
calculate also either of the following:
a. F-critical for each of the sources of variation
Textbooks and other references suggest looking it up on Table of Critical Values
for F-Distribution which are usually on the appendices of Statistics books but if you
have a computer with Microsoft Excel, you can enter the formula function
= F. INV. RT (α, dfof the source , dfwithin )
Note that when using this method in deciding whether to accept or reject the
hypothesis, if:
• F-statistic < F-critical, accept the Ho.
• F-statistic > F-critical, reject the Ho and conclude the Ha.
Within 18
= FDIST(9.79,1,18)
Factor A 1 9.79 Reject Ho
It will return 0.006
= FDIST(48.63,2,18)
Factor B 2 48.63 Reject Ho
It will return 0.000
= FDIST(3.97,2,18)
Interaction 2 3.97 Reject Ho
It will return 0.037
Within 18
Total 273.83 23
𝐻𝑎′ :
b.
𝐻𝑜 ′ ′:
𝐻𝑎′′ :
c.
𝐻𝑜 ′′′ :
𝐻𝑎′′′ :
2. Decide the level of significance, 𝒂.
The hypothesis will be tested using 𝑎 = ______ level of significance.
3. Choose the appropriate test statistic.
The test that will be used is ___________________________________ because __________________________
_______________________________________________________________________________________________.
4. Compute the value of the statistical test.
Computation:
(Show calculation for each of the missing entries.)
Source of Sum of Degrees of Mean
F- statistic F-critical P-value Decision
Variation Squares Freedom Squares
Factor A 1157 3
Factor B 350 2
Interaction
Within 1501 24
Total 3779
6. Draw a conclusion.
Conclusion:
23 34 38 39
Heterogeneous 29 37 39 32
grouping 31 36 41 30
32 32 38 27
43 38 39 35
Homogenous 39 38 40 36
grouping 39 39 37 38
42 35 36 39