4.1 Introduction To Studying Two Factors: ESGC 6112: Lecture 4 Two-Factor Cross-Classification Designs
4.1 Introduction To Studying Two Factors: ESGC 6112: Lecture 4 Two-Factor Cross-Classification Designs
4.1 Introduction To Studying Two Factors: ESGC 6112: Lecture 4 Two-Factor Cross-Classification Designs
2 independents variables (factors): battery brand and the device in which the
battery is used.
Study 3 devices and 4 brands, and decide to run each combination of levels
of factors for two batteries; that is, each combination of levels of factors, or
“treatment combination” is replicated twice.
Brand
Device 1 2 3 4
1 17.9, 18.1 17.8, 17.8 18.1, 18.2 17.8, 17.9
2 18.2, 18.0 18.0, 18.3 18.4, 18.1 18.1, 18.5
3 18.0, 17.8 17.8, 18.0 18.1, 18.3 18.1, 17.9
Table 4.1 has 3 rows and four column, and two replicates (n=2) for each cell
or treatment combination.
The Model:
Yijk i j I ij ijk
where
We have n observations per cell, and RC cells, total= (nRC) data values,
df=(nRC-1)
Parameter Estimates
Yijk Y... (Yi.. Y... ) (Y. j. Y... ) (Yij. Yi.. Y. j. Y... ) (Yijk Yij. )
(Yij. Y... ) - the difference between the cell mean and the grand mean
(Yi.. Y... ) - the difference between the row mean and the grand mean
(Y. j. Y... ) - the difference between the column mean and the grand mean
Interaction
Suppose we have two factors, A and B, each at two levels, high (H) and low
(L) and an infinite amount of replication (and hence no error in the cell
means). Further suppose that the cell means are as follows:
BL BH
AL 5 8
AH 10 ?
From row 1 and column 1, (AL, BL), where the yield is 5. As we change the
level of factor A (AL AH), holding the level of factor B constant (at BL),
the yield increases by 5 (=10-5). If we hold the level of factors A constant
(at AL) and change the level of factor B (BL BH), the yield increases by 3
(=8-5). What happens when the level of both factors, A and B, changes
(ALAH, BL BH)?
3 possibilities:
if the yield (AH, BH) is 13 (increasing by 8, 3+5), there is no interaction.
if the yield (AH, BH) is greater than 13 (increasing by > 8, 3+5), there is
positive interaction.
if the yield (AH, BH) is less than 13 (increasing by < 8, 3+5, or no increasing
at all), there is negative interaction.
BL BH
AL 5 8
AH 10 17
Holding the level of factor B constant at BL, as ALAH, the yield increases
by 5.
Holding the level of factor B constant at BH, as ALAH, the yield increases
by 9.
if the effect of one factor varies depending on the level of another factor,
there is interaction between 2 factors.
Yield
10 B low
8
B high
5
A low A high
Level of factor A
Yijk Y... (Yi.. Y... ) (Y. j. Y... ) (Yij. Yi.. Y. j. Y... ) (Yijk Yij. )
Yijk Y... (Yi.. Y... ) (Y. j. Y... ) (Yij. Yi.. Y. j. Y... ) (Yijk Yij. )
2 2 2
i j k (Y
ijk Y... ) i j k (Yi.. Y... ) i j k (Y. j. Y... )
2 2
i j k (Y Y Y Y ) i j k (Y Y )
ij. i.. . j. ... ijk ij.
2
i j k (Y Y ) does not depend on the indices j and k;
i.. ...
2 2 2
i j k (Y Y ) = j k [ i (Y Y ) ] nC [ i (Y Y ) ]
i.. ... i.. ... i.. ...
2 2 2
i j k (Y
ijk Y... ) nC i (Yi.. Y... ) nR j (Y. j. Y... )
2 2
n i j (Y Y Y Y ) i j k (Y Y )
ij . i.. . j. ... ijk ij.
2
Without replication, i j k (Y Y ) = 0, since for every value of (i,j),
ijk ij.
Yij. Yijk , i.e. with no replication, a cell mean = individual data value.
Brand
Device 1 2 3 4 Yi..
1 17.9, 18.1 17.8, 17.8 18.1, 18.2 17.8, 17.9
(18.0) (17.8) (18.15) (17.85) 17.95
2 18.2, 18.0 18.0, 18.3 18.4, 18.1 18.1, 18.5
(18.1) (18.15) (18.25) (18.3) 18.20
3 18.0, 17.8 17.8, 18.0 18.1, 18.3 18.1, 17.9
(17.9) (17.9) (18.2) (18.0) 18.00
Y. j . 18.00 (17.95) 18.2 18.05 18.05
Cell means are in parentheses, column means are in the bottom row and row
means are in the last column.
Vint cannot be negative, some strong evidence that Vint =0, E ( MSI ) 2 ,
MSI and MSW are both estimating the same quantity, 2 . One should pool
the two estimates for use as the denominator of Fcalc when testing for
significance of the row and column factors. The pooling leads to a modified
ANOVA table:
Source of Variability SSQ df MS Fcalc
Rows 0.28 2 0.14 6.15
Columns 0.21 3 0.07 3.07
Error 0.41 18 0.028
(0.11+0.3) (6+12)
Total 0.90 23
A factor has fixed levels if the levels of the factor implemented in the
experiment are chosen by the experimenter, and the levels of the factor in
the experiment are the only levels about which inferences are to be made.
Example of a fixed-level factor – sex (male, female).
A factor having random levels is that the levels of the factor implemented in
the experiment are randomly selected (i.e. they are a random sample) from a
large number of possibilities. An example: amount of rainfall.
4 separate groups of 50 people; 2 groups were all males; 2 groups were all
females. 50 men & 50 women (2 groups) were told the brand name was
Frontiersman & were asked about the various opinions about the cigarette.
Another 50 men & 50 women (2 groups) were told the brand name was
April & were asked about the various opinions about the cigarette.
Sex
Brand Name Male Female Both Sexes
Frontiersman 4.44 2.04 3.24
April 3.50 4.52 4.01
Both brand 3.97 3.28
The entries in the cells represent average „intent to purchase‟ (that is, cell
means) for each group on a seven-point scale with 7 representing nearly a
certain purchase of the cigarette and 1 representing nearly zero chance of
purchase of the cigarette.
ANOVA Table
Source of Variability SSQ df MS Fcalc
Sex 23.80 1 23.80 5.61
Brand Name 29.64 1 29.64 6.99
Interaction 146.2 1 146.2 34.48
Error 831.0 196 4.24
Total 1031 199
Model – fixed model. For =0.05, and df = (1,196), c=3.84 and all three
effects are significant. Interaction effect is dominant. The main effect of
brand name is -0.77 (3.24-4.01), it‟s +0.94 (4.44-3.50) for males, and -2.48
(2.04-4.52) for females. The difference in the effects between the two sexes,
3.42 [=0.94-(-2.48)], jumps out.
Yijk i j I ij ijk
2 2 2
i j k (Y
ijk Y... ) nC i (Yi.. Y... ) nR j (Y. j. Y... )
2 2
n i j (Y Y Y Y ) i j k (Y Y )
ij . i.. . j. ... ijk ij.
Without replication, n=1, the last term of both equations equals zero, since,
for every (i,j), Yijk Yij. , i.e. with no replication, a cell mean equals the
individual data value. Then we have:
2 2 2 2
i j (Y Y ) C i (Y Y ) R j (Y Y ) i j (Y Y Y Y )
ij .. i. .. .j .. ij i. .j ..
and TSS = SSBr + SSBc + SSIr,c
df: RC-1 = (R-1) + (C-1) + (R-1)(C-1)
Suppose there is no interaction, Vint=0, then E(MSIr,c) = 2, we can use SSIr,c
and MSIr,c to represent the role of SSW and MSW, respectively, therefore,
TSS = SSBr + SSBc + SSW
Example 4.5: An Unreplicated Numerical Example
Fcalc estimated the ratio with numerator and denominator expected mean
squares of (2 + Vr)/2; however, with the interaction not zero, we actually
estimated (2 + Vr)/(2+ Vint), therefore Fcalc is, on average, smaller than it
deserves to be.
Using SPSS
4.5 Blocking
One solution is to study on one gender only, or two separate studies, but
these will not give an overall measure of the impact of age, and the result for
each gender would have the reliability that a combined study might have. In
addition, interaction effect may happen.
Hypothesis:
Convert the data to ranks – ranks within each row replace each data value
by its rank within its row (1 through 4 as there are 4 data value in a row). If
there is a tie, average the ranks.
Test statistic:
FR = {12 /[ RC (C 1)]} ( R 2j ) 3R(C 1)
j 1toC
Exercises: