One-way ANOVA

Two variables: 1 categorical variable
(factor/IV), 1 quantitative variable
Main question: Does (the mean of) the
quantitative variable depend on which group
(given by the categorical variable) the individual is in?
ANOVA looks at differences between groups.

Note: We usually refer to the sub-populations, or
the same population but with different treatments
applied, as groups.

At its simplest, ANOVA tests the following hypotheses:

H0: The means of all the groups are equal
μ1 = μ2 = μ3 = ... = μi

Ha: Not all the means are equal


– Similar to t-test

– More versatile than t-test

– Compare one parameter (response variable)
between two or more groups
Why Not Just Use t-tests?

– Tedious when many groups are present

– Using all data increases stability

– With a large number of comparisons, some may
appear significant by chance
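
To put rough numbers on the last point, the sketch below counts the pairwise t-tests needed for k groups and estimates the chance of at least one false positive when each test is run at level α. It assumes the tests are independent, which pairwise t-tests that share groups are not exactly, so the figures are only an illustration. The function names are hypothetical.

```python
from math import comb

def n_pairwise_tests(k):
    """Number of distinct pairs among k groups: k * (k - 1) / 2."""
    return comb(k, 2)

def familywise_error(k, alpha=0.05):
    """Chance of at least one false positive across all pairwise tests,
    assuming (as a simplification) that the tests are independent."""
    m = n_pairwise_tests(k)
    return 1 - (1 - alpha) ** m

# Number of tests and approximate familywise error for 3, 5 and 10 groups.
for k in (3, 5, 10):
    print(k, n_pairwise_tests(k), round(familywise_error(k), 3))
```

With 10 groups there are already 45 pairwise tests, so under this simplification a spurious "significant" result becomes very likely.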
– A company has three branches.
Turnover level differs across the three branches and
management wants to know whether this may be
explained by the extent to which employees are
satisfied with their working environment across the
branches. Fifty employees are randomly selected at
each branch and given a questionnaire measuring
how satisfied they currently are with the working
environment.
– Researchers investigate the effects of control type
on firm performance. The research question is
whether a real difference exists in performance
between owner- and manager-controlled firms
(McKean and Kania, 1978).
– Investigators want to know whether
demographic factors (e.g. age group, race,
education level, annual income level, and
employment status) and investment experience
(novice, intermediate, advanced) influence
retirement planning intention.

– The researchers investigate the effects of
advertising model eye color (blue, green,
and brown) on ad viewers' responses to the
ad (Simpson, Sturgen, and Tanguma).

What can we conclude from the examples?

ANOVA Assumptions
There are three basic assumptions used in ANOVA:
– The populations from which the samples
were taken are normally distributed.
– Homogeneity of variance.
– Random sampling.
Notation for ANOVA

n = number of individuals all together

i = number of groups (below, i also indexes a single group)
x̄ = mean for the entire data set

Group i has
ni = # of individuals in group i
xij = value for individual j in group i
x̄i = mean for group i
si = standard deviation for group i
How ANOVA works
ANOVA measures two sources of variation in the data and
compares their relative sizes

variation BETWEEN groups

for each data value we look at the difference between its group
mean and the overall mean
(x̄i - x̄)²
variation WITHIN groups
for each data value we look at the difference between that
value and the mean of its group

(xij - x̄i)²
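
The two sources of variation can be sketched in a few lines of Python. The scores below are made up for illustration, and the function name is hypothetical. The squared differences are summed over every data value, so each group mean's squared distance from the overall mean is counted once per individual in that group.

```python
from statistics import mean

# Hypothetical satisfaction scores for three groups (made-up numbers).
groups = {
    "A": [4, 5, 6, 5],
    "B": [6, 7, 8, 7],
    "C": [5, 6, 7, 6],
}

def sums_of_squares(groups):
    """Return (between-group, within-group) sums of squares."""
    all_values = [x for values in groups.values() for x in values]
    grand_mean = mean(all_values)
    ss_between = 0.0
    ss_within = 0.0
    for values in groups.values():
        group_mean = mean(values)
        # Every value in the group contributes (group mean - overall mean)^2.
        ss_between += len(values) * (group_mean - grand_mean) ** 2
        # Each value contributes (value - its group mean)^2.
        ss_within += sum((x - group_mean) ** 2 for x in values)
    return ss_between, ss_within

ss_between, ss_within = sums_of_squares(groups)
print(ss_between, ss_within)
```

The two sums add up to the total squared deviation of all values from the overall mean, which is why ANOVA can compare their relative sizes.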
The ANOVA F-statistic is the ratio of the Between Group Variation
to the Within Group Variation:

F = Between / Within = MSG / MSE
This compares the variation between groups (group means to overall mean)
to the variation within groups (individual values to group means). This is
what gives it the name Analysis of Variance.

A large F is evidence against H0, since it indicates that there is more
difference between groups than within groups.

Note: it is easier to look at the P-value to decide whether H0 is
rejected or not. If the P-value is less than or equal to α, reject H0. If the
P-value is greater than α, fail to reject H0.
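
As a rough illustration of the ratio and the decision rule, the sketch below computes MSG, MSE and F for made-up data. It assumes the usual degrees of freedom (number of groups minus 1 for MSG, n minus number of groups for MSE), which the slides do not spell out. The P-value helper is a shortcut that only holds for exactly three groups, where the F distribution has a closed form; in practice the P-value is read from the software output. All function names and data are hypothetical.

```python
from statistics import mean

def one_way_f(groups):
    """Return (F, df_between, df_within) for a list of groups of values."""
    n_total = sum(len(g) for g in groups)
    n_groups = len(groups)
    grand_mean = mean([x for g in groups for x in g])
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = n_groups - 1
    df_within = n_total - n_groups
    msg = ss_between / df_between  # Mean Square Group
    mse = ss_within / df_within    # Mean Square Error
    return msg / mse, df_between, df_within

def p_value_three_groups(f_stat, df_within):
    """Upper-tail P-value of F when df_between == 2 (three groups only).
    Assumption: relies on the closed form of the F(2, d) distribution."""
    return (1 + 2 * f_stat / df_within) ** (-df_within / 2)

groups = [[4, 5, 6, 5], [6, 7, 8, 7], [5, 6, 7, 6]]  # made-up scores
f_stat, df_b, df_w = one_way_f(groups)
alpha = 0.05
p = p_value_three_groups(f_stat, df_w)
decision = "reject H0" if p <= alpha else "fail to reject H0"
print(f_stat, p, decision)
```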
Step 1: The null hypothesis is

H0: μ1 = μ2 = μ3

Step 2: The alternative hypothesis is

Ha: not all of the μi are equal

Step 3: The significance level is α = ?

(α is usually set to one of the values {0.01, 0.05, 0.1})
Step 4: Calculate the F-statistic:

F = Mean Square Group / Mean Square Error = MSG / MSE

MSG, MSE and the F-statistic are found in the
ANOVA table when the analysis is run in SPSS.
Step 5: Find the P-value.
Step 6: Reject or fail to reject H0 based on the P-value.
Step 7: State your conclusion.
Levene's test:
H0: σ1² = σ2² = σ3² = ... = σi² → Homogeneity of variance
Ha: Not all of the variances are equal
Homogeneity fulfilled → Equal variance assumed
Homogeneity rejected → Equal variance not assumed

• ANOVA is still robust even when the homogeneity assumption is not fulfilled,
as long as the sample sizes are roughly equal or the deviation is only
moderate. As a rule of thumb, if the largest std.dev < (2 x the smallest
std.dev) then we need not be concerned about this assumption.
• Whether equal variance is assumed or not affects the choice of Post Hoc test method.
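
Both checks can be sketched in plain Python. The first function applies the rule of thumb directly. The second computes a mean-centred form of Levene's statistic, which is a one-way ANOVA F calculated on each value's absolute deviation from its group mean. Statistical software may centre on the median instead and will also report a P-value, so this is an illustration under those assumptions. The data and function names are hypothetical.

```python
from statistics import mean, stdev

def sd_rule_of_thumb(groups):
    """True if the largest group std. dev. is less than twice the smallest."""
    sds = [stdev(g) for g in groups]
    return max(sds) < 2 * min(sds)

def levene_statistic(groups):
    """Mean-centred Levene statistic: a one-way ANOVA F computed on the
    absolute deviations of each value from its own group mean."""
    deviations = [[abs(x - mean(g)) for x in g] for g in groups]
    n_total = sum(len(d) for d in deviations)
    k = len(deviations)
    grand = mean([z for d in deviations for z in d])
    between = sum(len(d) * (mean(d) - grand) ** 2 for d in deviations)
    within = sum((z - mean(d)) ** 2 for d in deviations for z in d)
    return (between / (k - 1)) / (within / (n_total - k))

# Made-up data: the third group is far more spread out than the others.
groups = [[4, 5, 6, 5], [6, 7, 8, 7], [2, 6, 10, 6]]
print(sd_rule_of_thumb(groups), levene_statistic(groups))
```

A large value of the statistic, judged against the F distribution with the same degrees of freedom as the ANOVA, is evidence against homogeneity of variance.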
How to perform ANOVA in SPSS?
Post Hoc Test: The results from the ANOVA do not
indicate which of the three groups differ from one another.
To locate the source of this difference we use a post hoc
test (commonly the Tukey test or the more conservative
Scheffé test; equal variance is assumed in these tests).
– Click Post Hoc and check the Tukey box, then click the Continue button.
– Last, click the OK button and wait a moment while SPSS analyzes the data.

• Tukey performs all of the pairwise comparisons between groups.
• Scheffé performs simultaneous joint pairwise comparisons for all
possible pairwise combinations of means. It can be used to examine all
possible linear combinations of group means, not just pairwise
comparisons.

If equal variance is not assumed, the following post
hoc tests could be used:
– Tamhane's T2. Conservative pairwise
comparisons test based on a t-test.
– Dunnett's T3. Pairwise comparison test based
on the Studentized maximum modulus.
– Games-Howell. Pairwise comparison test that
is sometimes liberal.
– Dunnett's C. Pairwise comparison test based
on the Studentized range.
One IV or Factor

Is the F-value significant?
– No: Stop
– Yes: Are there more than 2 groups?
    – No: Stop
    – Yes: Do Post Hoc test
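
The decision flow can also be written as a small function, which may help when checking several analyses in a row. This is only a sketch: the function names and the default α of 0.05 are assumptions, and the lists of post hoc tests are simply those named in these slides for the equal and unequal variance cases.

```python
def next_step(p_value, n_groups, alpha=0.05):
    """Follow the decision flow: is F significant? more than 2 groups?"""
    if p_value > alpha:
        return "stop"  # F is not significant
    if n_groups <= 2:
        return "stop"  # only two groups, so there is nothing left to locate
    return "do post hoc test"

def post_hoc_candidates(equal_variance_assumed):
    """Post hoc tests named in the slides for each variance situation."""
    if equal_variance_assumed:
        return ["Tukey", "Scheffé"]
    return ["Tamhane's T2", "Dunnett's T3", "Games-Howell", "Dunnett's C"]

print(next_step(p_value=0.01, n_groups=3))
print(post_hoc_candidates(equal_variance_assumed=False))
```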


This is how the data set is shown

The number of samples in each region

Homogeneity test

P-value for Levene's Test

H0: σ1 = σ2 = σ3
Ha: At least one σ is different from
the others

Result of ANOVA

P-value for ANOVA

H0: μ1 = μ2 = μ3
Ha: At least one μ is
different from the others

Conclusion: There is a difference in
employee job satisfaction across regions.

The South region is significantly
different from the others.
Test yourself

What is ANOVA?
Why do we use ANOVA?
What are the ANOVA assumptions?
How do we test the ANOVA assumptions?
What do we do when the equal variance assumption is not fulfilled?
What does it mean when the F value in the ANOVA result is statistically significant?
What does the post hoc test answer?
