Community Project: Checking Normality For Parametric Tests in SPSS
Community Project: Checking Normality For Parametric Tests in SPSS
stcp-marshallsamuels-normalityS
When carrying out tests comparing groups, e.g. t-tests, normality checks should be carried
out separately for each group: put the appropriate grouping variable in the Factor List
Note: Normal curves can be added to histograms by double-clicking on them and using the
button in the Chart Editor window.
It is very unlikely that a histogram of sample data will produce a perfectly smooth normal curve like
the one displayed over the histogram, especially if the sample size is small. As long as the data is
approximately normally distributed, with a peak in the middle and fairly symmetrical, the
assumption of normality has been met.
The normal Q-Q plot is an alternative graphical method of assessing normality to the histogram
and is easier to use when there are small sample sizes. The scatter should lie as close to the line
as possible with no obvious pattern coming away from the line for the data to be considered
normally distributed. Below are the same examples of normally distributed and skewed data.
Q-Q plot of approximately normally
Q-Q plot of skewed data
distributed data
Note: Alternative methods of measuring non-normality include comparing skewness and kurtosis
values with their standard errors which are provided in the Explore output – see the workshops on
SPSS and parametric testing.
• For larger samples (i.e. more than one hundred), the normality tests are overly conservative
and the assumption of normality might be rejected too easily (see robust exceptions below).
Any assessment should also include an evaluation of the normality of histograms or Q-Q plots
as these are more appropriate for assessing normality in larger samples.
Hypothesis test for a test of normality
Null hypothesis: The data is normally distributed
For both of these examples, the sample size is 35 so the Shapiro-Wilk test should be used. For
the skewed data, p = 0.002 suggesting strong evidence of non-normality. For the approximately
normally distributed data, p = 0.582, so the null hypothesis is retained at the 0.05 level of
significance. Therefore, normality can be assumed for this data set and, provided any other test
assumptions are satisfied, an appropriate parametric test can be used.
Note: The residuals are the differences between the observed and expected values. They are
usually available in the ‘Save’ options when carrying out a test. However, for ANOVA, residuals
can only be obtained through the Analyze General Linear Model Univariate menu.
Although non-parametric tests require fewer assumptions and can be used on a wider range of
data types, parametric tests are preferred because they are more sensitive at detecting differences
between samples or an effect of the independent variable on the dependent variable. This means
that to detect any given effect at a specified significance level, a larger sample size is required for
the non-parametric test than the equivalent parametric test when the data is normally distributed.
However, some statisticians argue that non-parametric methods are more appropriate with small
sample sizes.
Subject 1 2 3 4 5 6 7 8 9 10
Before 85.1 108.4 79 109.1 97.3 96 102.1 91.2 89.2 100
After 86 70 78.9 69.7 59.4 55 65.6 50.2 60.5 82
Carry out the relevant normality checks and determine which test is appropriate.
Solutions
Ex 1: The p-value for the Shapiro-Wilk test is 0.001 suggesting that the data is not normally
distributed. However, the histogram suggests that the data is approximately normal with one
outlier. With such a high BMI, this is likely to be a data entry error and therefore removed.
Ex 2: This is paired data and therefore the paired differences should be checked for normality.
Use Transform Compute Variable and calculate the difference between before and after. The
Shapiro-Wilk test has a p-value of 0.005 and the histogram is negatively skewed so a paired t-
test is not appropriate. A Wilcoxon signed rank test should be used instead. For small sample
sizes, it can be difficult to assess non-normality so non-parametric tests are recommended.