Parametric Tests
Parametric Tests
Parametric Tests
Analysis: Parametric
Tests
• Facts • Organized
[classified/arranged]
and/or Which is then
• Figures • Summarized [condensed]
• Analyzed [scrutinized]
• Interpreted [clarified]
• Data: The raw material of statistics is data:
• We usually define data as numbers
• The two kinds of numbers that we use in statistics are:
• Numbers that result from the taking — in the usual sense of the term — of a
measurement (Continuous variables)
• Numbers that result from the process of counting (Discrete variables)
Classification of Data based on type of Variables &
associated measurement scale
Types of Data
Qualitative Quantitative
Data Type
Categorical Numerical
• Even though this appears like a science, there are ways in which one
can manipulate studies and results through various means BIAS /
CHEATING!!!
• The Descriptive Statistics allow us to determine if the criteria have been met
• If these criteria are not met, then we run the risk of using wrong inferential
tests with an increased risk of errors (type I or type II errors) in the analysis of
data
Data Assumptions for Parametric Tests
• In statistical analysis, it is important that all parametric tests should assume
some certain characteristic about the data also known as assumptions
• Violation of these assumptions changes the conclusion of the research and
interpretation of the results all research, whether for a journal article, thesis, or
dissertation, must follow these assumptions for accurate interpretation
• Depending on the parametric analysis, the assumptions vary
• The following are the data assumptions commonly found in statistical research:
1. Random sampling – sample must be randomly selected from population
Non-
Parametric
Parametric Tests
should be used?
• In table D, we consult the row
corresponding to df = n – 1 = 11
• We move across that row to the entry that
is directly in the 95% confidence interval
column
• The desired critical value is t* = 2.201
T-test
2 independent
samples t-test
Two samples
Two paired samples
t-test
• Assumptions:
1. Samples are randomly selected
2. Data for analysis is quantitative
3. Variable follow normal distribution
4. Sample variances are mostly same in both the groups under the study
5. Samples are small, mostly lower than 30
One sample t-test
Steps:
1. A random sample is drawn from the population
2. Comparison of sample mean with population mean is made
3. A statistical decision is made as to whether or not the sample mean is
different from the population
Calculations for one sample t-test
• S2 =
Where:
• = Sample mean Where:
• = Population mean • = Element of sample
• = Standard Error • = Sample mean
• = Degrees of freedom
Decision
• If absolute value of ‘t’ obtained is greater than the value t* given in table D,
then we reject the null hypothesis
• If absolute value of ‘t’ is less than the table value t*, we fail to reject the null
hypothesis
Example 1
which constitutes
the DEPENDENT
an INDEPENDENT whose effect is
variable
variable then determined
(For e.g.,
(e.g., drug, vaccine, a by measurement
new procedure, incidence of
of the FINAL
behavior change) disease, survival
OUTCOME
time, recovery
period)
• t=
• = - 0.2564
State the rule for rejecting the null hypothesis
• p value suggests that the dietary intake of these women was significantly less
• t = - 0.2664 which is < 2.23
• Unpaired t-test is used when two independent random samples come from
normal populations having unknown or same variance
alternative hypothesis
Assumptions:
• Samples are random and independent of each other
• The normal distribution of dependent variable is normal
• The variances are equal in both the groups
Calculations – two independent samples t-test
• The test statistic is given by:
•t=
•S=
• Where and are respectively called Standard Deviations of first and second
group
Weight of babies (in kgs)
born to:
Example – two unpaired samples Non-smoking Heavy
mothers smoking
t-test (n=15) mothers
(n=14)
• Research problem: A study was conducted to 3.99 3.18
compare the birth weights of children born to non-
smoking mothers (n=15) vs. smoking mothers 3.79 2.84
(n=14) 3.60 2.90
• Research Hypothesis: State null hypothesis and 3.73 3.27
alternative hypothesis 3.21 3.85
• H0 = There is no difference between the 3.60 3.52
birthweight (in kgs) of children born to non- 4.08 3.23
smoking and smoking mothers
3.61 2.76
• H1 = There is a difference between the
3.83 3.60
birthweights (in kgs) of children born to non-
3.31 3.75
• Set the level of significance α = 0.05; 0.01; 0.001
smoking and smoking mothers
4.13 3.59
3.26 3.63
3.54 2.38
Decision
(sample) where each element (x) gives a pair of observations (x1 & x2)
• We compare the means of two observations from a single group
Assumptions:
• The outcome variable should be continuous
• The difference between pre-post measurements should be normally
distributed
Calculations
• Where:
• d = difference between x1 and x2
• = Average of d
• SD = Std. deviation for the difference
• n = sample size
Example – two paired samples t-test
Patient Weight
Research Problem: No. (Kgs)
• A study was carried out to evaluate the effect of Before After Diet
a new diet on weight loss. The study population Diet
consists of 12 people who have used the diet for 1 75 70
2 months; their weights before & after the diet
are given 2 60 54
3 68 58
4 98 93
Research Hypothesis:
5 83 78
• State null hypothesis and alternative hypothesis
6 89 84
• H0 = There is no reduction in weight after the diet
7 65 60
• H1 = There is reduction in weight after the diet 8 78 77
9 95 90
• Decision: If we reject the null hypothesis then 10 80 76
there is a significant reduction in weight 11 100 94
Group Statistics
Lower Upper
Equal variances
3.609 .059 -1.812 198 .071 -2.53352 1.39790 -5.29021 .22316
science assumed
score Equal variances not
-1.785 176.555 .076 -2.53352 1.41958 -5.33505 .26800
assumed
How do we compare means for more than two
groups?
How does t-test concept work here? • Instead of using a series of individual
comparisons we examine the
• A versus B
differences among the groups
B versus C
through an analysis that considers
• A versus C B versus D the variation among all groups at
• A versus D C versus D once
So, the chance of getting the wrong result • i.e., ANALYSIS OF VARIANCE
would be:
1 – (0.95 x 0.95 x 0.95 x 0.95) = 26%
Variables in ANOVA
which constitutes
the DEPENDENT
an INDEPENDENT whose effect is
variable
variable then determined
(For e.g.,
(e.g., drug, vaccine, a by measurement
new procedure, incidence of
of the FINAL
behavior change) disease, survival
OUTCOME
time, recovery
period)
• The resulting F statistic is then compared with the critical value of F (critic),
obtained from F tables in much the same way as was done with ‘t’
• If the calculated value exceeds the critical value for the appropriate level of α,
the null hypothesis will be rejected
• A F test is therefore a test of the Ratio of Variances
• F test can also be used on their own, independently of the ANOVA technique, to test
hypothesis about variances
• In ANOVA, the F test is used to establish whether a statistically significant difference exists
in the data being tested
One-Way
ANOVA
Two-Way
Types of ANOVA
• The first step in studying the relationship between two continuous variables is
to draw a scatter plot of the variables to check for linearity
• For the purpose of correlation only, it does not matter on which axis the
variables are plotted
• The nearer the scatter of points is to a straight line, the higher the strength of
association between variables
Types of Correlation
• No Correlation
The Scatter
Plot
• This test is used for testing significance difference between two means (n>30)
Assumptions:
• The sample must be randomly selected
• Data must be quantitative
• Samples should be larger than 30
• Data should be normally distributed
• Sample variances should be almost the same in both the groups of study