Analytics 2 e
Analytics 2 e
Analytics 2 e
Statistical Inference
Statistical Inference
Statistical inference focuses on drawing
conclusions about populations from samples.
◦ Statistical inference includes estimation of population
parameters and hypothesis testing, which involves
drawing conclusions about the value of the parameters of
one or more populations.
Hypothesis Testing
Hypothesis testing involves drawing inferences about
two contrasting propositions (each called a hypothesis)
relating to the value of one or more population
parameters.
H0: Null hypothesis: describes an existing theory
H1: Alternative hypothesis: the complement of H0
Using sample data, we either:
- reject H0 and conclude the sample data provides
sufficient evidence to support H1, or
- fail to reject H0 and conclude the sample data
does not support H1.
Example 7.1: A Legal Analogy for Hypothesis
Testing
In the U.S. legal system, a defendant is innocent
until proven guilty.
◦ H0: Innocent
◦ H1: Guilty
If evidence (sample data) strongly indicates the
defendant is guilty, then we reject H0.
Note that we have not proven guilt or innocence!
Hypothesis Testing Procedure
t = 3.83
Critical value = 1.81
p-value = 0.00166
Reject H0.
Two-Sample Test for Means with
Paired Samples
In many situations, data from two samples are naturally
paired or matched.
When paired samples are used, a paired t-test is more
accurate than assuming that the data come from
independent populations.
Hypotheses (mD is the mean difference between the paired
samples):
t = -10.91
t is smaller than the
hypothesis
Test for Equality of Variances
Test for equality of variances between two samples
using a new type of test, the F-test.
◦ To use this test, we must assume that both samples are drawn
from normal populations.
Hypotheses:
F-test statistic:
F = 3.47
Critical value =
2.91
P-value = 0.029
Reject H0
Analysis of Variance (ANOVA)
Used to compare the means of two or more population
groups.
◦ F = 3.92
◦ Fcrit = 3.46
◦ F > Fcrit
◦ p-value = 0.0356
◦ Reject H0.
Assumptions of ANOVA
The m groups or factor levels being studied
represent populations whose outcome measures
1. are randomly and independently obtained,
2. are normally distributed, and
3. have equal variances.
Test
statistic