Inferential 1 Student
Inferential 1 Student
Non-probability samples
Probability samples
Non probability samples
Convenience samples (ease of access)
sample is selected from elements of a population that
are easily accessible
Snowball sampling (friend of friend.etc.)
Purposive sampling (judgemental)
You chose who you think should be in the
study
Quota sample
Non probability samples
Probability of being chosen is unknown
Cheaper- but unable to generalise
potential for bias
Probability samples
Random sampling
Each subject has a known probability of being
selected
Allows application of statistical sampling
theory to results to:
Generalise
Test hypotheses
Probability samples are the best
Ensure
Representativeness
Precision
Methods used in probability samples
684257954125632140
582032154785962024
362333254789120325
985263017424503686
Systematic sampling
Sampling fraction
Ratio between sample size and population size
Systematic sampling
Cluster sampling
Cluster: a group of sampling units close to each
other i.e. crowding together in the same area or
neighborhood
Cluster sampling
Section 1 Section 2
Section 3
Section 5
Section 4
Stratified sampling
Multi-stage sampling
sampling error...
the differences in samples due to
random fluctuations within the
population
Errors in sample
Quantitative Qualitative
Z 2 2 Z 2 (1 )
n n
D2 D2
(12 22 )xF 2 P (1 - P) F
n n
D 2
D2
Qualitative Research
-is primarily exploratory research. It is used to gain an
understanding of underlying reasons, opinions, and motivations.
It provides insights into the problem or helps to develop ideas or
hypotheses for potential quantitative research.
-Qualitative Research is also used to uncover trends in thought
and opinions, and dive deeper into the problem. Qualitative data
collection methods vary using unstructured or semi-structured
techniques.
-Some common methods include focus groups (group
discussions), individual interviews, and
participation/observations. The sample size is typically small, and
respondents are selected to fulfill a given quota.
Quantitative Research
is used to quantify the problem by way of generating
numerical data or data that can be transformed into useable
statistics.
It is used to quantify attitudes, opinions, behaviors, and other
defined variables.
Quantitative data collection methods are much more
structured than Qualitative data collection methods.
Quantitative data collection methods include various forms of
surveys online surveys, paper surveys, mobile surveys and
kiosk surveys, face-to-face interviews, telephone interviews,
longitudinal studies, website interceptors, online polls, and
systematic observations.
Problem 1
A study is to be performed to determine a
certain parameter in a community. From a
previous study a sd of 46 was obtained.
If a sample error of up to 4 is to be accepted.
How many subjects should be included in this
study at 99% level of confidence?
Answer
2
Z 2
n
D 2
2.582 x 462
n 880.3 ~ 881
4 2
Problem 2
A study is to be done to determine effect of 2
drugs (A and B) on blood glucose level. From
previous studies using those drugs, Sd of BGL
of 8 and 12 g/dl were obtained respectively.
A significant level of 95% and a power of 90%
is required to detect a mean difference
between the two groups of 3 g/dl. How many
subjects should be include in each group?
Answer
( )xF
2 2
n 2
1 2
D
(8 12 )x10.5
2 2
n 2
242.6 ~ 243
3
in each group
Problem 3
It was desired to estimate proportion of anaemic
children in a certain preparatory school. In a
similar study at another school a proportion of
30 % was detected.
Compute the minimal sample size required at a
confidence limit of 95% and accepting a
difference of up to 4% of the true population.
Answer
Z (1 )
2
n 2
D
SEx = N - 1
a smaller standard error indicates
less sampling error
the major factor affecting the size
of the standard error of the mean is
sample size
but, the size of the population
standard deviation also affects the
standard error of the mean
Important concepts about sampling
distributions:
If a sample is representative of the population, the mean (on
a variable of interest) for the sample and the population
should be the same.
However, there will be some variation in the value of sample
means due to random or sampling error. This refers to things
you cant necessarily control in a study or when you collect a
sample.
The amount of variation that exists among sample means
from a population is called the standard error of the mean.
Standard error decreases as sample size increases.
The null hypothesis (H0)...
True False
True False
Type II Error
True Correct
Type I Error
False Correct
researchers use a test of significance to
determine whether to reject or fail to reject
the null hypothesis
involves pre-selecting a level of
probability, (e.g., = .05) that
serves as the criterion to determine
whether to reject or fail to reject the
null hypothesis
Steps in using inferential statistics
t-test
ANOVA
Chi Square
t-test...
used to determine whether two
means are significantly different at
a selected probability level
adjusts for the fact that the
distribution of scores for small
samples becomes increasingly
different from the normal distribution
as sample sizes become increasingly
smaller
the strategy of the t-test is to
compare the actual mean
difference observed to the
difference expected by chance
forms a ratio where the numerator is
the difference between the sample
means and the denominator is the
chance difference that would be
expected if the null hypothesis were
true
after the numerator is divided by the
denominator, the resulting t value is
compared to the appropriate t table
value, depending on the probability
level and the degrees of freedom
if the t value is equal to or greater
than the table value, then the null
hypothesis is rejected because the
difference is greater than would be
expected due to chance
there are two types of t-tests: the
t-test for independent samples
(randomly formed) and the t-test for
nonindependent samples
(nonrandomly formed, e.g.,
matching, performance on a
pre-/post- test, different treatments)
ANOVA...
used to determine whether two or
more means are significantly
different at a selected probability
level
avoids the need to compute duplicate
t-tests to compare groups
the strategy of ANOVA is that total
variation, or variance, can be
divided into two sources: a)
treatment variance (between
groups, variance caused by the
treatment groups) and error
variance (within groups
variance)
forms a ratio, the F ratio, with the
treatment variance as the
numerator (between group
variance) and error variance as the
denominator (within group
variance)
the assumption is that randomly
formed groups of participants are
chosen and are essentially the
same at the beginning of a study
on a measure of the dependent
variable
at the studys end, the question is
whether the variance between the
groups differs from the error
variance by more than what would
be expected by chance
if the treatment variance is
sufficiently larger than the error
variance, a significant F ratio
results, that is, the null hypothesis
is rejected and it is concluded that
the treatment had a significant
effect on the dependent variable
if the treatment variance is not
sufficiently larger than the error
variance, an insignificant F ratio
results, that is, the null hypothesis
is accepted and it is concluded
that the treatment had no
significant effect on the dependent
variable
when the F ratio is significant and
more than two means are involved,
researchers use multiple
comparison procedures (e.g.,
Scheff test, Tukeys HSD test,
Duncans multiple range test)
FANOVA...
used when a research study uses
a factorial design to investigate
two or more independent variables
and the interactions between them
provides a separate F ratio for each
independent variable and each
interaction
Multiple Regression...
a prediction equation that includes
more than one predictor
predictors are variables known to
individually predict (correlate with) the
criterion to make a more accurate
prediction
Chi Square (2)...
a nonparametric test of significance
appropriate for nominal or ordinal
data that can be converted to
frequencies
compares the proportions actually
observed (O) to the proportions
expected (E) to see if they are
significantly different
the chi square value increases as
the difference between observed
and expected frequencies
increases
ANCOVA can also be used to
increase the power of a statistical
test by reducing within-group
(error) variance, that is, to make a
correct decision to reject the null
hypothesis
One- and two- tailed tests of significance...
One-way chi-square
One variable
Two variables
(1 IV with 2 levels; 1 DV) t-test
Two variables
(1 IV with 2+ levels; 1 DV) ANOVA