Sampling
Sampling
Sampling
METHODS
1
LEARNING OBJECTIVES
2
SAMPLING
4
SAMPLING…….
5
6
SAMPLING BREAKDOWN
SAMPLING…….
STUDY POPULATION
SAMPLE
TARGET POPULATION
7
Process
The sampling process comprises several stages:
Defining the population of concern
Specifying a sampling frame, a set of items or
events possible to measure
Specifying a sampling method for selecting
items or events from the frame
Determining the sample size
Implementing the sampling plan
Sampling and data collecting
Reviewing the sampling process
8
Population definition
9
SAMPLING FRAME
In the most straightforward case, such as the
sentencing of a batch of material from production
(acceptance sampling by lots), it is possible to
identify and measure every single item in the
population and to include any one of them in our
sample. However, in the more general case this is not
possible. There is no way to identify all rats in the
set of all rats. Where voting is not compulsory, there
is no way to identify which people will actually vote at
a forthcoming election (in advance of the election)
As a remedy, we seek a sampling frame which has the
property that we can identify every single element
and include any in our sample .
The sampling frame must be representative of the
population
10
PROBABILITY SAMPLING
11
PROBABILITY SAMPLING…….
12
SIMPLE RANDOM SAMPLING
• Applicable when population is small, homogeneous &
readily available
• All subsets of the frame are given an equal
probability. Each element of the frame thus has an
equal probability of selection.
• It provides for greatest number of possible samples.
This is done by assigning a number to each unit in the
sampling frame.
• A table of random number or lottery system is used
to determine which units are to be selected.
13
SIMPLE RANDOM SAMPLING……..
Estimates are easy to calculate.
Simple random sampling is always an EPS design, but not all
EPS designs are simple random sampling.
Disadvantages
If sampling frame large, this method impracticable.
Minority subgroups of interest in population may not be
present in sample in sufficient numbers for study.
14
REPLACEMENT OF SELECTED UNITS
17
SYSTEMATIC SAMPLING……
ADVANTAGES:
Sample easy to select
Suitable sampling frame can be identified easily
Sample evenly spread over entire reference population
DISADVANTAGES:
Sample may be biased if hidden periodicity in population
coincides with that of selection.
Difficult to assess precision of estimate from one survey.
18
STRATIFIED SAMPLING
Where population embraces a number of distinct
categories, the frame can be organized into separate
"strata." Each stratum is then sampled as an
independent sub-population, out of which individual
elements can be randomly selected.
Every unit in a stratum has same chance of being
selected.
Using same sampling fraction for all strata ensures
proportionate representation in the sample.
Adequate representation of minority subgroups of
interest can be ensured by stratification & varying
sampling fraction between strata as required.
19
STRATIFIED SAMPLING……
Finally, since each stratum is treated as an
independent population, different sampling
approaches can be applied to different strata.
21
OVERSAMPLING
Choice-based sampling is one of the stratified
sampling strategies. In this, data are
stratified on the target and a sample is taken
from each strata so that the rare target class
will be more represented in the sample. The
model is then built on this biased sample. The
effects of the input variables on the target
are often estimated with more precision with
the choice-based sample even when a smaller
overall sample size is taken, compared to a
random sample. The results usually must be
adjusted to correct for the oversampling.
22
CLUSTER SAMPLING
Cluster sampling is an example of 'two-stage
sampling' .
First stage a sample of areas is chosen;
Second stage a sample of respondents within
those areas is selected.
Population divided into clusters of homogeneous
units, usually based on geographical contiguity.
Sampling units are groups rather than individuals.
A sample of such clusters is then selected.
All units from the selected clusters are studied.
23
CLUSTER SAMPLING…….
Advantages :
Cuts down on the cost of preparing a sampling
frame.
This can reduce travel and other
administrative costs.
Disadvantages: sampling error is higher for a
simple random sample of same size.
Often used to evaluate vaccination coverage in
EPI
24
CLUSTER SAMPLING…….
• Identification of clusters
– List all cities, towns, villages & wards of cities with
their population falling in target area under study.
– Calculate cumulative population & divide by 30, this
gives sampling interval.
– Select a random no. less than or equal to sampling
interval having same no. of digits. This forms 1st cluster.
– Random no.+ sampling interval = population of 2nd
cluster.
– Second cluster + sampling interval = 4th cluster.
– Last or 30th cluster = 29th cluster + sampling interval
25
CLUSTER SAMPLING…….
• Freq cf cluster • XVI 3500 52500 17
• I 2000 2000 1 • XVII 4000 56500 18,19
• II 3000 5000 2 • XVIII 4500 61000 20
• III 1500 6500 • XIX 4000 65000 21,22
• IV 4000 10500 3
• XX 4000 69000 23
• V 5000 15500 4, 5
• XXI 2000 71000 24
• VI 2500 18000 6
• VII 2000 20000 7 • XXII 2000 73000
• VIII 3000 23000 8 • XXIII 3000 76000 25
• IX 3500 26500 9 • XXIV 3000 79000 26
• X 4500 31000 10 • XXV 5000 84000 27,28
• XI 4000 35000 11, 12 • XXVI 2000 86000 29
• XII 4000 39000 13 • XXVII 1000 87000
• XIII 3500 44000 14,15 • XXVIII 1000 88000
• XIV 2000 46000 • XXIX 1000 89000 30
• XV 3000 49000 16
• XXX 1000 90000
• 90000/30 = 3000 sampling interval
26
Difference Between Strata and Clusters
27
MULTISTAGE SAMPLING
All ultimate units (houses, for instance) selected at last step are
surveyed.
28
MULTISTAGE SAMPLING……..
This technique, is essentially the process of taking random
samples of preceding random samples.
Not as effective as true random sampling, but probably
solves more of the problems inherent to random sampling.
An effective strategy because it banks on multiple
randomizations. As such, extremely useful.
Multistage sampling used frequently when a complete list
of all members of the population not exists and is
inappropriate.
Moreover, by avoiding the use of all sample units in all
selected clusters, multistage sampling avoids the large,
and perhaps unnecessary, costs associated with traditional
cluster sampling.
29
MULTI PHASE SAMPLING
Part of the information collected from whole sample & part from
subsample.
30
NON PROBABILITY SAMPLING
Any sampling method where some elements of population
have no chance of selection (these are sometimes
referred to as 'out of coverage'/'undercovered'), or
where the probability of selection can't be accurately
determined. It involves the selection of elements based
on assumptions regarding the population of interest, which
forms the criteria for selection. Hence, because the
selection of elements is nonrandom, nonprobability
sampling not allows the estimation of sampling errors..
31
NONPROBABILITY
SAMPLING…….
• Nonprobability Sampling includes: Quota
Sampling, Convenience Sampling and Purposive
Sampling. In addition, nonresponse effects may
turn any probability design into a
nonprobability design if the characteristics of
nonresponse are not well understood, since
nonresponse effectively modifies each
element's probability of being sampled.
32
QUOTA SAMPLING
The population is first segmented into mutually exclusive
sub-groups, just as in stratified sampling.
Then judgment used to select subjects or units from
each segment based on a specified proportion.
For example, an interviewer may be told to sample 200
females and 300 males between the age of 45 and 60.
It is this second step which makes the technique one of
non-probability sampling.
In quota sampling the selection of the sample is non-
random.
For example interviewers might be tempted to interview
those who look most helpful. The problem is that these
samples may be biased because not everyone gets a
chance of selection. This random element is its greatest
weakness and quota versus probability has been a matter
33
of controversy for many years
CONVENIENCE SAMPLING
Sometimes known as grab or opportunity sampling or accidental
or haphazard sampling.
A type of nonprobability sampling which involves the sample being
drawn from that part of the population which is close to hand.
That is, readily available and convenient.
The researcher using such a sample cannot scientifically make
generalizations about the total population from this sample
because it would not be representative enough.
For example, if the interviewer was to conduct a survey at a
shopping center early in the morning on a given day, the people
that he/she could interview would be limited to those given there
at that given time, which would not represent the views of other
members of society in such an area, if the survey was to be
conducted at different times of day and several times per week.
This type of sampling is most useful for pilot testing.
In social science research, snowball sampling is a similar technique,
where existing study subjects are used to recruit more subjects
into the sample.
34
CONVENIENCE SAMPLING…….
35
35
Judgmental sampling or Purposive
sampling
- The researcher chooses the sample based on
who they think would be appropriate for the
study. This is used primarily when there is a
limited number of people that have expertise
in the area being researched
36
Questions???
37