Sampling and Sampling Distribution
Sampling and Sampling Distribution
1
Sampling
• Sample survey methodology often use to obtain information
about a large aggregate or population by selecting and
measuring a sample from the population
3
Advantages of sampling:
• Feasibility: Sampling may be the only feasible method of
collecting the information.
• Reduced cost: Sampling reduces demands on resource such
as finance, personnel, and material.
• Greater accuracy: Sampling may lead to better accuracy of
collecting data
• Sampling error: Precise allowance can be made for sampling
error
• Greater speed: Data can be collected and summarized more
quickly
4
Disadvantages of sampling:
• There is always a sampling error.
5
Sampling technique
– Nonprobability sampling
– Probability sampling
6
Probability sampling
• Gives each element a known non-zero chance of being
included in the sample.
– Cluster sampling
– Multi‐stage sampling
8
1. Simple random sampling
9
SRS,…
• Example
– Suppose your school has 500 students and you need to
conduct a short survey on the quality of the food served in
the cafeteria.
– You decide that a sample of 50 students should be
sufficient for your purposes.
– In order to get your sample, you assign a number from 1 to
500 to each student in your school.
– To select the sample, you use a table of randomly generated
numbers.
10
SRS,…
12
Systematic random sampling,…
13
Example
• Therefore, K = 4.
• You will need to select one unit out of every four units to end
14
Example,…
• If you choose 3, the third unit on your frame would be the first
unit included in your sample;
15
Example,…
• With a systematic sample approach there are only four
possible samples that can be selected, corresponding to the
four possible random starts:
• 1, 5, 9, 13...393, 397
16
Example,…
• Each member of the population belongs to only one of the four
samples and each sample has the same chance of being
selected.
• Each unit has a one in four chance of being selected in the
sample.
• The main difference is that with SRS, any combination of 100
units would have a chance of making up the sample, while
with systematic sampling, there are only four possible
samples.
17
Sampling with probability proportional to size
19
Steps in PPS
1. List all clusters with their population size
20
Cluster size Cum. Interval
1 1028 1028 0-1028
• If the number of cluster needed is 5,
2 555 1583 1029-1583
sampling interval is 9156/5=1831 3 390 1584-1973
1973
• Draw a number between 1 and 1831 4 1309 3282 1974-3282
5 698 3980 3283-3980
lets say 480
6 907 4887 3981-4887
• Start from the village including “480” 7 432 5319 4888-5319
and draw the clusters adding the 8 897 6216 5320-6216
9 677 6893 6217-6893
sampling interval
l0 501 7394 6894-7394
• According cluster 1, 4, 5, 6, 7 will be 11 1094 7395-8488
8488
included 12 668 9156 8489-9156
Total 9156
21
Stratified random sampling
22
Stratified random sampling,…
23
Reasons for stratified sampling
24
Cluster sampling
• Sometimes it is too expensive to spread a sample across the
population as a whole.
25
Steps in cluster sampling
• Cluster sampling divides the population into groups or
clusters.
• A number of clusters are selected randomly to represent the
total population, and then all units within selected clusters are
included in the sample.
• No units from non‐selected clusters are included in the
sample—they are represented by those from selected clusters.
• This differs from stratified sampling, where some units are
selected from each group
26
Example
27
Cluster sampling,…
• cost reduction is a reason for using cluster sampling.
28
Cluster sampling,…
29
Cluster sampling
30
Multi‐stage sampling
31
Multi‐stage sampling,…
• In the second stage, population units are picked from within
the selected clusters for a final sample.
32
Multi‐stage sampling,…
• a list of all of the units in the population is not needed. All you
need is a list of clusters and list of the units in the selected
clusters.
33
Multi‐stage sampling,…
34
Non‐probability sampling
• There is an assumption that there is an even distribution of
characteristics within the population
• Any sample would be representative and because of that,
results will be accurate
• Elements are chosen arbitrarily, there is no way to estimate the
probability of any one element being included in the sample
36
Non‐probability sampling,…
37
Types of nonprobability sampling
1. Convenience or haphazard sampling
2. Volunteer sampling
3. Judgment sampling
4. Quota sampling
38
Convenience sampling
39
Convenience sampling,…
• For example;
• a scientist could use this method to determine whether a
lake is polluted.
41
Judgment sampling
• a sample is taken based on certain judgments about the overall
population
42
Judgment sampling,…
43
Judgment sampling,…
44
Quota sampling
• This is one of the most common forms of non‐probability
sampling.
45
Quota sampling,…
46
Quota sampling,…
47
Sampling Distribution
• The distribution of all possible values that can be assumed by
some statistic, computed from samples of the same size
randomly drawn from the same population
49
Sampling Distribution,…
50
Sampling Distribution,…
51
Sampling Distribution,…
52
Sampling Distribution,…
• Example:
– Let us consider a population with four elements represented
by X1 =1, X2=3, X3 =2, and X4 =4.
53
Sampling Distribution,…
• The samples are: (1, 1), (1, 3), (1, 2), (1, 4), (3, 1), (3, 3), (3,
2), (3, 4), (2, 1), (2, 3), (2, 2), (2, 4), (4, 1), (4, 3), (4, 2), (4, 4).
55
Sampling Distribution,…
• The important point here is that we select only one sample out
of all the possible samples using with or without replacement.
56
Sampling Distribution,…
• In that situation, we could find the value of statistic for all the
possible samples.
• As an example, let us consider about the statistic, sample
mean.
• If there are Nn possible samples with replacement then the
number of values of statistic, mean, will be Nn too.
• Similarly, disregarding order the number of samples without
replacement is 𝑁 𝑛
57
Sampling Distribution,…
58
Sampling Distribution of the Sample Mean
• The sampling distribution of the sample mean is widely used
in statistics.
59
Sampling Distribution of the Sample
Mean,…
• Example:
– Let us consider a population of size N = 5 and a sample of
size n = 3.
– The population elements are X1=30, X2 =25, X3 = 40, X4
=27, and X5 =35.
– The random sample is X1, X2, X3).
– The number of samples using without replacement and
disregarding order is 53 =10
60
Sampling Distribution of the Sample
Mean,…
5
𝑖=1 𝑥𝑖 157
• The population mean is µ= =31.40
𝑛 5
10
𝑖=1 𝑥𝑖
µ=
10
314
=31.40
10
61
Sampling Distribution of the Sample
Mean,…
• The population mean of the variable X and the population
mean of means from the random samples of size n are exactly
the same.
• From the above example, we observe that in both the cases, the
mean is 31.40
E(X)= µ
E( 𝑥) =µ
62
Characteristics of the Sampling
Distribution of 𝑥
Mean and Variance of X If X1, X2, . . ., Xn is a random sample of
size n from any distribution with mean µ and variance σ2, then:
1. The µx= µ
2. The variance of 𝑥 is
3. The standard deviation (standard error ) of 𝑥 is
63
Sampling from Normal Population
– If X1, X2, . . ., Xn is a random sample of size n from a
normal population with mean µ and variance σ2 that is
Normal (µ, σ2), then the sample mean has a normal
distribution
– mean =µ
𝜎2
– Variance= 𝑛
– the random variable in a random sample satisfies the
assumption that X ~N (µ, σ2)
𝑥 −𝜇
• Z=𝜎/ 𝑛
~𝑁𝑜𝑟𝑚𝑎𝑙(0,1)
64
Central Limit Theorem
65
Central Limit Theorem,…
• We can use this result when sampling from non-normal
distribution with known variance 𝜎2 and with large sample size
too
66
Sampling Distribution of 𝑥2 − 𝑥 2
67
68
σ2 unknown, Normal population
𝑥−𝜇
t=𝑠/ , has df (n-1), s is sample standard deviation
𝑛
69
Distribution of the sample proportion
71
The end !
THANK YOU
FOR
THANK YOU
FOR
!!
THANK YOU FOR YOUR ATTENTION!!
72