Lecture 3 Sampling and Sampling Distribution - Probability and Non-Probability Sampling
Lecture 3 Sampling and Sampling Distribution - Probability and Non-Probability Sampling
• A random starting point is selected and then every kth member of the
population is selected
• In a systematic random sample the items or individuals of the
population are arranged in some way — alphabetically, in a file drawer
by date received, or by some other method. A random starting point is
selected, and then every kth member of the population is selected for
the sample. In a systematic random sample, you might take all the
items in the population and number them 1, 2, 3,.... Next, a random
starting point is selected, let's say 39. Every kth item thereafter, such as
every 100th, is selected for the sample. This means that 39, 139, 239,
339, and so on would be a part of the sample.
Stratified random sampling.
• It is not logical to expect that the results obtained from a sample will
coincide exactly with those from a population. For example, it is
unlikely that the mean welfare payment for a sample of 50 recipients
is exactly the same as the mean for all 4,000 welfare recipients. We
expect a difference between a sample statistic and its corresponding
population parameter. The difference is called sampling error.
• The difference between a sample statistic and its corresponding
population parameter
Sampling Distribution of the Sample Mean
• Suppose all possible samples of size n are selected from a specified population, and the
mean of each of these samples is computed. The distribution of these sample means is
called the sampling distribution of the sample mean.
• A probability distribution of all possible sample means of a given sample size
• The sampling distribution of the mean is a probability distribution and has the following
major characteristics:
1. The mean of all the sample means will be exactly equal to the population mean.
2. If the population from which the samples are drawn is normal, the distribution of
sample means is also normally distributed.
3. If the population from which the samples are drawn is not normal, the sampling
distribution is approximately normal, provided the samples are “sufficiently” large
(usually accepted to include at least 30 observations).
Sampling Distribution of the Proportion
• Consider a population of N = 5 numbers: 0, 3, 6, 3, 18. It consists of
even numbers 0, 6, 18 and two odd numbers 3, 3. Thus the
population proportion of even numbers is:
X 3
0. 6.
N 5
Now consider a sample of size n 3 , say, the first three population numbers: 0, 3, 6. Two of the
numbers in the sample are even i.e. x 3. Hence the sample proportion of even numbers is
• Ten samples of size can be drawn from a population of . Followings
are the all possible samples along with their sample estimates of
proportions.
Samples Observations x p
ABC 0,3,6 2 2/3
ABD 0,3,3 1 1/3
ABE 0,3,18 2 2/3
ACD 0,6,3 2 2/3
ACE 0,6,18 3 1
ADE 0,3,18 2 2/3
BCD 3,6,3 1 1/3
BCE 3,6,18 2 2/3
1/3 3 0.3
2/3 6 0.6
1 1 0.1
Now, calculate the mean and standard deviation of this sampling distribution of proportion:
p 0.6
(Verify it !)
p 0.2
p 0 .6
(1 ) N n (0.4)(0.6) 53
p 0.04 0.2
n N 1 3 5 1
Random Number Tables
APPENDIX
• How to use a random number table:
• Note: This method is one from a variety of methods of reading numbers from random number tables.
• 1. Assume you have the test scores for a population of 200 students. Each student has been assigned a number from
1 to 200. We want to randomly sample only 5 of the students for this demo.
• 2. Since the population size is a three-digit number, we will use the first three digits of the numbers listed in the table.
• 3. Without looking, point to a starting spot in the table. Assume we land on 75636 (3rd column, 2nd entry).
• 4. This location gives the first three digits to be 756. This choice is too large (> 200), so we choose the next number in
that column. Keep in mind that we are looking for numbers whose first three digits are from 001 to 200 (representing
students).
• 5. The second choice gives the first three digits to be 407, also too large. Continue down the column until you find 5
of the numbers whose first three digits are less than or equal to 200.
• 6. From this table, we arrive at 070 (07015), 038 (03811), 045 (04594), 055 (05542), and 194 (19428).
• 7. RESULT: Students 38, 45, 55, 70, and 194 will be used for our random sample.
• Our sample set of students has been randomly selected where each student had an equal chance of being selected
and the selection of one student did not influence the selection of other student