Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
27 views

Chapter-7-Sampling & Sampling Distributions

This document provides an overview of sampling and sampling distributions in statistics. It defines key terms like population, parameter, statistic, and sampling. It also describes different sampling methods like simple random sampling, stratified random sampling, cluster sampling, and systematic sampling. The document explains how to construct a sampling distribution of the sample mean by taking random samples from a population and calculating the mean of each sample. The sampling distribution shows the relationship between possible sample means and their associated probabilities.

Uploaded by

Simeon solomon
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views

Chapter-7-Sampling & Sampling Distributions

This document provides an overview of sampling and sampling distributions in statistics. It defines key terms like population, parameter, statistic, and sampling. It also describes different sampling methods like simple random sampling, stratified random sampling, cluster sampling, and systematic sampling. The document explains how to construct a sampling distribution of the sample mean by taking random samples from a population and calculating the mean of each sample. The sampling distribution shows the relationship between possible sample means and their associated probabilities.

Uploaded by

Simeon solomon
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

Lecture notes on Introduction to Statistics Chapter 7: Sampling & Sampling Distributions

CHAPTER 7
7. Sampling and Sampling Distribution
Introduction
Given a variable X, if we arrange its values in ascending order and assign probability to each of the
values or if we present Xi in a form of relative frequency distribution the result is called Sampling
Distribution of X.
Definitions:
1. Parameter: Characteristic or measure obtained from a population.
2. Statistic: Characteristic or measure obtained from a sample.
3. Sampling: The process or method of sample selection from the population.
4. Sampling unit: the ultimate unit to be sampled or elements of the population to be sampled.
Examples:
 If some body studies Scio-economic status of the households, households are the sampling unit.
 If one studies performance of freshman students in some college, the student is the sampling
unit.
5. Sampling frame: is the list of all elements in a population under study.
Examples:
- List of households.
- List of students in the registrar office.
6. Errors in sample survey:
There are two types of errors
a) Sampling error:
- It is the discrepancy between the population value and sample value.
- May arise due to inappropriate sampling techniques applied
b) Non sampling errors: are errors due to procedure bias such as:
- Due to incorrect responses
- Measurement
- Errors at different stages in processing the data.
 The Need for Sampling
- Reduced cost
- Greater speed
- Greater accuracy
- Greater scope
- More detailed information can be obtained.

 There are two types of sampling.


1. Random Sampling or probability sampling
- It is a method of sampling in which all elements in the population have a pre-assigned non-zero
probability to be included in to the sample.
Examples:
 Simple random sampling
 Stratified random sampling
 Cluster sampling
 Systematic sampling

Page 1 of 5
Lecture notes on Introduction to Statistics Chapter 7: Sampling & Sampling Distributions

i. Simple Random Sampling:


- It is a method of selecting items from a population such that every possible sample of specific
size has an equal chance of being selected. In this case, sampling may be with or without
replacement. Or
- All elements in the population have the same pre-assigned non-zero probability to be
included in to the sample.
- Simple random sampling can be done either using the lottery method or table of random
numbers.

ii. Stratified Random Sampling:


- The population will be divided in to non-overlapping but exhaustive groups called strata.
- Simple random samples will be chosen from each stratum.
- Elements in the same strata should be more or less homogeneous while different in different
strata.
- It is applied if the population is heterogeneous.
- Some of the criteria for dividing a population into strata are: Sex (male, female); Age (under
18, 18 to 28, 29 to 39,); Occupation (blue-collar, professional, and other).

iii. Cluster Sampling:


- The population is divided in to non-overlapping groups called clusters.
- A simple random sample of groups or cluster of elements is chosen and all the sampling units
in the selected clusters will be surveyed.
- Clusters are formed in a way that elements with in a cluster are heterogeneous, i.e.
observations in each cluster should be more or less dissimilar.
- Cluster sampling is useful when it is difficult or costly to generate a simple random sample.
For example, to estimate the average annual household income in a large city we use cluster
sampling, because to use simple random sampling we need a complete list of households in
the city from which to sample. To use stratified random sampling, we would again need the
list of households. A less expensive way is to let each block within the city represent a
cluster. A sample of clusters could then be randomly selected, and every household within
these clusters could be interviewed to find the average annual household income.

iv. Systematic Sampling:


- A complete list of all elements with in the population (sampling frame) is required.
- The procedure starts in determining the first element to be included in the sample.
- Then the technique is to take the kth item from the sampling frame.
N
- Let, N  population size , n  sample size , k   sampling int erval.
n
- Chose any number between 1 and k . Suppose it is j (1  j  k ) .
- The j unit is selected at first and then ( j  k ) , ( j  2k ) ,....etc until the required
th th th

sample size is reached.

Page 2 of 5
Lecture notes on Introduction to Statistics Chapter 7: Sampling & Sampling Distributions

2. Non Random Sampling or non-probability sampling


- It is a sampling technique in which the choice of individuals for a sample depends on the basis of
convenience, personal choice or interest.
Examples:
 Judgment sampling.
 Convenience sampling
 Quota Sampling.
1. Judgment Sampling
- In this case, the person taking the sample has direct or indirect control over which items are
selected for the sample.
2. Convenience Sampling
- In this method, the decision maker selects a sample from the population in a manner that is
relatively easy and convenient.
3. Quota Sampling
- In this method, the decision maker requires the sample to contain a certain number of items
with a given characteristic. Many political polls are, in part, quota sampling.

Note: let N  population size , n  sample size.


1.Suppose simple random sampling is used
 We have N n possible samples if sampling is with replacement.
N
 We have   possible samples if sampling is with out replacement.
n
2. After this on wards, we consider that samples are drawn from a given population using
simple random sampling.

Sampling Distribution of the sample mean

- Sampling distribution of the sample mean is a theoretical probability distribution that shows the
functional relation ship between the possible values of a given sample mean based on samples of
size n and the probability associated with each value, for all possible samples of size n drawn from
that particular population.
- There are commonly three properties of interest of a given sampling distribution.
 Its Mean
 Its Variance
 Its Functional form.
Steps for the construction of Sampling Distribution of the mean

1. From a finite population of size N , randomly draw all possible samples of size n .
2. Calculate the mean for each sample.
3. Summarize the mean obtained in step 2 in terms of frequency distribution or relative frequency
distribution.

Page 3 of 5
Lecture notes on Introduction to Statistics Chapter 7: Sampling & Sampling Distributions

Example: Suppose we have a population of size N  5 , consisting of the age of five children: 6, 8,
10, 12, and 14. Take samples of size 2 with replacement and construct sampling distribution of the
sample mean.
Solution: N  5, n  2
 We have N  5  25 possible samples since sampling is with replacement.
n 2

Step 1: Draw all possible samples:

6 8 10 12 14
6 (6, 6) (6, 8) (6, 10) (6, 12) (6, 14)
8 (8,6) (8,8) (8,10) (8,12) (8,14)
10 (10,6) (10,8) (10,10) (10,12) (10,14)
12 (12,6) (12,8) (12,10) (12,12) (12,14)
14 (12,6) (14,8) (12,10) (12,12) (12,14)
Step 2: Calculate the mean for each sample:
6 8 10 12 14
6 6 7 8 9 10
8 7 8 9 10 11
10 8 9 10 11 12
12 9 10 11 12 13
14 10 11 12 13 14
Step 3: Summarize the mean obtained in step 2 in terms of frequency distribution.
X Frequency
6 1
7 2
8 3
9 4
10 5
11 4
12 3
13 2
14 1

X , say  X
a) Find the mean of

X   i i 
X f 250
 10  
 i
f 25
 ( X i   X ) 2 f i 100
X , say  X , X    4 2
2 2
b) Find the variance of
 fi 25

Page 4 of 5
Lecture notes on Introduction to Statistics Chapter 7: Sampling & Sampling Distributions

Remark:
1. In general if sampling is with replacement
2
X  2

n
2. If sampling is with out replacement
2  N n
X  2
 
n  N 1 
3. In any case the sample mean is unbiased estimator of the population mean. i.e.
 X    E (X )   (Show!)
- Sampling may be from a normally distributed population or from a non-normally distributed
population.
- When sampling is from a normally distributed population, the distribution of X will posses the
following property.

1. The distribution of X will be normal


2. The mean of X is equal to the population mean , i.e.  X 
3. The variance of X is equal to the population variance divided by the sample size,
2
 X ~ N ( , )
2 n
i.e.  X 
2

n X 
Z  ~ N (0,1)
 n
Central Limit Theorem
 and finite variance  , the sampling
2
Given a population of any functional form with mean
distribution of X , computed from samples of size n from the population will be approximately
2
normally distributed with mean  and variance , when the sample size is large.
n

Page 5 of 5

You might also like