Sampling Distribution
Sampling Distribution
Objectives
• Identify the reason/”Why” for conducting random sampling
• Distinguish parameter and statistic
What is the use of “Statistic/s”
• Decision making
• Q: Why is decision making so difficult?
• A: Because we are never truly certain
• “Nakakatakot ang bagay na hindi tayo sigurado kasi alam nating importante
ito sa atin”
• In every decision we make, we let go of other possibilities/opportunities
• Thus, with information we have, we have to decide, but then again,
we’re bound by constraints- the data we have
• With what we have, can we say that’s enough for us to
conclude/decide about something?
Inferential Statistics
• We deal with decision making problems
• We decide/conclude about traits/characteristics of our population based on
data that we’re able to collect from a sample, a subset of population
• Sometimes, kailangan mo rin “mag-assume”
• With sample or information, you have to reach a conclusion that extends
beyond your collected data
• However, it’s impossible to be 100 % certain if your conclusions about your
population are correct or not
Sampling Distribution
• Equips us with tools to quantify the accuracy of our conclusions by
understanding uncertainty.
• Kahit imposibleng maging sigurado, posibleng masabi kung “gaano
kasigurado”
Sampling Distribution: Basic Definitions
• Population- totality of items, things or people under consideration;
relative(i.e. HUMMs students, this class, ASHS students)
• Sample- subset of population, part taken from population(i.e. Male
students in this class, this class(from ASHS classes) )
• This implies that the sample mean, being a random variable also has
a probability distribution.
Population vs. Statistic
• Population includes all the elements from a set of data.
åX i
X= i=1
n
Statistic
• Suppose (𝑋1 , 𝑋2 , … , 𝑋𝑛 ) is a random sample. A statistic is a random
variable that is a function of 𝑋1 , 𝑋2 , … , 𝑋𝑛 .
• Example:
a. The sample mean is a random variable that is a function of
𝑋1 , 𝑋2 , … , 𝑋𝑛 . Thus, it is a statistic.
• The formula for the sample mean, 𝑿 ഥ is given by:
å( X )
n 2
i
-X
S2 = i=1
n -1
Statistic as Random Variable
• This definition of a statistic is consistent with our previous definition
of “statistic” as a measure describing the sample. Now, however, we
are viewing the statistic as a random variable.
• Even if we cannot predict with certainty what the value of a statistic will
be, we can use its sampling distribution to understand its behavior in
probabilistic terms.
Sampling Distribution
• The probability distribution function of a statistic is called its sampling
distribution.
ഥ)
𝒇(𝑿 1/16 2/16 3/16 4/16 3/16 2/16 1/16
Sampling Distribution of sample mean:
w/replacement
ഥ
𝑿 0 0.5 1 1.5 2 2.5 3
ഥ)
𝒇(𝑿 1/16 2/16 3/16 4/16 3/16 2/16 1/16
Sampling Distribution: w/o replacement
3. Suppose we have four numbers, which are 0, 1, 2 and 3.
• We list all possible samples of size 2, without replacement, and
ത
for each sample compute for the value of the sample mean, 𝑋.
No. Sample ഥ
𝑿 No. Sample ഥ
𝑿
1 0, 1 7 2, 0
2 0, 2 8 2, 1
3 0, 3 9 2, 3
4 1, 0 10 3, 0
5 1, 2 11 3, 1
6 1, 3 12 3, 2
Sampling Distribution: w/o replacement
3. Suppose we have four numbers, which are 0, 1, 2 and 3.
• We list all possible samples of size 2, without replacement, and
ത
for each sample compute for the value of the sample mean, 𝑋.
No. Sample ഥ
𝑿 No. Sample ഥ
𝑿
1 0, 1 0.5 7 2, 0 1.0
2 0, 2 1.0 8 2, 1 1.5
3 0, 3 1.5 9 2, 3 2.5
4 1, 0 0.5 10 3, 0 1.5
5 1, 2 1.5 11 3, 1 2.0
6 1, 3 2.0 12 3, 2 2.5
Sampling Distribution of sample mean: w/o
replacement
ഥ
𝑿 0.5 1 1.5 2 2.5
ഥ)
𝒇(𝑿 2/12 2/12 4/12 2/12 2/12
ത
• We now solve for E(𝑋)
Sampling Distribution of sample mean: w/o
replacement
ഥ
𝑿 0.5 1 1.5 2 2.5
ഥ)
𝒇(𝑿 2/12 2/12 4/12 2/12 2/12
ത
• We now solve for Var(𝑋)
Sampling Distribution of Sample Mean
• In summary,
• For the original random variable X,
• 𝐸(𝑋)=3/2
• 𝑉𝑎𝑟(𝑋)=5/4
• Taking samples of size 2 with replacement,
• 𝐸(𝑋)=3/2
• 𝑉𝑎𝑟(𝑋)=5/8
• Taking samples of size 2 without replacement,
• 𝐸(𝑋)=3/2
• 𝑉𝑎𝑟(𝑋)=5/12
Theorem 1
• If all possible random samples of size n are drawn with replacement
from a finite population of size N with mean μ and standard deviation
σ, then the sample mean 𝑿 ഥ will have mean and variance given by: