Sampling

1. A sampling distribution describes the distribution of all possible sample means that could be drawn from a population. 2. For a sample size of 25 graduates, there is a very low (0.62%) probability that the sample mean would be $750 or less if the population mean was actually $800. 3. Therefore, based on this analysis, the student would conclude that the dean's claim of an average salary of $800 is not justified.

Uploaded by

SAKHAWAT HOSSAIN KHAN MD

Sampling Distributions

Prof. Byeong-Yun Chang, Ph.D.,

MBA in IB
Ajou University

9.1
Sampling Distributions…
A sampling distribution is created by, as the name suggests,
sampling.

The method we will employ relies on the rules of probability and
the laws of expected value and variance to derive the
sampling distribution.

For example, consider the roll of one and two dice…

9.2
Sampling Distribution of the Mean…
A fair die is thrown infinitely many times,
with the random variable X = # of spots on any throw.

The probability distribution of X is:

x 1 2 3 4 5 6
P(x) 1/6 1/6 1/6 1/6 1/6 1/6

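…and the mean and variance are calculated as well:

$\mu = \sum x P(x) = 1(1/6) + 2(1/6) + \dots + 6(1/6) = 3.5$
$\sigma^2 = \sum (x - \mu)^2 P(x) = (1 - 3.5)^2(1/6) + \dots + (6 - 3.5)^2(1/6) = 2.92$
$\sigma = \sqrt{2.92} = 1.71$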
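The mean and variance of a single die can be checked directly from the definitions $\mu = \sum x P(x)$ and $\sigma^2 = \sum (x - \mu)^2 P(x)$. The following is a minimal Python sketch; the variable names are illustrative and do not come from the slides.

```python
from fractions import Fraction

# Probability distribution of one fair die: each face has probability 1/6.
die = {x: Fraction(1, 6) for x in range(1, 7)}

# Mean: sum of x * P(x).
mu = sum(x * p for x, p in die.items())

# Variance: sum of (x - mu)^2 * P(x).
var = sum((x - mu) ** 2 * p for x, p in die.items())

print(mu, var)  # 7/2 35/12 (that is, 3.5 and about 2.92)
```

Exact fractions are used here only to avoid rounding noise; ordinary floats would serve equally well.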

9.3
Sampling Distribution of Two Dice
A sampling distribution is created by looking at
all samples of size n=2 (i.e. two dice) and their means…

While there are 36 possible samples of size 2, there are only
11 values for $\bar{x}$, and some (e.g. $\bar{x}$ = 3.5) occur more
frequently than others (e.g. $\bar{x}$ = 1).
9.4
Sampling Distribution of Two Dice…
The sampling distribution of $\bar{x}$ is shown below:

$\bar{x}$    $P(\bar{x})$
1.0    1/36
1.5    2/36
2.0    3/36
2.5    4/36
3.0    5/36
3.5    6/36
4.0    5/36
4.5    4/36
5.0    3/36
5.5    2/36
6.0    1/36

[Figure: bar chart of $P(\bar{x})$ against $\bar{x}$ = 1.0, 1.5, …, 6.0, with heights from 1/36 to 6/36]
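The sampling distribution of the mean of two dice can be reproduced by listing all 36 samples and tallying their means. This is a small illustrative sketch in Python; names such as `sampling_dist` are assumptions made for the example.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# All 36 equally likely ordered samples of size n = 2 from one fair die.
samples = list(product(range(1, 7), repeat=2))

# Count how often each sample mean occurs.
counts = Counter(Fraction(a + b, 2) for a, b in samples)

# Sampling distribution: P(xbar) = count / 36.
sampling_dist = {xbar: Fraction(c, len(samples)) for xbar, c in sorted(counts.items())}

for xbar in sampling_dist:
    print(f"{float(xbar):.1f}  {counts[xbar]}/36")
```

The loop prints the 11 possible values of the sample mean with their counts out of 36, from 1/36 at the extremes up to 6/36 at 3.5.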

9.5
Compare…
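Compare the distribution of X…

[Figure: two bar charts side by side. Left: the distribution of X over the values 1 to 6. Right: the sampling distribution of $\bar{X}$ over the values 1.0 to 6.0.]

…with the sampling distribution of $\bar{X}$.

As well, note that:

$\mu_{\bar{x}} = \mu$
$\sigma^2_{\bar{x}} = \sigma^2 / 2$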

9.6
Generalize…
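We can generalize the mean and variance of the sampling distribution of
two dice:

$\mu_{\bar{x}} = \mu$
$\sigma^2_{\bar{x}} = \sigma^2 / 2$

…to n dice:

$\mu_{\bar{x}} = \mu$
$\sigma^2_{\bar{x}} = \sigma^2 / n$

The standard deviation of the
sampling distribution is
called the standard error:

$\sigma_{\bar{x}} = \sigma / \sqrt{n}$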
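The generalization to n dice can be verified exactly for small n by building the distribution of the sample mean through repeated convolution. The sketch below is one way to do that in Python; the helper names `mean_distribution` and `mean_and_variance` are hypothetical, introduced only for this illustration.

```python
from fractions import Fraction

def mean_distribution(n):
    """Exact distribution of the mean of n fair dice, built by repeated convolution."""
    sums = {0: Fraction(1)}  # distribution of the running total
    for _ in range(n):
        nxt = {}
        for total, p in sums.items():
            for face in range(1, 7):
                nxt[total + face] = nxt.get(total + face, 0) + p * Fraction(1, 6)
        sums = nxt
    return {Fraction(total, n): p for total, p in sums.items()}

def mean_and_variance(dist):
    """Mean and variance of a discrete distribution given as {value: probability}."""
    mu = sum(x * p for x, p in dist.items())
    var = sum((x - mu) ** 2 * p for x, p in dist.items())
    return mu, var

mu, var = mean_and_variance(mean_distribution(1))  # one die: 7/2 and 35/12
for n in (2, 5, 10):
    mu_bar, var_bar = mean_and_variance(mean_distribution(n))
    # The mean is unchanged and the variance is divided by n.
    print(n, mu_bar == mu, var_bar == var / n, round(float(var_bar) ** 0.5, 3))
```

Each printed row shows the sample size, two exact equality checks, and the standard error (the square root of the variance of the sample mean).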

9.7
Central Limit Theorem…
The sampling distribution of the mean of a random sample
drawn from any population is approximately normal for a
sufficiently large sample size.

The larger the sample size, the more closely the sampling
distribution of $\bar{X}$ will resemble a normal distribution.

9.8
Central Limit Theorem…
If the population is normal, then $\bar{X}$ is normally distributed
for all values of n.

If the population is non-normal, then $\bar{X}$ is approximately
normal only for larger values of n.

In most practical situations, a sample size of 30 may be
sufficiently large to allow us to use the normal distribution
as an approximation for the sampling distribution of $\bar{X}$.
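A simulation can make the central limit theorem concrete. The sketch below draws sample means from a heavily skewed population and reports how their skewness shrinks as n grows. The exponential population, the seed, the number of repetitions, and the function names are all assumptions chosen for illustration; none of them come from the slides.

```python
import random
import statistics

def skewness(values):
    """Sample skewness: the average cubed z-score (0 for a symmetric distribution)."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)
    return statistics.fmean(((v - m) / s) ** 3 for v in values)

def sample_means(n, reps, rng):
    """Means of `reps` random samples of size n from an exponential population (mean 1)."""
    return [statistics.fmean(rng.expovariate(1.0) for _ in range(n)) for _ in range(reps)]

rng = random.Random(0)  # fixed seed so the run is repeatable
skews = {n: skewness(sample_means(n, 10_000, rng)) for n in (1, 5, 30)}
for n, sk in skews.items():
    print(n, round(sk, 2))
```

The skewness of the sample means should fall toward 0 as n goes from 1 to 30, which is the sense in which the sampling distribution looks more normal for larger samples.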

9.9
Sampling Distribution of the Sample Mean
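1. $\mu_{\bar{x}} = \mu$

2. $\sigma^2_{\bar{x}} = \sigma^2 / n$ and $\sigma_{\bar{x}} = \sigma / \sqrt{n}$

3. If X is normal, $\bar{X}$ is normal. If X is nonnormal, $\bar{X}$ is
approximately normal for sufficiently large sample sizes.
Note: the definition of “sufficiently large” depends on the
extent of nonnormality of X (e.g. heavily skewed;
multimodal)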

9.10
Sampling Distribution of the Sample Mean
We can express the sampling distribution of the mean simply
as

$Z = \frac{\bar{X} - \mu}{\sigma / \sqrt{n}}$

9.11
Sampling Distribution of the Sample Mean
The summaries above assume that the population is infinitely
large. However, if the population is finite the standard error is

$\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}} \sqrt{\frac{N - n}{N - 1}}$

where N is the population size and

$\sqrt{\frac{N - n}{N - 1}}$

is the finite population correction factor.

9.12
Sampling Distribution of the Sample Mean
If the population size is large relative to the sample size the
finite population correction factor is close to 1 and can be
ignored.

We will treat any population that is at least 20 times larger
than the sample size as large.

In practice most applications involve populations that
qualify as large.

As a consequence the finite population correction factor is
usually omitted.
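To see how little the correction matters for a large population, the standard error can be computed with and without it. The following Python sketch assumes illustrative values (σ = 100, n = 25) and a hypothetical helper named `standard_error`.

```python
import math

def standard_error(sigma, n, N=None):
    """Standard error of the sample mean; applies the finite population
    correction sqrt((N - n) / (N - 1)) when a population size N is given."""
    se = sigma / math.sqrt(n)
    if N is not None:
        se *= math.sqrt((N - n) / (N - 1))
    return se

sigma, n = 100, 25
print(standard_error(sigma, n))                       # 20.0 (infinite population)
print(round(standard_error(sigma, n, N=20 * n), 2))   # 19.51 (population 20 times the sample)
print(round(standard_error(sigma, n, N=50), 2))       # 14.29 (population only twice the sample)
```

At 20 times the sample size the correction changes the standard error by about 2.5%, while for a population only twice the sample size it changes it by almost 30%.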
9.13
Example 9.1(a)…
The foreman of a bottling plant has observed that the
amount of soda in each “32-ounce” bottle is actually a
normally distributed random variable, with a mean of 32.2
ounces and a standard deviation of .3 ounce.

If a customer buys one bottle, what is the probability that

the bottle will contain more than 32 ounces?

9.14
Example 9.1(a)…
We want to find P(X > 32), where X is normally distributed
and µ = 32.2 and σ = .3

$P(X > 32) = P\left(\frac{X - \mu}{\sigma} > \frac{32 - 32.2}{.3}\right) = P(Z > -.67) = 1 - .2514 = .7486$

“there is about a 75% chance that a single bottle of soda
contains more than 32oz.”
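The single-bottle probability can be checked without a table. This is a minimal sketch using `NormalDist` from Python's standard library; the variable names are illustrative.

```python
from statistics import NormalDist

fill = NormalDist(mu=32.2, sigma=0.3)  # amount of soda in one bottle, in ounces
p_one = 1 - fill.cdf(32)               # P(X > 32)
print(round(p_one, 4))
```

The result is close to .7486 but not identical, because the table lookup rounds z to −.67 while the code uses the unrounded z of about −.667.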

9.15
Example 9.1(b)…
The foreman of a bottling plant has observed that the
amount of soda in each “32-ounce” bottle is actually a
normally distributed random variable, with a mean of 32.2
ounces and a standard deviation of .3 ounce.

If a customer buys a carton of four bottles, what is the

probability that the mean amount of the four bottles will
be greater than 32 ounces?

9.16
Example 9.1(b)…
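We want to find P($\bar{X}$ > 32), where X is normally distributed
with µ = 32.2 and σ = .3

Things we know:
1) X is normally distributed, therefore so is $\bar{X}$.

2) $\mu_{\bar{x}} = \mu$ = 32.2 oz.

3) $\sigma_{\bar{x}} = \sigma / \sqrt{n} = .3 / \sqrt{4} = .15$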

9.17
Example 9.1(b)…
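If a customer buys a carton of four bottles, what is the
probability that the mean amount of the four bottles will be
greater than 32 ounces?

$P(\bar{X} > 32) = P\left(\frac{\bar{X} - \mu_{\bar{x}}}{\sigma_{\bar{x}}} > \frac{32 - 32.2}{.15}\right) = P(Z > -1.33) = 1 - .0918 = .9082$

“There is about a 91% chance the mean of the four bottles
will exceed 32oz.”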
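The same check works for the carton of four, with the standard error σ/√n in place of σ. A short sketch, again using the standard library's `NormalDist` and illustrative variable names:

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 32.2, 0.3, 4
xbar = NormalDist(mu, sigma / sqrt(n))  # sampling distribution of the mean; standard error .15
p_carton = 1 - xbar.cdf(32)             # P(sample mean > 32)
print(round(p_carton, 4))
```

The value lands at roughly 91%, in line with the quoted conclusion; any small gap from a table-based figure comes from rounding z to two decimals.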

9.18
Graphically Speaking…
[Figure: two normal curves, both centered at mean = 32.2. Left panel: what is the probability that one bottle will contain more than 32 ounces? Right panel: what is the probability that the mean of four bottles will exceed 32 oz?]

9.19
Chapter-Opening Example
Salaries of a Business School’s Graduates
In the advertisements for a large university, the dean of
the School of Business claims that the average salary
of the school’s graduates one year after graduation is
$800 per week with a standard deviation of $100.

A second-year student in the business school who has
just completed his statistics course would like to check
whether the claim about the mean is correct.

9.20
Chapter-Opening Example
Salaries of a Business School’s Graduates
He does a survey of 25 people who graduated one year ago
and determines their weekly salary.

He discovers the sample mean to be $750.

To interpret his finding he needs to calculate the probability
that a sample of 25 graduates would have a mean of $750 or
less when the population mean is $800 and the standard
deviation is $100.

After calculating the probability, he needs to draw some
conclusions.
9.21
Chapter-Opening Example
We want to find the probability that the sample mean is less
than $750. Thus, we seek

$P(\bar{X} < 750)$

The distribution of X, the weekly income, is likely to be
positively skewed, but not sufficiently so to make the
distribution of $\bar{X}$ nonnormal. As a result, we may assume that $\bar{X}$
is normal with mean

$\mu_{\bar{x}} = \mu = 800$

and standard deviation

$\sigma_{\bar{x}} = \sigma / \sqrt{n} = 100 / \sqrt{25} = 20$

9.22
Chapter-Opening Example
Thus,

$P(\bar{X} < 750) = P\left(\frac{\bar{X} - \mu_{\bar{x}}}{\sigma_{\bar{x}}} < \frac{750 - 800}{20}\right)$
$= P(Z < -2.5)$
$= .5 - .4938$
$= .0062$

The probability of observing a sample mean as low as $750 when
the population mean is $800 is extremely small. Because this event
is quite unlikely, we would have to conclude that the dean's claim is
not justified.
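The calculation for the salary example can be reproduced in a few lines. This is a sketch using Python's `statistics.NormalDist`; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 800, 100, 25
xbar = NormalDist(mu, sigma / sqrt(n))  # sampling distribution of the mean; standard error 20
p_low = xbar.cdf(750)                   # P(sample mean < 750) if the dean's claim holds
print(round(p_low, 4))                  # 0.0062
```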

9.23
Using the Sampling Distribution for Inference
Here’s another way of expressing the probability calculated from a
sampling distribution.
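$P(-1.96 < Z < 1.96) = .95$

Substituting the formula for the sampling distribution

$P\left(-1.96 < \frac{\bar{X} - \mu}{\sigma / \sqrt{n}} < 1.96\right) = .95$

With a little algebra

$P\left(\mu - 1.96 \frac{\sigma}{\sqrt{n}} < \bar{X} < \mu + 1.96 \frac{\sigma}{\sqrt{n}}\right) = .95$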

9.24
Using the Sampling Distribution for Inference
Returning to the chapter-opening example where µ = 800, σ = 100,
and n = 25, we compute

$P\left(800 - 1.96 \frac{100}{\sqrt{25}} < \bar{X} < 800 + 1.96 \frac{100}{\sqrt{25}}\right) = .95$

$P(760.8 < \bar{X} < 839.2) = .95$

This tells us that there is a 95% probability that a sample mean will
fall between 760.8 and 839.2. Because the sample mean was
computed to be $750, we would have to conclude that the dean's
claim is not supported by the statistic.
9.25
Using the Sampling Distribution for Inference
Changing the probability from .95 to .90 changes the probability
statement to

$P\left(\mu - 1.645 \frac{\sigma}{\sqrt{n}} < \bar{X} < \mu + 1.645 \frac{\sigma}{\sqrt{n}}\right) = .90$

9.26
Using the Sampling Distribution for Inference
We can also produce a general form of this statement

$P\left(\mu - z_{\alpha/2} \frac{\sigma}{\sqrt{n}} < \bar{X} < \mu + z_{\alpha/2} \frac{\sigma}{\sqrt{n}}\right) = 1 - \alpha$

In this formula α (Greek letter alpha) is the probability that
$\bar{X}$ does not fall into the interval.

To apply this formula all we need do is substitute the values for
µ, σ, n, and α.

9.27
Using the Sampling Distribution for Inference
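For example, with µ = 800, σ = 100, n = 25 and α = .01, we
produce

$P\left(\mu - z_{.005} \frac{\sigma}{\sqrt{n}} < \bar{X} < \mu + z_{.005} \frac{\sigma}{\sqrt{n}}\right) = 1 - .01$

$P\left(800 - 2.575 \frac{100}{\sqrt{25}} < \bar{X} < 800 + 2.575 \frac{100}{\sqrt{25}}\right) = .99$

$P(748.5 < \bar{X} < 851.5) = .99$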
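The general statement translates into a small function that takes µ, σ, n, and α and returns the interval. The sketch below is one possible version; the function name `mean_interval` is hypothetical, and `NormalDist().inv_cdf` from the standard library supplies $z_{\alpha/2}$ in place of a table.

```python
from math import sqrt
from statistics import NormalDist

def mean_interval(mu, sigma, n, alpha):
    """Interval that contains the sample mean with probability 1 - alpha:
    mu +/- z_{alpha/2} * sigma / sqrt(n)."""
    z = NormalDist().inv_cdf(1 - alpha / 2)  # z_{alpha/2}
    half_width = z * sigma / sqrt(n)
    return mu - half_width, mu + half_width

for alpha in (0.10, 0.05, 0.01):
    lo, hi = mean_interval(800, 100, 25, alpha)
    print(alpha, round(lo, 1), round(hi, 1))
# 0.1 767.1 832.9
# 0.05 760.8 839.2
# 0.01 748.5 851.5
```

The rows for α = .05 and α = .01 agree with the intervals worked out by hand for the salary example.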

9.28
Sampling Distribution of a Proportion…
The estimator of a population proportion of successes is the
sample proportion. That is, we count the number of
successes in a sample and compute:

$\hat{P} = \frac{X}{n}$

(read this as “p-hat”).

X is the number of successes, n is the sample size.

9.29
Normal Approximation to Binomial…
Binomial distribution with n = 20 and p = .5 with a normal
approximation superimposed (µ = 10 and σ = 2.24)

9.30
Normal Approximation to Binomial…
Where did the values µ = 10 and σ = 2.24 come from?

From §7.6 we saw that, for a binomial random variable:

$\mu = np$ and $\sigma = \sqrt{np(1 - p)}$

Hence:
$\mu = 20(.5) = 10$
and
$\sigma = \sqrt{20(.5)(1 - .5)} = 2.24$
9.31
Normal Approximation to Binomial…
Normal approximation to the binomial works best when the
number of experiments, n, (sample size) is large, and the
probability of success, p, is close to 0.5

For the approximation to provide good results two

conditions should be met:
1) np ≥ 5
2) n(1–p) ≥ 5

9.32
Normal Approximation to Binomial…
To calculate P(X=10) using the
normal distribution, we can find
the area under the normal curve
between 9.5 & 10.5

P(X = 10) ≈ P(9.5 < Y < 10.5)

where Y is a normal random variable approximating

the binomial random variable X

9.33
Normal Approximation to Binomial…
In fact:
P(X = 10) = .176
while
P(9.5 < Y < 10.5) = .1742
the approximation is quite good.
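The comparison between the exact binomial probability and its normal approximation can be reproduced as follows. This is an illustrative Python sketch, with variable names chosen for the example, that also checks the two rule-of-thumb conditions.

```python
from math import comb, sqrt
from statistics import NormalDist

n, p = 20, 0.5

# Rule-of-thumb conditions: np >= 5 and n(1 - p) >= 5.
conditions_met = n * p >= 5 and n * (1 - p) >= 5

# Exact binomial probability P(X = 10).
exact = comb(n, 10) * p**10 * (1 - p) ** (n - 10)

# Normal approximation with the continuity correction: P(9.5 < Y < 10.5).
y = NormalDist(mu=n * p, sigma=sqrt(n * p * (1 - p)))
approx = y.cdf(10.5) - y.cdf(9.5)

print(conditions_met, round(exact, 4), round(approx, 4))
```

Computed without rounding z, the approximation comes out near .177, slightly closer to the exact value than the table-based .1742, which rounds z to .22 before the lookup.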


9.34
Sampling Distribution of a Sample Proportion…
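Using the laws of expected value and variance, we can
determine the mean, variance, and standard deviation of $\hat{P}$.
(The standard deviation of $\hat{P}$ is called the standard error of
the proportion.)

$E(\hat{P}) = p$
$V(\hat{P}) = \sigma^2_{\hat{p}} = \frac{p(1 - p)}{n}$
$\sigma_{\hat{p}} = \sqrt{p(1 - p) / n}$

Sample proportions can be standardized to a standard normal
distribution using this formulation:

$Z = \frac{\hat{P} - p}{\sqrt{p(1 - p) / n}}$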

9.35
Example 9.2
In the last election a state representative received 52% of the
votes cast.

One year after the election the representative organized a

survey that asked a random sample of 300 people whether
they would vote for him in the next election.

If we assume that his popularity has not changed what is the

probability that more than half of the sample would vote for
him?

9.36
Example 9.2
The number of respondents who would vote for the representative
is a binomial random variable with n = 300 and p = .52.

We want to determine the probability that the sample proportion is

greater than 50%. That is, we want to find

$P(\hat{P} > .50)$

We now know that the sample proportion $\hat{P}$ is approximately
normally distributed with mean p = .52 and standard deviation

$\sqrt{p(1 - p) / n} = \sqrt{(.52)(1 - .52) / 300} = .0288$

9.37
Example 9.2
Thus, we calculate

$P(\hat{P} > .50) = P\left(\frac{\hat{P} - p}{\sqrt{p(1 - p) / n}} > \frac{.50 - .52}{.0288}\right)$
$= P(Z > -.69)$
$= .7549$

If we assume that the level of support remains at 52%, the
probability that more than half the sample of 300 people
would vote for the representative is 75.49%.
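The election example can be checked numerically under the same normal approximation. A minimal Python sketch with illustrative variable names:

```python
from math import sqrt
from statistics import NormalDist

p, n = 0.52, 300
se = sqrt(p * (1 - p) / n)          # standard error of the proportion
p_hat = NormalDist(mu=p, sigma=se)  # approximate sampling distribution of P-hat
prob = 1 - p_hat.cdf(0.50)          # P(P-hat > .50)
print(round(se, 4), round(prob, 3))
```

The probability comes out at about .756, a touch above the table-based .7549, because the code does not round z to two decimals.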
9.38
Sampling Distribution: Difference of two means
The final sampling distribution introduced is that of the
difference between two sample means. This requires:

independent random samples be drawn from each of two
normal populations

If this condition is met, then the sampling distribution of the
difference between the two sample means, i.e. $\bar{X}_1 - \bar{X}_2$,
will be normally distributed.
(note: if the two populations are not both normally
distributed, but the sample sizes are “large” (>30), the
distribution of $\bar{X}_1 - \bar{X}_2$ is approximately normal)

9.39
Sampling Distribution: Difference of two means
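The expected value and standard deviation of the sampling
distribution of $\bar{X}_1 - \bar{X}_2$ are given by:

mean:

$\mu_{\bar{x}_1 - \bar{x}_2} = \mu_1 - \mu_2$

standard deviation:

$\sigma_{\bar{x}_1 - \bar{x}_2} = \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$

(also called the standard error of the difference between two
means)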
9.40
Example 9.3…
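Since the distribution of $\bar{X}_1 - \bar{X}_2$ is normal and has a
mean of

$\mu_1 - \mu_2$

and a standard deviation of

$\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$

we can compute Z (standard normal random variable) in this
way:

$Z = \frac{(\bar{X}_1 - \bar{X}_2) - (\mu_1 - \mu_2)}{\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}}$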

9.41
Example 9.3…
Starting salaries for MBA grads at two universities are
normally distributed with the following means and standard
deviations. Samples from each school are taken…
University 1 University 2
Mean 62,000 $/yr 60,000 $/yr
Std. Dev. 14,500 $/yr 18,300 $/yr
sample size n 50 60

What is the probability that the sample mean starting salary of

University #1 graduates will exceed that of the #2 grads?

9.42
Example 9.3…
“What is the probability that the sample mean starting salary
of University #1 graduates will exceed that of the #2 grads?”

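We are interested in determining $P(\bar{X}_1 > \bar{X}_2)$. Converting
this to a difference of means, what is $P(\bar{X}_1 - \bar{X}_2 > 0)$?

$\mu_1 - \mu_2 = 62{,}000 - 60{,}000 = 2{,}000$
$\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}} = \sqrt{\frac{14{,}500^2}{50} + \frac{18{,}300^2}{60}} = 3{,}128$

$P(\bar{X}_1 - \bar{X}_2 > 0) = P\left(Z > \frac{0 - 2{,}000}{3{,}128}\right) = P(Z > -.64) = .5 + .2389 = .7389$

“there is about a 74% chance that the sample mean
starting salary of U. #1 grads will exceed that of U. #2”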
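The difference-of-means probability can be verified with the figures from the table of the two universities. This is a short Python sketch; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu1, sigma1, n1 = 62_000, 14_500, 50  # University 1
mu2, sigma2, n2 = 60_000, 18_300, 60  # University 2

# Sampling distribution of the difference between the two sample means.
se_diff = sqrt(sigma1**2 / n1 + sigma2**2 / n2)
diff = NormalDist(mu1 - mu2, se_diff)

prob = 1 - diff.cdf(0)  # P(sample mean 1 - sample mean 2 > 0)
print(round(se_diff, 1), round(prob, 3))
```

The standard error is about 3,128 and the probability is about 74%, consistent with the quoted conclusion.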
9.43
From Here to Inference
In Chapters 7 and 8 we introduced probability distributions,
which allowed us to make probability statements about
values of the random variable.

A prerequisite of this calculation is knowledge of the

distribution and the relevant parameters.

9.44
From Here to Inference
In Example 7.9, we needed to know that the probability that
Pat Statsdud guesses the correct answer is 20% (p = .2) and
that the number of correct answers (successes) in 10
questions (trials) is a binomial random variable.

We then could compute the probability of any number of

successes.

9.45
From Here to Inference
In Example 8.2, we needed to know that the return on
investment is normally distributed with a mean of 10% and a
standard deviation of 5%.

These three bits of information allowed us to calculate the

probability of various values of the random variable.

9.46
From Here to Inference
The figure below symbolically represents the use of
probability distributions.

Simply put, knowledge of the population and its

parameter(s) allows us to use the probability distribution to
make probability statements about individual members of the
population.

Probability Distribution ---------- Individual

9.47
From Here to Inference
In this chapter we developed the sampling distribution,
wherein knowledge of the parameter(s) and some
information about the distribution allow us to make
probability statements about a sample statistic.

Population & Parameter(s) ----- Sampling distribution ----- Statistic

9.48
From Here to Inference
Statistical inference works by reversing the direction of the flow of
knowledge in the previous figure. The next figure displays
the character of statistical inference.

Starting in Chapter 10, we will assume that most population

parameters are unknown. The statistics practitioner will
sample from the population and compute the required
statistic. The sampling distribution of that statistic will
enable us to draw inferences about the parameter.

9.49
From Here to Inference

Statistic ----- Sampling distribution ----- Parameter
