Chapter 8 Ken Black
Chapter 8 Ken Black
Chapter 8 Ken Black
1
Copyright 2010 John Wiley & Sons, Inc.
Business Statistics, 6
th
ed.
by Ken Black
Chapter 8
Statistical
Inference:
Estimation for
Single Populations
Copyright 2010 John Wiley & Sons, Inc.
2
Learning Objectives
Know the difference between point and interval
estimation.
Estimate a population mean from a sample mean
when o is known.
Estimate a population mean from a sample mean
when o is unknown.
Copyright 2010 John Wiley & Sons, Inc.
3
Learning Objectives
Estimate a population proportion from a sample
proportion.
Estimate the population variance from a sample
variance.
Estimate the minimum sample size necessary to
achieve given statistical goals.
Copyright 2010 John Wiley & Sons, Inc.
4
A point estimate is a static taken from a sample that
is used to estimate a population parameter
Interval estimate - a range of values within which
the analyst can declare, with some confidence, the
population lies
Estimating the Population Mean
Copyright 2010 John Wiley & Sons, Inc.
5
Confidence Interval to Estimate
when o is Known
n
x
x
E
=
n
z x
n
z x
or
n
z x
o
o
o
+ s s
Point estimate
Interval Estimate
Copyright 2010 John Wiley & Sons, Inc.
6
Distribution of Sample Means
for 95% Confidence
.4750 .4750
X
95%
.025 .025
Z
1.96 -1.96 0
Copyright 2010 John Wiley & Sons, Inc.
7
is used to locate the Z value in constructing the
confidence interval
The confidence interval yields a range within which
the researcher feel with some confidence the
population mean is located
Z score the number of standard deviations a value
(x) is above or below the mean of a set of numbers
when the data are normally distributed
Estimating the Population Mean
Copyright 2010 John Wiley & Sons, Inc.
8
For a 95% confidence interval
= .05
/2 = .025
Value of /2 or z
.025
look at the standard normal
distribution table under
.5000 - .0250 = .4750
From Table A5 look up .4750, and read 1.96 as the
z value from the row and column
Estimating the Population Mean
Copyright 2010 John Wiley & Sons, Inc.
9
Confidence Intervals
Z values:
90% - 1.645
95% - 1.96
98% - 2.33
99% - 2.575
Copyright 2010 John Wiley & Sons, Inc.
10
96 . 1 , 85 , 46 , 510
2 /
= = = =
o
o z n x
78 . 519 22 . 500
78 . 9 510 78 . 9 510
85
46
96 . 1 510
85
46
96 . 1 510
2 / 2 /
s s
+ s s
+ s s
+ s s
o
o o
n
z x
n
z x
95% Confidence Interval for
Copyright 2010 John Wiley & Sons, Inc.
11
Example No. 1
A survey was taken of U.S. companies that do
business with firms in India. One of the questions
on the survey was: Approximately how many years
has your company been trading with firms in India?
A random sample of 44 responses to this question
yielded a mean of 10.455 years. Suppose the
population standard deviation for this question
is 7.7 years. Using this information, construct a 90%
confidence interval for the mean number of years that
a company has been trading in India for the population
of U.S. companies trading with firms in India.
Copyright 2010 John Wiley & Sons, Inc.
12
Example No. 2
A study is conducted in a company that employs 800
engineers. A random sample of 50 engineers reveals
that the average sample age is 34.3 years.
Historically, the population standard deviation of the
age of the companys engineers is approximately 8
years. Construct a 98% confidence interval to
estimate the average age of all the engineers in this
company.
Copyright 2010 John Wiley & Sons, Inc.
13
Estimating the Mean of a Normal
Population: Unknown o
The population has a normal distribution.
The value of the population Standard Deviation is
unknown, then sample Std Dev must be used in the
estimation process.
z distribution is not appropriate for these conditions
when the Population Std Dev is unknown, t
distribution is appropriate, and you use the Sample
Std Dev in the t formula
Copyright 2010 John Wiley & Sons, Inc.
14
t Distribution
A family of distributions -- a unique distribution for
each value of its parameter, degrees of freedom (d.f.)
Symmetric, Unimodal, Mean = 0, Flatter than a z
t distribution is used instead of the z distribution for
doing inferential statistics on the population mean
when the population Std Dev is unknown and the
population is normally distributed
With the t distribution, you use the Sample Std Dev
Copyright 2010 John Wiley & Sons, Inc.
15
n
s
x
t
=
A family of distributions - a unique distribution for
each value of its parameter using degrees of freedom
(d.f.)
Symmetric, Unimodal, Mean = 0, Flatter than a z
t formula
t Distribution
Copyright 2010 John Wiley & Sons, Inc.
16
t distribution flatter in middle and have more
area in their tails than the normal distribution
t distribution approach the normal curve as n becomes
larger
t distribution is to be used when the population variance
or population Std Dev is unknown, regardless of the size
of the sample
t Distribution Characteristics
Copyright 2010 John Wiley & Sons, Inc.
17
t table uses the area in the tail of the distribution
Emphasis in the t table is on , and each tail of the
distribution contains /2 of the area under the curve
when confidence intervals are constructed
t values are located at the intersection of the df
value and the selected /2 value
Reading the t Distribution
Copyright 2010 John Wiley & Sons, Inc.
18
1
1 , 2 / 1 , 2 /
1 , 2 /
=
+ s s
n df
n
s
t x
n
s
t x
or
n
s
t x
n n
n
o o
o
n df
n
s
t x
n
s
t x
or
n
s
t x
n
s
t x
n
s
t x
012 . 3
005 . 0
2
99 . 1
2
13 1 , 14 , 29 . 1 , 14 . 2
13 , 005 .
=
=
=
= = = = =
t
n df n s x
o
Solution for Example 3
Copyright 2010 John Wiley & Sons, Inc.
23
size sample =
proportion population =
- 1 =
proportion sample =
2 2
n
p
p q
p
where
n
q p
z p p
n
q p
z p
o o
+ s s
Confidence Interval to Estimate
the Population Proportion
Estimating the population proportion often
must be made
Copyright 2010 John Wiley & Sons, Inc.
24
Example #4
A clothing company produces mens jeans. The jeans
are made and sold with either a regular cut or a boot
cut. In an effort to estimate the proportion of their
mens jeans market in Oklahoma City that prefers
boot-cut jeans, the analyst takes a random sample
of 212 jeans sales from the companys two Oklahoma
City retail outlets. Only 34 of the sales were for
boot-cut jeans. Construct a 90% confidence interval
to estimate the proportion of the population in
Oklahoma City who prefer boot-cut jeans.
Copyright 2010 John Wiley & Sons, Inc.
25
20 . 0 12 . 0
04 . 0 16 . 0 04 . 0 16 . 0
212
) 84 . 0 )( 16 . 0 (
645 . 1 16 . 0
212
) 84 . 0 )( 16 . 0 (
645 . 1 16 . 0
s s
+ s s
+ s s
+ s s
p
p
p
n
q p
z p p
n
q p
z p
645 . 1 % 90
84 . 0 16 . 0 1
- 1 =
16 . 0
212
34
, 34 , 212
=
= =
= = = = =
z Confidence
p q
n
x
p x n
Example # 4
Copyright 2010 John Wiley & Sons, Inc.
26
1
) (
2
2
=
n
x x
s
1 - freedom of degrees
) 1 (
2
2
2
n
s n
=
=
o
_
Estimating the Population Variance
Population Parameter o
2
Estimator of o
2
_
2
formula for Single Variance
Copyright 2010 John Wiley & Sons, Inc.
27
( ) ( )
confidence of level 1
1
1 1
2
2
1
2
2
2
2
2
=
=
s s
o
_
o
_ o o
n df
s n s n
Confidence Interval for o
2
Copyright 2010 John Wiley & Sons, Inc.
28
Two Table Values of _
2
0 2 4 6 8 10 12 14 16 18 20
df = 7
.05
.05
.95
2.16735 14.0671
df 0.950 0.050
1 3.93219E-03 3.84146
2 0.102586 5.99148
3 0.351846 7.81472
4 0.710724 9.48773
5 1.145477 11.07048
6 1.63538 12.5916
7 2.16735 14.0671
8 2.73263 15.5073
9 3.32512 16.9190
10 3.94030 18.3070
20 10.8508 31.4104
21 11.5913 32.6706
22 12.3380 33.9245
23 13.0905 35.1725
24 13.8484 36.4150
25 14.6114 37.6525
Copyright 2010 John Wiley & Sons, Inc.
29
90% Confidence Interval for o
2
007146 . 001101 .
16735 . 2
0022125 ). 1 8 (
0671 . 14
0022125 ). 1 8 (
) 1 ( ) 1 (
________ __________ __________ __________
16735 . 2
0671 . 14
10 . , 7 1 , 8 , 0022125 .
2
2
2
2
1
2
2
2
2
2
2
95 .
2
2
1 .
1
2
2
1
2
05 .
2
2
1 .
2
2
2
s s
s s
s s
= = =
= = =
= = = = =
o
o
_
o
_
_ _ _
_ _ _
o
o o
o
o
s n s n
n df n s
Copyright 2010 John Wiley & Sons, Inc.
30
Example # 5
The U.S. Bureau of Labor Statistics publishes data on the hourly
compensation costs for production workers in manufacturing
for various countries. The latest figures published for Greece
show that the average hourly wage for a production worker in
manufacturing is $16.10. Suppose the business council of
Greece wants to know how consistent this figure is. They
randomly select 25 production workers in manufacturing from
across the country and determine that the standard deviation
of hourly wages for such workers is $1.12. Use this information
to develop a 95% confidence interval to estimate the
population variance for the hourly wages of production
workers in manufacturing in Greece. Assume that the hourly
wages for production workers across the country in
manufacturing are normally distributed.
Copyright 2010 John Wiley & Sons, Inc.
31
( ) ( )
( ) ( )
4277 . 2 7648 . 0
4011 . 12
) 2544 . 1 ( 1 25
3641 . 39
) 2544 . 1 ( 1 25
1 1
2
2
2
2
1
2
2
2
2
2
s s
s s
s s
o
o
_
o
_ o o
s n s n
4011 . 12
3641 . 39
05 . , 24 1 , 25 , 2544 . 1
2
975 .
2
2
05 .
1
2
2
1
2
025 .
2
2
05 .
2
2
2
= = =
= = =
= = = = =
_ _ _
_ _ _
o
o
o n df n s
Example # 5