Interval Estimation
Interval Estimation
Interval Estimation
{
Bias
Example:
1 n 1 n 1 n
E ( x ) E ( xi ) E ( xi ) E ( x )
n i 1 n i 1 n i 1
Example of biased estimator: Sample
variance.
Given sample of size n from the population with unknown mean () and variance
(2) we estimate mean as we already know and variance (intuitively) as:
1 n 1 n 2
T ( xi x ) xi x
2 2
n i 1 n i 1
Sample variance is not an unbiased estimator for the population variance. That is why
when mean and variance are unknown the following equation is used for sample
variance: 1 n
E (T )
n
E (x
i 1
2
i ) E(x 2 )
1 n
[var( xi ) ( E ( xi )) 2 ] [var( x ) ( E ( x )) 2
n i 1
1 n
1 n
[ ] [ 2 ]
2 2
2
s
2
i
n 1 i 1
( x x ) 2
n i 1 n
n 1 2
n
Example
Suppose, you want to estimate mean and sd of score of a batsman
in one day cricket. So, you have randomly chosen 5 different
innings and recorded scores as below
20 52 8 63 11
Find out unbiased estimator of mean and variance.
x 30.8
1 n
s
n 1 i 1
( xi x ) 2
25.07
Interval Estimation
Consider the following statements:
x = 550
A single-valued estimate that conveys little information
about the actual value of the population mean.
We are 90% confident that is in the interval [449,551]
An interval estimate which locates the population mean
within a narrow interval, with a high level of confidence.
We are 99% confident that is in the interval [400,700]
An interval estimate which locates the population mean
within a broader interval, with a lower level of confidence.
Interval Estimator =
Point Estimator Margin of
Error
Elements of Interval Estimation
95%
0 .3
f(x)
0 .2
0 .1
2.5% 2.5%
0 .0
x
x 1.96
x 1.96 n
n
x
2.5% fall below the interval
x
x
x
x 2.5% fall above the interval
x
x
x
We define z as the z value that cuts off a right-tail area of under the standard
2 2
normal curve.
P z z z (1 )
< <
0.2
0.1 2 2
2 2
0.0 (1- )100% Confidence Interval:
-5 -4 -3 -2 -1 0 1 2 3 4 5
z Z z x z
2 2
2 n
Critical Values of z and Levels of
Confidence
S t a n d a rd N o r m al Di s trib utio n
(1 ) z 0.4
2 2 (1 )
0.3
0.99 0.005 2.576
f(z)
0.2
Confidence
Intervals
Mean Proportion
Known
known unknown
Confidence Interval of (
known)
Assumption
Population Standard Deviation, is Known
Sample size is large
0.4 0 .4
0.3 0 .3
f(z)
f(z)
0.2 0 .2
0.1 0 .1
0.0 0 .0
-5 -4 -3 -2 -1 0 1 2 3 4 5 -5 -4 -3 -2 -1 0 1 2 3 4 5
Z Z
0 .4 0 .9
0 .8
0 .3 0 .7
0 .6
0 .5
f(x)
f(x)
0 .2
0 .4
0 .3
0 .1
0 .2
0 .1
0 .0 0 .0
x x
s s
x z / 2 x z / 2
n n
1 n
where s i x x 2
n i 1
Practice Problem 1:
A manufacturer of light bulbs claims that its light bulbs have a mean life hours
with a standard deviation of 85 hours. A random sample of 40 such bulbs is
selected for testing. If the sample produces a mean value of 1505 hours, find out
95% Confidence Interval of .
Solution: Given, n=40 (large), =85 (known), 1-=0.95, =0.05,
x 1505
z / 2 z 0.025 1.96
Therefore,
95% CI of is given by
85 85
1505 40 1.96 , 1505 40 1.96
1478.66 , 1531.34
Practice Problem 2:
Waiting times (in hours) at a popular restaurant are found to have a mean waiting
time of 1.52 hours with sd 2.25hrs. for a sample of 50 customers. Construct the
99% confidence interval for the estimate of the population mean.
Solution: Given, n=50 (large), s=2.25 (estimated), 1-=0.99, =0.01,
z / 2 z 0.005 2.58
x 1.52
Therefore,
99% CI of is given by
2.25 2.25
1.52 2.58 , 1.52 2.58
50 50
1.20 , 2.34
Large-Sample Confidence Intervals
for the Population Proportion, p
For estimating p , a sample is considered large enough when both n p an n q are greater
than 5.
Large-Sample Confidence Intervals
for the Population Proportion, p
Assumptions
Two Categorical Outcomes
Large Sample
p (1 p ) p (1 p )
p Z / 2 p p Z / 2
n n
8.3 Confidence Interval for
the Population Proportion
Example: Recall that Jared Beane wants to estimate the
proportion of all ultra-green cars that obtain over 100 mpg. Use
the sample information to construct a 90% confidence interval
of the population proportion.
Solution: Note that p 7 25 0.28. In addition, the
normality assumption is met since np > 5 and n(1 p) > 5.
Thus,
p 1 p 0.28 1 0.28
p z 2 =0.28 1.645 0.28 0.148
n 28
LO 8.6
Practice Problem 3:
A marketing research firm wants to estimate the share that foreign companies
have in the Indian market for certain products. A random sample of 100
consumers is obtained, and it is found that 34 people in the sample are users
of foreign-made products; the rest are users of domestic products. Give a
95% confidence interval for the share of foreign products in this market.
pq ( 0.34 )( 0.66)
p z 0.34 1.96
2
n 100
0.34 (1.96)( 0.04737 )
0.34 0.0928
0.2472 ,0.4328
Thus, the firm may be 95% confident that foreign manufacturers control
anywhere from 24.72% to 43.28% of the market.
Reducing the Width of Confidence Intervals -
The Value of Information
The width of a confidence interval can be reduced only at the
price of:
a lower level of confidence, or
a larger sample.
The t Distribution
If repeated samples of size n are taken from a normal
population with a finite variance, then the
statistic T follows the t distribution
with (n 1) degrees of freedom, df. X
T
S n
Degrees of freedom determine
the extent of the broadness of the tails of the
distribution; the fewer the degrees of freedom,
the broader the tails.
LO 8.4
8.2 Confidence Interval for the
Population Mean When Is Unknown
Summary of the tdf Distribution
Bell-shaped and symmetric around 0 with asymptotic
LO 8.4
Confidence Interval of (
unknown)
Assumption
Population Distribution is Normal
Population Standard Deviation, is unknown
s s
x t / 2,n 1 x t / 2,n 1
n n
where t / 2 is the value of the t distribution 1 n
with n-1 degrees of freedom that cuts off where s i x x 2
The critical value of t for df = (n -1) = (15 -1) =14 and a right-
tail area of 0.025 is:
15 2
t 0.025 2.145
`2= = 13.125; ` = 3.623
14
The corresponding confidence interval or interval estimate is:
s
x t 0.025
n
3.623
10.37 2.145
15
10.37 1.81
8.56,12.18
Sample-Size Determination
Before determining the necessary sample size, three questions must be
answered:
How close do you want your sample estimate to be to the unknown
parameter? (What is the desired bound, B?)
What do you want the desired confidence level (1-) to be so that the
distance between your estimate and the parameter is less than or equal to
B?
What is your estimate of the variance (or standard deviation) of the
population in question?
For example : (1 - )% Confidence Interval for : x z
2 n
}
Bound, B
Minimum Sample Size: Mean
and Proportion
Minimum required sample size in estimating the population
mean, :
z2 2
n 2 2
B
Bound of estimate : B (Known)
2
2 1.645 10.50
2
z
n 74.58 or 75
D 2
LO 8.7
Example 1
A marketing research firm wants to conduct a survey to estimate the average
amount spent on entertainment by each person visiting a popular resort. The
people who plan the survey would like to determine the average amount spent by
all people visiting the resort to within $120, with 95% confidence. From past
operation of the resort, an estimate of the population standard deviation is
s = $400. What is the minimum required sample size?
z
2 2
n 2
2
B
2 2
(1.96) ( 400)
2
120
42.684 43
Example 2
The manufacturers of a sports car want to estimate the proportion of people in a
given income bracket who are interested in the model. The company wants to
know the population proportion, p, to within 0.01 with 99% confidence. Current
company records indicate that the proportion p may be around 0.25. What is the
minimum required sample size for this survey?
z2 pq
n 2
B2
2.5762 (0.25)(0.75)
. 2
010
124.42 125