Chapter 4
Chapter 4
Chapter 4
Chapter 4
Estimation and Confidence Interval
1
Topics
Distribution of Sample Means and Sample
Proportions
Population Sample
3
Introduction
Parameters Numerical measures obtained from a population to describe the
characteristics of the population.
Example :
Mean height of all students in Malaysia is 1.48 m.
Population parameter = the mean height of all the students
= 1.48 m.
4
Introduction
The symbols and terms used in the sampling theory
Size N n
Mean x
Variance 2 s2
Proportion p
5
Sampling From An Infinite Population
(Or Finite Population With Replacement)
All possible random samples of size n with values
x1,, x2, x3 ,.,xn are selected from an infinite
population ( or a finite population with
replacement) with mean and variance and
2
6
Sampling From An Infinite Population
(or Finite Population With Replacement)
7
Sampling distribution of
the sample mean
The mean of the sample means:
x E( x )
The variance of the sample means:
2
x 2
Var ( x )
n
8
Standard Error Of The Mean
The standard deviation of the sampling distribution
of the sample mean is also known as the standard
error of the mean, denoted by x .
x
n
n sample size
population mean
population standard deviation 9
Standard Error Of The Mean
If the sample size n is large, then the standard
error of the mean is small.
Hence, the sample means will be closer to the
population mean.
10
Standard Error of The mean
2
x E ( X ) E ( X )
2
where
E( X ) x P( X x )
E ( X ) x P( X x )
2 2
11
Example 4.1
Consider a sample of size 2 taken from a set of
integers {0, 2, 4, 6, 8} with replacement. The total
number of samples is 5 x 5 = 25 and is listed below.
13
Example 4.1 Solution
14
Example 4.1 Solution
15
Example 4.1 Solution
2
E( X )
x P( X x )
2
20
16
Example 4.1 Solution
2
x E ( X ) E ( X )
2
20 4 2
2
17
Example 4.1 Solution
18
Example 4.2
19
Example 4.2 Solution
E( X ) 4
2 8 2
4 Var ( X ) 4
n 2 n
20
Example 4.3
Ans: 63.25, 20
21
Example 4.3 Solution
200
For sample B:
n =100x
63.25
10
200
x 20
100
22
Example 4.3 Solution
23
Central Limit Theorem
(a) Sampling distribution of the mean obtained
from any population.
(c) Proportion
xx x xx
f (x ) f (x ) f (x )
x x x
x
25
(a) Sampling Distribution Of The
Means From Any Population
If samples of size n are selected from any
population (with mean and variance 2), the
sampling distribution of the sample mean is
approximately a normal distribution with mean
2
and variance provided n is large (n 30).
n
26
(a) Sampling Distribution Of The
Mean From Any Population
2
X ~ N ( , ) n 30
n
27
Example 4.4
28
Example 4.4 (a) Solution
29
Example 4.4 (a) Solution
30
Example 4.4 (a) Solution
31
Example 4.4 (b) Solution
32
Example 4.4 (b) Solution
33
Example 4.5
Find the probability that the sample mean
selected at random for a sample of size 30 taken
from a population with binomial distribution,
parameters n = 10 and p = 0.5 will exceed the
value of 5.5.
34
Example 4.5 Solution
35
Example 4.6
A random sample of size m is selected form a
population with binomial distribution,
parameters n = 20 and p = 0.4. It is found that
4% of the sample mean is less than 7.5. Find
the value of m.
Ans: m = 59
36
Example 4.6 Solution
X : B(20, 0.4)
E ( X ) np 20(0.4) 8
Var ( X ) npq 20(0.4)(0.6) 4.8)
4.8
X ~ N (8, )
m
P ( X 7.5) 0.04
37
Example 4.6 Solution
4.8
P(Z
7.5 8
) 0.04
0.2857
4.8 m
m m 58.80
P ( Z 1.75) 0.04
59
7.5 8
1.75
4.8
m
38
(b) Sampling Distribution of The
Mean From A Normal Population
f (x ) f (x ) f (x )
x x x
2
X ~ N ( , )
n
40
Example 4.7
The length of a type of worm is normally distributed
with mean 15cm and standard deviation cm.
30
Find the probability that the length of a worm
selected at random lies between 14cm and 16cm.
X ~ N (15,30)
P (14 X 16)
P ( 143015 Z 163015 )
P (0.18 Z 0.18)
1 P ( Z 0.18) P ( Z 0.18)
1 2 P ( Z 0.18)
1 2(0.4286)
0.1428
42
Example 4.7 Solution
X ~ N (15, 30
25 )
P(14 X 16)
P ( 143015 Z 163015 )
25 25
P (0.91 Z 0.91)
1 P ( Z 0.91) P ( Z 0.91)
1 2 P ( Z 0.91)
1 2(0.1814)
0.6372 43
Example 4.8
A sample of size n is taken from a normally
distributed population with mean 80 and
variance 36. Find the value of n if the
probability of the sample mean exceeds 78 is
0.975.
Ans: n = 35
44
Example 4.8 Solution
P(Z 78 80
) 0.975
36
n
36
n 1.020
1 P( Z 78 80
36
n
) 0.975 n 34.6
n 35
P(Z ) 0.025
78 80
36
n
45
Example 4.9
A large random sample of size n is taken from a
normal population with mean 80 and standard
deviation 5. Find the least value of n if the sample
mean
n 2 n 2 n
P( Z ) P( Z ) 0.01
5 5
P( Z 2 n
5 ) 0.01 2 P ( Z 2 5 n ) 0.01
2 n
5 2.33 P( Z 2 n
5 ) 0.005
2 n
2.58
n 33.9 5
n 41.6
Least value of n is 34. Least value of n is 42.
47
(c) Proportion
A proportion is a comparison of two values.
If k is the number of elements with a particular
characteristic taken from a set of n elements, then the
proportion of elements with the particular feature is .
k
denotes the proportion from a population with a
n
particular characteristic.
P denotes the proportion from a sample with a particular
characteristic.
48
(c) Sampling Distribution
of The Proportion
If is the proportion of a population with a
particular characteristic and random samples of size n
are selected, then the probability distribution of the
sample proportions p that have the particular
characteristic is approximately normal with mean
and variance (1 , ) provided n is sufficiently large.
n
(1 )
p ~ N ,
n 49
(c) Sampling Distribution
of The Proportion
A probability distribution of all sample
proportions with a particular characteristic from
a population is called the sampling distribution
of the proportion.
The mean of the sampling distribution of the
proportion is given by:
p E ( p)
50
(c) Standard Error For The Sampling
Distribution Of The Sample Proportion
The standard deviation of the sampling distribution of
the sample proportion is denoted by p .
(1 )
p
n
It is also known as the standard error of the
proportion.
If the size n is large , then the standard error will be
small. Thus, the sample proportion will be closer to
the population proportion.
51
Example 4.10
A company manufactures a type of light bulbs. It
is known that 5% of the bulbs are defective. Find
the probability that a sample of 400 light bulbs
chosen at random will yield a proportion of
defective bulbs of
(a) 0.05(0.95)
p ~ N (0.05, 400 )
~ N (0.05, 0.00011875)
P( p 0.04)
P( Z 0.00011875
0.04 0.05
)
P( Z 0.92)
1 P( Z 0.92)
1 0.1788
0.8212
53
Example 4.10 Solution
(c) 0.05(0.95)
p ~ N (0.05, 400 )
~ N (0.05, 0.00011875)
P( p 0.045)
P(Z 0.045 0.05
0.00011875
)
P ( Z 0.46)
1 P( Z 0.46)
1 0.3228
0.6772
55
Example 4.10 Solution
~ N (0.05, 0.00011875)
P( p 0.048)
P ( Z 0.048 0.05
0.00011875
)
P ( Z 0.18)
P ( Z 0.18)
0.4286
56
Example 4.11
The proportion of students of a secondary school
who wear glasses is 0.35. A study is taken to
determine the proportion of students who wear
glasses by choosing a sample A of 30 students
and another sample B of 50 students. Find the
standard error of proportion for these two samples
and comment on your results.
When n A 30
0.35(1 0.35)
Standard error of sample A: p 0.0871
30
When nB 50
0.35(1 0.35)
Standard error of sample B: p 0.0675
50
58
Example 4.11 Solution
59
Point Estimation
Point estimation refers to the procedure whereby a value of
sample statistic is computed from the data of a sample to be
used as an estimate of the population parameter.
63
Unbiased Estimator For The
Population Mean
If X1, X2, X3 , Xn is a sample of size n taken
from a population with mean and variance 2,
then the unbiased estimator for the population mean
is
n
1 1
X X i ( X 1 X 2 .... X n )
n i 1 n
64
Unbiased Estimator For The
Population Variance
If X1, X2, X3 , Xn is a sample of size n taken
from a population with mean and variance 2,
then the unbiased estimator for the population
variance 2 is
n
(X X ) 2
S
2 i 1
n 1
65
Unbiased Estimator For The
Population Variance
The unbiased estimator formulae for the population
variance can have other forms as shown below :
( x x ) 2
x 2
( x ) 2
(a) S 2 (b) S
2 n
n 1 (n 1)
(c ) S 2
x 2
n ( x ) 2
n 1
(c ) x 128, ( x x ) 312,
2
n8
(d ) x 20 21 22 23 24 25
f 4 14 17 26 20 9
Ans: (a) 48.88, 6.9821; (b) 15, 43.1429; (c) 16; 44.5714;
67
(d) 22.7889, 1.8089
Example4.12
Example 4.15 Solution
19159
n
X i
2
n( X ) 2
S 2
i 1
n 1
19159 8(48.875) 2
8 1 68
6.9821
Example4.12
Example 4.15 Solution
X
x Unbiased estimator for
n population variance is
120
S
2 x 2
n ( X ) 2
8 n 1
15
2102 8(15) 2
8 1
43.1429
69
Example4.12
Example 4.15 Solution
X
x Unbiased estimator for
n population variance is
n
128
8
i
( X X ) 2
S2 i 1
16 n 1
312
8 1
44.5714
70
Example4.12
Example 4.15 Solution
71
Unbiased Estimator For The
Population Proportion
In a random sample of size n taken from a
population with a specific character, then the
unbiased estimator for the population proportion
with the specific character p
is k where k is
n
the number of the sample elements with the
specific character.
72
Example 4.13
A research is conducted in a town to find the
number of residents who owns a credit card. Out
of a random sample of 120 residents, 25 of them
own a credit card. What is the unbiased estimator
for the proportion of town residents who owns a
credit card?
Ans: 0.2083 73
Example 4.13 Solution
25
p 0.2083
120
74
Interval Estimation
An interval is constructed around the point estimate so
that the interval is likely to contain the parameter at a
specified probability denoted by (1 ).
The interval is then said to be constructed with respect to
a given confidence level denoted by (1 )100%. The
interval established is called the confidence interval.
Usually,
Usually,1-
1-
Suppose 1- = .95 and that
the estimator has a normal
distribution, then 95% of
the values of the estimator
lie in the interval: Parameter
Parameter1.96SE
1.96SE
75
To Change the Confidence Level
To change to a general confidence level, 1-, pick a
value of z that put the area of 1- in the center of the z
distribution. (Refer to Table 4 Percentage Points of
the Normal Distribution for the za/2 values)
Tail area za/2
0.05 1.6449
2 2 0.025 1.9600
0.01 2.3263
0.005 2.5758
100(1-)%
100(1-)% Confidence
Confidence Interval: Estimator zza/2a/2SE
Interval: Estimator SE
76
Interval Estimation
(a) Confidence interval for the population mean
with known variance 2.
77
Interval Estimation
(d) Interpretation of confidence interval
78
(1-(w
)x1h-
0
%
f C
o
n
ierz)202%d
iS
tC
x e
zo(nX
f
Mean
c
ind3
With known variance,
79
n m
e
a
:
n
aor)
(1- onfidpecz2inteprv1alforpnu3l0ationprotin:
)0%
C
Confidence Interval for the
Population Proportion
80
Example 4.14
A random sample of n = 50 males showed a mean
average daily intake of dairy products equal to 756
grams with a standard deviation of 35 grams. Find
a 95% confidence interval for the population
average .
(1 )100% 95%
1 0.95
0.05
0.025
2
Z 0.025 1.96
82
Example 4.14 Solution
Note:
The interval must be wider to provide for the increased
confidence that does indeed enclose the true value of m.
(1 )100% 99%
1 0.99
0.01
0.005
2
Z 0.005 2.5758
85
Example 4.15 Solution
87
Example 4.16 Solution
k (1 )100% 98%
P
n 1 0.98
104 0.02
p
150
0.01
2
Z 0.01 2.3263
88
(p2.361n)
=
104
150
p Example 4.16
98% confidence interval
for :
2.3263
0.6933 0.0876
104
150
(0.6057, 0.7809)
(1
150
104
150
)
Solution
89
Ans:(8.70,1) Example 4.17
Random samples of 10 fishes are caught from a
pond and its length, in cm, were measured. The
results are as follows: 9.1, 9.1, 11.3, 10.7, 9.8,
10.2, 10.1, 9.7, 9.9 and 9.5. The lengths of fishes
in the pond are normally distributed with variance
4. Find a 95% symmetrical confidence interval
for the mean lengths of all fishes in the pond.
90
Example 4.17 Solution
91
:
x1.96n Example 4.17
95% confidence interval
for
= 9.94 1.96
9.94 1.24
2
10
(8.7004,11.1796)
Solution
92
Example 4.18
Ans:(90.0062,100.9938)
93
(1 )100% 98%
1 0.98
0.02
2
0.01
Z 0.01 2.3263
x=
Example 4.18
for :
2 .36386,x2.36
8
Solution
94
xx2.32.366881,0x2.2.36.(12)8.2,108
Example 4.18 Solution
95
Example 4.18 Solution
S
2
2
(
n
x
n 1
10
x ) 2
z20.152.170
From Table 4 (Percentage Points
of the Normal Distribution),
z z0.005 2.5758
170.8
Solution
100 1
1.7253
98
S1.72531 Example 4.19
10 2.5758
1.3135
10 0.3383
100
(9.6617,10.3383)
Solution
99
Example 4.20
A supervisor wants to know the proportion of
defective electronic components produced by a
machine. He inspects a random sample of 200
electronic components and found that 40
electronic components is defective. Find a
98% symmetrical confidence interval for the
proportion of all electronic components produced
by the machine.
100
Ans: p = 0.2; (b) (0.1342, 0.2658)
Example 4.20 Solution
(0.2)(0.8)
0.2 2.3263
200
0.2 0.0658
(0.1342, 0.2658)
102
Interpretation Of Confidential Interval
Consider the IQ scores of a normal population with
mean = 4.5 and standard deviation = 2.87.
In a study of sampling distribution of the IQ scores,
the mean score of a randomly chosen sample of 40
participants x is found to be 3.975. Thus, a 90%
confidence interval for the mean score is
2.87
x Z 2 3.975 (1.6449
n 40
(3.975 0.7464)
(3.2286, 4.7214) 103
Interpretation Of Confidential Interval
Therefore, we are 90% confident that the
interval (3.2286, 4.7214) will contain the
population mean .
3.2286 4.7214
104
Interpretation Of Confidential Interval
105
Interpretation Of Confidential Interval
Table below shows the 10 samples and the confidence
intervals obtained in a sampling study.
Sample Sample mean 90% Confidence Interval
1 4.64 (3.89, 5.39)
2 4.56 (3.81, 5.31)
3 3.96 (3.21, 4.71)
4 5.12 (4.37, 5.87)
5 4.24 (3.49, 4.99)
6 3.44 (2.69, 4.19)
7 4.60 (3.85, 5.35)
8 4.08 (3.33, 4.83)
9 5.20 (4.45, 5.95)
10 4.88 (4.13, 4.63) 106
Interpretation Of Confidential Interval
Figure below is a line segment representation of the
confidence intervals.
2
n Z
E
2
where the estimation error
E Z
2 n
108
Example 4.21
The income of a fresh graduate of a university is
known to be normally distributed with standard
deviation RM1000. Find the least sample size
that must be taken so that mean income is
estimated with an error within RM500 with a
confidence of 95%.
Ans: n = 16
109
Example 4.21 Solution
2
2
1000
hence, n z 1.96 15.3664
E 500
2
Therefore, the minimum sample size that should be taken is 16 (round
to the next integer).
110
Choosing Appropriate Sample Size
(ii) Sample size to estimate the population proportion.
If an estimate of the population proportion is not
known, use p = .
2
z
n 2 p (1 p )
E
where the estimation error
p (1 p )
Ez
n
2 111
Example 4.22
A study is conducted in a particular school to
determine the proportion of pupils who smokes.
It is found that 8 out of 120 pupils selected at
random smoke. Find the least sample size so
that the difference between the sample
proportion and the population proportion is less
than 0.05 with confidence at 99%.
Ans: n = 166
112
Example 4.22 Solution
z z0.005 2.5758
2
113
Example 4.22 Solution
2
z
The sample size , n 2 p(1 p)
r
2
2.5758
(0.0667)(1 0.0667)
0.05
165.2081 166
114
Example 4.23
An IT company in Johor is interested to know
the proportion of students in the state who own
a computer. Find the least sample size so that
the difference between the sample proportion
and the population proportion is less than 0.1
with a confidence interval of 98%.
Ans: n = 136
115
Example 4.23 Solution
.
For 98% confidence interval , = 0.02
z z0.01 2.3263
2
116
Example 4.23 Solution
2
z
The sample size, n 2
2 E
2
2.3263
2 0.1
135.2918 136
117
118