Notes 2
Notes 2
0.4
0.30
0.5
0.3
0.4
0.20
Probability
Probability
Probability
Probability
0.20
0.3
0.2
0.2
0.10
0.10
0.1
0.1
0.00
0.00
0.0
0.0
0 1 2 3 4 5 0 1 2 3 4 5 0 1 2 3 4 5 0 1 2 3 4 5
Binomial (n = 15, p = 0.1) Binomial (n = 15, p = 0.3) Binomial (n = 15, p = 0.5) Binomial (n = 15, p = 0.8)
0.25
0.20
0.30
0.20
0.15
0.15
0.15
0.20
Probability
Probability
Probability
Probability
0.10
0.10
0.10
0.10
0.05
0.05
0.05
0.00
0.00
0.00
0.00
0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14
10
Example:
Suppose that it is known that 40% of voters support the Conservative party. We take a random
sample of 6 voters. Let the random variable Y represent the number in the sample who support the
Conservative party.
1. Explain why the distribution of Y might be binomial.
2. Write down the probability distribution of Y as a table of probabilities.
3. Find the mean and variance of Y directly from the probability distribution.
4. Check your answers using the standard results E [Y ] = np and Var [Y ] = np(1p).
• If X ∼ Po(λs) then
(λs)x
P [X = x] = e−λs
x!
for k = 0, 1, 2, . . . .
• The expectation and variance of X are given by
E [X] = λs
Var [X] = λs
0.08
0.10
0.20
0.04
0.05
0.10
0.00
0.00
0.00
0 2 4 6 8 10 12 14 16 18 20 0 2 4 6 8 10 12 14 16 18 20 0 2 4 6 8 10 12 14 16 18 20
11
Example: Yeast
Gossett, the head of quality control at Guiness brewery c. 1920 (and discoverer of the t distribution),
arranged for counts of yeast cells to be made in sample vessels of fluid. He found that at a certain stage
of brewing the counts were Po(0.6). Let X be the count from a sample. Find P [X ≤ 3].
(λs)x
e−λs
x!
• Hence the Binomial distribution T ∼ Bin(n, p), can be approximated by the Poisson T ∼ Po(np)
when np is small.
• This approximation is good if n ≥ 20 and p ≤ 0.05, and excellent if n ≥ 100 and np ≤ 10.
12
The Normal Distribution
N(µ, σ) N(µ, σ)
0.4
1.0
0.8
0.3
0.6
0.2
0.4
0.1
0.2
0.0
0.0
µ − 3σ µ − 2σ µ−σ µ µ+σ µ + 2σ µ + 3σ −3 −2 −1 0 1 2 3
• Unfortunately, there is no neat expression for Φ(z), so in practice we must rely on tables (or
computers) to calculate probabilities.
Example
6.2 Standardisation
X−µ
• If X ∼ N(µ, σ 2 ), then Z = σ is the standardized version of X, and Z ∼ N(0, 1).
• Even more importantly, the distribution function for any normal rv X is given by
� �
x−µ
FX (x) = Φ ,
σ
and so the cumulative probabilities for any normal rv X can be expressed as probabilities of the
standard normal Z.
• This is why only the standard Normal distribution is tabulated.
13
Example
E [X] = µ,
Var [X] = σ 2 .
• If X and Y are Normally distributed then the sum S = X + Y is also Normally distributed
(regardless of whether X and Y are independent).
6.4 Interpolation
Interpolation
• Normal distribution tables are limited and only give us values of Φ(Z) for a fixed number of Z.
• Often, we want to know Φ(Z) for values of Z in between those listed in the tables.
• To do this we use linear interpolation - suppose we are interested in Φ(b), where b ∈ [a, c] and we
know Φ(a) and Φ(c).
• If we draw a straight line connecting Φ(a) and Φ(c) then (since Φ is smooth) we would expect Φ(b)
to lie close to that line. Then
� �
b−a
Φ(b) � Φ(a) + (Φ(c) − Φ(a))
c−a
Example
• Estimate the value of Φ(0.53) by interpolating between Φ(0.5) and Φ(0.6).
14
6.5 Normal Approximation to the Binomial
Binomial (20,0.1) approximated by Normal (2,1.8) Binomial (16,0.5) approximated by Normal (8,4)
0.25
0.4
0.20
0.3
0.15
Probability
Probability
0.2
0.10
0.1
0.05
0.00
0.0
0 1 2 3 4 5 6 7 8 9 11 13 15 17 19 0 1 2 3 4 5 6 7 8 9 10 12 14 16
• Regardless of p, the Bin(n, p) histogram approaches the shape of the normal distribution as n
increases. (This is actually a consequence of the strong law of large numbers; without going into
more detail, the strong law simply says that certain distributions, under certain circumstances,
converge to the normal distribution.)
• We can approximate the binomial distribution by a Normal distribution with the same mean and
variance:
Bin(n, p) is approximately N(np, np(1 − p))
np ≥ 10 and n(1 − p) ≥ 10
P (X ≤ k) � P (X � ≤ k + 1/2)
P (k1 ≤ X ≤ k2 ) � P (k1 − 1/2 ≤ X � ≤ k2 + 1/2)
15