Lecture 7 & 8 Brief Lecture Notes On Probability Distributions: Binomial, Poisson and Normal Distribution
Lecture 7 & 8 Brief Lecture Notes On Probability Distributions: Binomial, Poisson and Normal Distribution
Lecture 7 & 8 Brief Lecture Notes On Probability Distributions: Binomial, Poisson and Normal Distribution
Bernoulli distribution
Our approach begins by first developing the Bernoulli model, which is a
building block for the Binomial. We consider a random experiment that can give
rise to just two possible mutually exclusive and collectively outcomes, which
for convenience we will label “success” and “failure”.
Let p denote the probability of success, so that the probability of failure is (1 –
p). Now define the random variable X so that X takes the value 1 if the outcome
of the experiment is success and 0 otherwise. The probability function of this
random variable is then
P(X=0) = (1 – p) and P(X=1) = p
This distribution is known as Bernoulli distribution.
(0 p ) 2 (1 p) (1 p ) 2 p p (1 p )
1
distribution. Its probability distribution function for the binomial random
variable X = x is
P(x Success in n Independent Trials) = P(x)
n!
P x (1 P ) ( n x ) for x = 0, 1, …….,n
x!(n x)!
and variance
2 E[( X ) 2 ] nP(1 P )
Solution:
a. P(At most 1 sale) = P ( X 1) P ( X 0) P ( X 1).
5!
P(X=0) = (0.4) 0 (0.6) 5 0.078
0!5!
5!
P ( X 1) (0.4)1 (0.6) 4 5(0.4)(0.6) 4 0.259
1!4!
b. P( 2 X 4) P ( X 2) P( X 3) P( X 4)
5!
P ( X 2) (0.4) 2 (0.6) 3 10(0.4) 2 (0.6) 3 0.346
2!3!
5!
P ( X 3) (0.4) 3 (0.6) 2 10(0.4) 3 (0.6) 2 0.230
3!2!
2
5!
P ( X 4) (0.4) 4 (0.6)1 5(0.4) 4 (0.6)1 0.077
4!1!
X P(X=x)
0 0.078
1 0.259
2 0.346
3 0.230
4 0.077
5 0.010
Comments:
This shape is typical for binomial probability when P is neither very large
nor very small.
At least extremes (0 or 5 sales), the probabilities are quite small.
3
a. What is the probability that at most 6 students will enroll if the college offers
admission to 10 more students?
b. What is the probability that more than 12 will actually enroll if admission
offered to 20 students?
c. If 70% of those students admitted actually enroll, what is the probability that
at least 12 out of 15 students will actually enroll?
Solution:
a. This probability can be obtained using the cumulative binomial probability
distribution from Table 3 in the Appendix. The probability of at most 6
students enrolling if n = 10 and P = 0.40 is
P ( X 6 / n 10, P 0.40) 0.945
c. The probability that at least 12 out of 15 students enroll is the same as the
probability that at most 3 out of 15 students do not enroll (the probability of a
student not enrolling is 1 – 0.70 = 0.30).
P ( X 12 / n 15, P 0.70) P ( X 3 / n 15, P 0.30) 0.297
q 1 p .90 .
4
5!
P(X=0; 5, 0.10) = (0.10) 0 (0.90) 5 0 0.5905
0! 5!
(ii) The probability that there will be one defective T-shirt is given by
5!
P(X=1; 5, 0.10) = (0.10)1 (0.90) 51 0..3281
1! 4!
(iv) The probability that there will be at least 3 defective T-shirts is given by
P X 3 1 P X 0 P X 1 P X 2
1 P X 0 P X 1 P X 2
5
2. The number of replacement orders for a part received by a firm in a given
month.
3. The number of dents, scratches, or other defects in a large roll of sheet
metal used to manufacture filters.
We can derive the equation for computing Poisson probabilities directly from
the binomial probability distribution by taking the mathematical limits as
P 0 and n . With these limits the parameter nP is a constant that
specifies the average number of occurrences (success) for a particular time
and/or space.
6
(vii) Number of missing luggage of a particular Airlines
(viii) Number of telephone calls received by a particular telephone exchange in
some units of time
(ix) Number of days that the universities remain closed due to political crises
(x) Number of customers arriving in a particular shop in some units of time
(xi) Number of crimes that are happening in any particular city in a day
Where,
P(x) = the probability of x successes over a given time or space, given
= the expected number of successes per time or space unit: > 0
7
failure. Also assume the first failure does not affect the probability of a second
failure (in some cases, these assumptions may not hold, and more complex
distributions would be used).
From past experience the expected number of failures per day is 3/100, or
0.03 .
=1-0.970446
1 0.996185 0.003815
8
e 2 2 0
P( X 0) e 2 0.1353
0!
e 2 2 1
P ( X 1) 2e 2 0.2707
1!
e 2 2 2
P( X 2) 2e 2 0.2707
2!
e4.2 (4.2)0
Now P ( X 0) 0.015
0!
e4.2 (4.2)1
P ( X 0) 0.063
1!
e4.2 (4.2) 2
P ( X 0) 0.132
2!
P ( X �3) 1 P( X 3) 1 [0.015 0.063 0.132]
1 .021 0.79
Example: In a case study it is found that 2.5% small companies are reported for
bankruptcy in the year 2012. A random sample of 200 companies is selected.
Find the probability that (i) at least 3 companies are reported for bankruptcy, (ii)
exactly 3 companies are reported for bankruptcy, (iii) at best 3 companies are
reported for bankruptcy.
9
Solution:
Let X is a random variable which indicates the no. of companies that are
reported for bankruptcy and follows the Poisson distribution.
Here P 0.025 , np 200 0.025 5
1 P X 0 P X 1 P X 2
=0.8753
(ii) Probability that exactly 3 companies are reported for bankruptcy
e 5 5 3
P ( X 3) 0.1404
3!
0.2650
10
2. The area under the probability density function, f(x), over all values of the
random variable, X, is equal to 1.0.
3. Suppose that this density function is graphed. Let a and b be two possible
values of random variable X, with a < b. Then the probability that X lies
between a and b is the area under the density function between these points.
4. The cumulative distribution function, F( x0 ), is the area under the probability
density function , f(x), up to x0 :
x0
F ( x0 ) f ( x)dx
xm
where and 2 are any numbers such that and 0 2 and e and
are physical constants, e = 2.71828…. and 3.14159.......
11
Properties of the Normal distribution
Suppose that the random variable X follows a normal distribution with
parameters and 2 . Then the following properties hold:
1. The mean of the random variable is :
E( X )
f(x)
For our applied statistical analyses the normal distribution has a number of
important characteristics. It is symmetric. Different central tendencies are
indicated by differences in . In contrast, differences in 2 result in density
functions of different widths. By selecting values for and 2 we can define a
large family of normal probability density functions. Differences in the mean
result in shifts of entire distributions. In contrast, differences in the variance
result in distributions with different widths.
12
Figure: Effects of and 2 on the probability density function of a Normal
random variable.
a. Two normal distributions with different means.
b. Two normal distributions with different variances and mean = 5.
f(x) f(x)
Variance = 1
x x
(a) (b)
This is the area under the normal probability density function to the left of x 0.
As for any proper density function, the total area under the curve is 1; that is,
F ( ) 1
Figure: The shaded area is the probability that X does not exceed x 0 for a
normal random variable.
f(x)
13
x
The probability is the area under the corresponding probability density function
between a and b, as shown in the following Figure.
Figure: Normal density function with the shaded area indicating the probability
that X is between a and b.
f(x)
x
a b
14
We can obtain probabilities for any normally distributed random variable by
first converting the random variable to the standard normally distributed random
variable, Z. There is always a direct relationship between any normally
distributed random variable and Z. That relationship uses the transformation
X
Z
This important result allows us to use the standard normal table to compute
probabilities associated with any normally distributed random variable. Now let
us see how probabilities can be computed for the standard normal Z.
The cumulative distribution function of the standard normal distribution is
tabulated in Table 1 in the Appendix. This table gives values of
F(z) = P(Z z )
for non-negative values of z. For example, the cumulative probability for a Z
value of 1.25 from Table 1 is
F(1.25) = 0.8944
This is the area, designated in the following Figure, for Z less than 1.25.
Because of the symmetry of the normal distribution, the probability that Z >
-1.25 is also equal to .8944. In general, values of the cumulative distribution
function for negative values of Z can be inferred using the symmetry of the
probability density function.
To find the cumulative probability for a negative Z (for example, Z = -1.0)
defined as
F(-Z0 ) = P(Z z 0 ) F (1.0)
we use the complement of the probability for Z = +1, as shown in the following
Figure.
Figure: Standard Normal Distribution for Negative Z equal to –1.
15
F(-1) =0.1587 1 –F(z) = 1 –0.1587 = 0.8413
Example:
The life time of a electric bulbs manufactured by a company A has a mean of
720 hours and standard deviation is of 10 hours. We are assuming that the
distribution of life times of electric bulbs is normal.
(i) What is the probability that the life time of a bulb will be less than or equal
to 740 hours?
(ii) What is the probability that the life time of a bulb will be greater than or
equal to 740 hours?
(iii) What is the probability that the life time of a bulb lies between 730 hours
and 740 hours?
Solution:
Let X is a random variable which indicates the life times of bulbs, and which is
normally distributed with mean 720 hours and standard deviation is 10 hours.
(i) The probability that the life time of a bulb will be less than or equal to 740
hours is given by:
X 740 720
P X 740 P P Z 2 0.50 0.4772 0.9772
10
(ii) The probability that the life time of a bulb will be greater than or equal to
740 hours is given by:
X 740 720
P X 740 P P Z 2 0.50 P 0 Z 2 0.50 .4772 0.0228
10
(iii) The probability that the life time of a bulb lies between 730 hours and 740
hours is given by:
730 720 X 740 720
P 730 X 740 P P1 Z 2 P 0 Z 2 P 0 Z 1
10 10
0.4772 0.3413
0.1359
16
1. A recent study of the hourly wages of maintenance crews for major airlines
showed that the mean hourly wages is $20 with standard deviation is $2. Let
hourly wages of crews is normally distributed. What is the probability that a
crew earns: (i) more than or equal to $ 25 per hour, (ii) between $25 and $30 per
hour, (iii) less than or equal to $30.
17