Chapter 3
Chapter 3
Contents 1
3 Chapter 3 Probability 2
3.1 Events, Sample Spaces, and Probability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
3.2 Unions and Intersections . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.3 Complementary Events . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
3.4 The Additive Rule and Mutually Exclusive Events . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.5 Conditional Probability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.6 The Multiplicative Rule and Independent Events . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.7 Some Additional Counting Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
3.8 Bayes’s Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1
Chapter 3
Chapter 3 Probability
In this chapter, we assume that the population is known and calculate the chances of obtaining various samples from the
population. Thus, we show that probability is the reverse of statistics: In probability, we use information about the population
to infer the probable nature of the sample.
Suppose a coin is tossed once and the up face is recorded. The result we see is called an observation, or measurement, and
An experiment is an act or process of observation that leads to a single outcome that cannot be predicted with certainty.
Question
The sample space of an experiment is the collection of all its sample points which can be represented in set notation as
The following tree diagram shows all possible outcomes for tossing two coins:
2
CHAPTER 3. CHAPTER 3 PROBABILITY 3
Venn diagram is a graphical representation for a sample space as a closed figure, labeled S, containing all possible
sample points.
Figure 3.2: Venn diagrams for the Three Experiments from Table 3-1
The probability of a sample point is a number between 0 and 1 that measures the likelihood that the outcome will occur
when the experiment is performed. This number is usually taken to be the relative frequency of the occurrence of a sample
For example, consider tossing a balanced coin a very large number of times. The following figure supports our reasoning.
2. The probabilities of all the sample points within a sample space must sum to 1
P
(i.e., pi = 1)
Event
Probability of an Event
The probability of an event A is calculated by summing the probabilities of the sample points in the sample space for A.
CHAPTER 3. CHAPTER 3 PROBABILITY 4
1. Define the experiment; that is, describe the process used to make an observation and the type of observation that will
be recorded.
5. Sum the sample point probabilities to get the probability of the event.
Example 3.1.1 A fair die is tossed, and the up face is observed. If the face is even, you win $1. Otherwise, you lose $1.
On the basis of this notion of probability, in the long run, you will win $1 half the time and lose $1 half the time.
Recall that the sample space for this experiment contains six sample points:
S : {1, 2, 3, 4, 5, 6}
1
Since the die is balanced, we assign a probability of 6 to each of the sample points in this sample space.
A : {2, 4, 6}
Since the event A contains three sample points— all with probability 61 —and since no sample points can occur simulta-
neously, we reason that the probability of A is the sum of the probabilities of the sample points in A. Thus, the probability
1 1 1
of A is 6 + 6 + 6 = 21 .
Example 3.1.2 Consider the experiment of tossing two unbalanced coins. Because the coins are not balanced, their outcomes
(H or T) are not equiprobable. Suppose the correct probabilities associated with the sample points are given in the accompanying
table. [Note: The necessary properties for assigning probabilities to sample points are satisfied.] Consider the events
or:
2 2 4
p(A) = p(Observe HT) + p(Observe TH) = + =
9 9 9
Similarly,
4 2 2 8
p(B) = + + =
9 9 9 9
When the sample points run into the thousands or millions, one method of determining the number of sample points for
a complex experiment is to develop a counting system. Start by examining a simple version of the experiment. A second
method of determining the number of sample points for an experiment is to use combinatorial mathematics.
Combinations Rule
Suppose a sample of n elements is to be drawn without replacenent from a set of N elements. Then the number of
N N!
n = n!(N −n)!
Where: n! = n × (n − 1) × (n − 2) × ...3 × 2 × 1 (n factorial) and 0! = 1.
Question
How many possible simple random samples of size 3 (without replacement), can we have from a population of size 6?
CHAPTER 3. CHAPTER 3 PROBABILITY 6
An event can often be viewed as a composition of two or more other events. Such events, which are called compound events,
The union of two events A and B is the event that occurs if either A or B (or both) occurs on a single performance of
the experiment. We denote the union of events A and B by the symbol A ∪ B. A ∪ B consists of all the sample points that
The intersection of two events A and B is the event that occurs if both A and B occur on a single performance of the
experiment. We write A ∩ B for the intersection of A and B. A ∩ B consists of all the sample points belonging to both A and
Question
Family Planning Perspectives reported on a study of over 200,000 births in New Jersey over a recent two-year period.
The study investigated the link between the mother’s race and the age at which she gave birth (called maternal age). The
percentages of the total number of births in New Jersey, by the maternal age and race classifications, are given in the following
table.
Tow-Waye Table: Percentage of New Jersey Birth Mothers, by Age and Race
Race
Maternal White Black
Age (years)
≤ 17 2% 2%
18-19 3% 2%
20-29 41% 12%
≥ 30 33% 5%
A : {A New Jersey birth mother is white.} B : {A New Jersey mother was a teenager when giving birth.}
complementary events:
The complement of an event A is the event that A does not occur—that is, the event consisting of all sample points that
Note from the figure that all sample points in S are included in either A or Ac and that no sample point is in both A and
Ac .
Rule of Complements
P (A) + P (Ac ) = 1
Example 3.3.1 Consider the experiment of tossing fair coins. Define the following event:
a. When 2 coins are tossed, we know that the event A : {Observing at least one head.} consists of the sample points
then
1 1 1 3
p(A) = + + =
4 4 4 4
1
Ac : {TTTTTTTTTT} and P (Ac ) = 1024 (why?)
The probability of the union of events A and B is the sum of the probability of event A and the probability of event B,
P (A ∪ B) = P (A) + P (B) − P (A ∩ B)
Example 3.4.1 Hospital records show that 12% of all patients are admitted for surgical treatment, 16% are admitted for
obstetrics, and 2% receive both obstetrics and surgical treatment. If a new patient is admitted to the hospital, what is the
probability that the patient will be admitted for surgery, for obstetrics, or for both?
and the probability of the event that a patient receives both obstetrics and surgical treatment is: P (A ∩ B) = 0.02
The event that a patient admitted to the hospital receives either surgical treatment, obstetrics treatment, or both is the
union (A ∪ B), the probability of which is given by the additive rule of probability:
P (A ∪ B) = P (A) + P (B) − P (A ∩ B)
Thus, 26% of all patients admitted to the hospital receive either surgical treatment, ob- stetrics treatment, or both.
Events A and B are mutually exclusive if A ∩ B contains no sample points—that is, if A and B have no sample points in
P (A ∩ B) = 0
If two events A and B are mutually exclusive, the probability of the union of A and B equals the sum of the probability
P (A ∪ B) = P (A) + P (B)
Often, we have additional knowledge that might affect the outcome of an experiment, so we may need to alter the probability
of an event of interest. A probability that reflects such additional knowledge is called the conditional probability of the event.
Question
Consider a family that have 3 children. calculate the probability of the following events:
b. All of the children are boys, but we are given the information that the third children is a boy.
To find the conditional probability that event A occurs given that event B occurs, divide the probability that both A and
P (A ∩ B
P (A|B) =
P (B)
When P (B) 6= 0.
CHAPTER 3. CHAPTER 3 PROBABILITY 10
Example 3.5.1 Many medical researchers have conducted experiments to examine the relationship between cigarette smoking
and cancer. Consider an individual randomly selected from the adult male population. Let A represent the event that the
individual smokes, and let Ac denote the complement of A (the event that the individual does not smoke). Similarly, let B
represent the event that the individual develops cancer, and let B c be the complement of that event. Then the four sample
points associated with the experiment are shown in Figure 3.6, and their probabilities for a certain section of the United States
are given in following table. Use these sample point probabilities to examine the relationship between smoking and cancer.
Note
The example above is an observational study. Observational studies can be used to identify risk facotrs not for claiming
causation.
Services (EMS) ability to meet the demand for an ambulance. In one example, the researchers presented the following
scenario. An ambulance station has one vehicle and two demand locations, A and B. The probability that the ambulance can
travel to a location in under eight minutes is 0.58 for location A and 0.42 for location B. The probability that the ambulance
a. Find the probability that EMS can meet demand for an ambulance at location A.
b. Find the probability that EMS can meet demand for an ambulance at location B.
Independent Events
Events A and B are independent events if the occurrence of B does not alter the probability that A has occurred; that is,
P (A|B) = P (A)
P (B|A) = P (B)
If events A and B are independent, then the probability of the intersection of A and B equals the product of the
P (A ∩ B) = P (A)P (B)
The converse is also true: If P (A ∩ B) = P (A)P (B), then events A and B are independent.
Example 3.6.2 Consider the experiment of tossing a fair die, and let
Mutually exclusive events are dependent events and can not occur simultaneously.
P (A|B) = P (B|A) = P (A ∩ B) = 0
You have k sets of elements, n1 in the first set, n2 in the second set, ..., and nk in the kth set. Suppose you wish to form
a sample of k elements by taking one element from each of the k sets. Then the number of different samples that can be
n1n2n3...nk
Permutations Rule
Given a single set of N different elements, you wish to select n elements from the N and arrange them within n positions.
The number of different permutations of the N elements taken n at a time is denoted by PnN and is equal to
N!
PnN = (N −n)!
Partitions Rule
Suppose you wish to partition a single set of N different elements into k sets, with the first set containing n1 elements,
the second containing n2 elements, ..., and the kth set containing nk elements. Then the number of different partitions is
N!
n1 !n2 !n3 !...nk !
Where n1 + n2 + n3 + ... + nk = N .
Exercises :
2- Suppose we have 3 positions and 7 candidates. How many ways we can fill these positions? (each position can be filled
3- Selecting project teams. Suppose you are managing 10 employees and you need to form three teams to work on different
projects. Assume that each employee may serve on any team. In how many different ways can the teams be formed if the
Bayes’s Rule
CHAPTER 3. CHAPTER 3 PROBABILITY 13
Given k mutually exclusive and exhaustive events, B1 , B2 , ..., Bk such that P (B1 ) + P (B2 ) + P (B3 ) + ... + P (Bk ) = 1
P (Bi ∩ A)
P (Bi |A) =
P (A)
P (Bi )P (A|Bi )
=
P (B1 )P (A|B1 ) + P (B2 )P (A|B2 ) + ... + P (Bk )P (A|Bk )
Example 3.8.1 An unmanned monitoring system uses high-tech video equipment and microprocessors to detect intruders.
A prototype system has been developed and is in use outdoors at a weapons munitions plant. The system is designed to detect
intruders with a probability of 0.9. However, the design engineers expect this probability to vary with the weather. The system
automatically records the weather condition each time an intruder is detected. On the basis of a series of controlled tests in
which an intruder was released at the plant under various weather conditions, the following information is available: Given
that the intruder was, in fact, detected by the system, the weather was clear 75% of the time, cloudy 20% of the time, and
raining 5% of the time. When the system failed to detect the intruder, 60% of the days were clear, 30% cloudy, and 10%
rainy. Use this information to find the probability of detecting an intruder, given rainy weather. (Assume that an intruder
has been released at the plant.) Could you find the probability of detecting an intruder, given cloudy weather?
P (D)P (Rainy|D)
P (D|Rainy) =
P (D)P (Rainy|D) + P (Dc )P (Rainy|Dc )
0.05 × 0.9
= = 0.818
0.05 × 0.9 + 0.1 × 0.1
Question
HIV testing and false positives. Bayes’s rule was applied to the problem of HIV testing in The American Statistician
(Aug. 2008). In North America, the probability of a person having HIV is .008. A test for HIV yields either a positive or
negative result. Given that a person has HIV, the probability of a positive test result is .99. (This probability is called the
sensitivity of the test.) Given that a person does not have HIV, the probability of a negative test result is also .99. (This
probability is called the specificity of the test.) The authors of the article are interested in the probability that a person
actually has HIV given that the test is positive. Find the probability of interest for a North American by using Bayes’s rule.
The core content of the slides are from the textbook of this course;
by