Introduction To Statistics and Data Analysis
Introduction To Statistics and Data Analysis
Example:
A sample is to be chosen to answer certain
questions regarding political preferences in a
certain state in the United States. The
sample involves the choice of 1000 families,
and a survey is to be conducted. Now,
suppose it turns out that random sampling is
not used. Rather, all or nearly all of the 1000
families chosen live in an urban setting. It is
believed that political preferences in rural
areas differ from those in urban areas.
The sample along with inferential
statistics allows us to draw conclusions Implication:
about the population, with inferential The sample drawn actually confined
statistics making clear use of elements of the population and thus the inferences need
probability. to be confined to the “limited population”.
The sample of size 1000 described here is
often referred to as a biased sample.
Strata
Sampling units are not homogeneous
and naturally divide themselves into
nonoverlapping groups that are
homogeneous.
Elements in probability allow us to Stratified random sampling
draw conclusions about characteristics of
Example:
A sample survey is conducted in
order to gather preliminary opinions
regarding a bond referendum that is being
considered in a certain city. The city is
subdivided into several ethnic groups which
represent natural strata.
Implication:
In order not to disregard or
overrepresent any group, separate random
samples of families could be chosen from
each group.
2. Experimental Design
The concept of randomness or
random assignment plays a huge role in the
area of experimental design.
Example:
We may record the number of accidents that occur monthly at the intersection of Driftwood Lane
and Royal Oak Drive, hoping to justify the installation of a traffic light; we might classify items
coming off an assembly line as “defective” or “non-defective”; or we may be interested in the
volume of gas released in a chemical reaction when the concentration of an acid is varied.
Hence, the statistician is often dealing with either numerical data, representing counts or
measurements, or categorical data, which can be classified according to some criterion.
We shall refer to any recording of information, whether it be numerical or categorical, as
an observation.
Example:
The numbers 2, 0, 1, and 2, representing the number of accidents that occurred for each month
from January through April during the past year at the intersection of Driftwood Lane and Royal
Oak Drive, constitute a set of observations. Similarly, the categorical data N, D, N, N, and D,
representing the items found to be defective or non-defective when five items are inspected, are
recorded as observations.
Statisticians use the word experiment to describe any process that generates a set of data.
Example:
A simple example of a statistical experiment is the tossing of a coin. In this experiment,
there are only two possible outcomes, heads or tails.
Another experiment might be the launching of a missile and observing of its velocity at
specified times.
The opinions of voters concerning a new sales tax can also be considered as observations
of an experiment.
Sample space
The set of all possible outcomes of a statistical experiment and is represented by the
symbol S.
Example 1.1:
Consider the experiment of tossing a die. If we are interested in the number that shows on
the top face, the sample space is
𝑆1 = {1, 2, 3, 4, 5, 6}.
If we are interested only in whether the number is even or odd, the sample space is
simply
𝑆2 = {𝑒𝑣𝑒𝑛, 𝑜𝑑𝑑}.
Example 1.2:
Suppose that three items are selected at random from a manufacturing process. Each item is
inspected and classified defective, D, or non-defective, N. To list the elements of the sample
space providing the most information, we construct the tree diagram.
Sample spaces with a large or infinite number of sample points are best described by a
statement or rule method.
For example, if the possible outcomes of an experiment are the set of cities in the world with a
population over 1 million, our sample space is written
𝑆 = {𝑥 | 𝑥 𝑖𝑠 𝑎 𝑐𝑖𝑡𝑦 𝑤𝑖𝑡ℎ 𝑎 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑜𝑣𝑒𝑟 1 𝑚𝑖𝑙𝑙𝑖𝑜𝑛}
1.2 Events
A subset of a sample space. For any given experiment, we may be interested in the
occurrence of certain events rather than in the occurrence of a specific element in the sample
space.
For instance, we may be interested in the event A that the outcome when a die is tossed is
divisible by 3. The outcome is an element of the subset 𝐴 = {3, 6} of the sample space 𝑆" in the
die example.
We may be interested in the event B that the number of defectives is greater than 1 in
Example 1.1. This will occur if the outcome is an element of the subset of the sample space S.
The complement of an event A with respect to S is the subset of all elements of S that are
not in A. We denote the complement of A by the symbol 𝐴′.
Consider an experiment where the smoking habits of the employees of a manufacturing
firm are recorded. A possible sample space might classify an individual as a nonsmoker, a light
smoker, a moderate smoker, or a heavy smoker. Let the subset of smokers be some event. Then
all the nonsmokers correspond to a different event, also a subset of S, which is called the
complement of the set of smokers.
Example 1.4.
Let R be the event that a red card is selected from an ordinary deck of 52 playing cards,
and let S be the entire deck. Then R is the event that the card selected from the deck is not a red
card but a black card.
The intersection of two events A and B, denoted by the symbol A ∩ B, is the event
containing all elements that are common to A and B.
Example 1.5.
1. In the tossing of a die we might let A be the event that an even number occurs and B
the event that a number greater than 3 shows. Then the subsets A = {2, 4, 6} and B = {4, 5, 6}
are subsets of the same sample space S = {1, 2, 3, 4, 5, 6}.
Two events A and B are mutually exclusive, or disjoint, if A ∩ B = φ, that is, if A and B
have no elements in common.
Example 1.6.
1. Let V = {a, e, i, o, u} and C = {l, r, s, t}; then it follows that V ∩ C = φ. That is, V and
C have no elements in common and, therefore, cannot both simultaneously occur.
2. A cable television company offers programs on eight different channels, three of which
are affiliated with ABC, two with NBC, and one with CBS. The other two are an educational
channel and the ESPN sports channel. Suppose that a person subscribing to this service turns on
a television set without first selecting the channel. Let A be the event that the program belongs to
the NBC network and B the event that it belongs to the CBS network. Since a television program
cannot belong to more than one network, the events A and B have no programs in common.
Therefore, the intersection A ∩ B contains no programs, and consequently the events A and B
are mutually exclusive
The union of the two events A and B, denoted by the symbol A∪B, is the event containing
all the elements that belong to A or B or both.
Example 1.7.
1. In the die-tossing experiment, if A = {2, 4, 6} and B = {4, 5, 6}, we might be
interested in either A or B occurring or both A and B occurring. Such an event, called the union
of A and B, will occur if the outcome is an element of the subset {2, 4, 5, 6}.
2. Let A = {a, b, c} and B = {b, c, d, e}; then A ∪ B = {a, b, c, d, e}.
3. Let P be the event that an employee selected at random from an oil drilling company
smokes cigarettes. Let Q be the event that the employee selected drinks alcoholic beverages.
Then the event P ∪ Q is the set of all employees who either drink or smoke or do both.
4. If M = {x | 3 <x< 9} and N = {y | 5 <y< 12}, then M ∪ N = {z | 3 <z< 12}.
The relationship between events and the corresponding sample space can be illustrated
graphically by means of Venn diagrams.
𝐴 ∩ 𝐵 = 𝑟𝑒𝑔𝑖𝑜𝑛𝑠 1 𝑎𝑛𝑑 2,
𝐵 ∩ 𝐶 = 𝑟𝑒𝑔𝑖𝑜𝑛𝑠 1 𝑎𝑛𝑑 3,
𝐴 ∪ 𝐶 = 𝑟𝑒𝑔𝑖𝑜𝑛𝑠 1, 2, 3, 4, 5, 𝑎𝑛𝑑 7,
𝐵 ∩ 𝐴 = 𝑟𝑒𝑔𝑖𝑜𝑛𝑠 4 𝑎𝑛𝑑 7,
𝐴 ∩ 𝐵 ∩ 𝐶 = 𝑟𝑒𝑔𝑖𝑜𝑛 1,
(𝐴 ∪ 𝐵) ∩ 𝐶 = 𝑟𝑒𝑔𝑖𝑜𝑛𝑠 2, 6, 𝑎𝑛𝑑 7,
A situation where we select a card at random from an ordinary deck of 52 playing cards and
observe whether the following events occur:
𝐴: the card is red,
𝐵: the card is the jack, queen, or king of diamonds,
𝐶: the card is an ace.
1. A ∩ 𝜙 = 𝜙. 6. 𝜙’ = S.
2. A ∪ 𝜙 = A. 7. (A ) = A.
3. A ∩ A = φ. 8. (A ∩ B) = A ∪ B
4. A ∪ A = S. 9. (A ∪ B) = A ∩ B
5. S = 𝜙.
Compiled and prepared by: ENGR. K.T. CABANLIG
1.3 Counting Sample Points
One of the problems that the statistician must consider and attempt to evaluate is the
element of chance associated with the occurrence of certain events when an experiment is
performed.
If an operation can be performed in n1 ways, and if for each of these ways a second
operation can be performed in n2 ways, then the two operations can be performed together in
𝑛" 𝑛! ways.
Example 1.9
How many sample points are there in the sample space when a pair of dice is thrown once?
Solution:
n1 = 6 ways.
n2 = 6 ways.
𝑛" 𝑛! = (6)(6) = 36 possible ways.
Example 1.10
A developer of a new subdivision offers prospective home buyers a choice of Tudor,
rustic, colonial, and traditional exterior styling in ranch, two-story, and split-level floor plans. In
how many different ways can a buyer order one of these homes?
Example 1.11
If a 22-member club needs to elect a chair and a treasurer, how many different ways can
these two to be elected?
Solution:
For the chair position, there are 22 total possibilities
There are 21 possibilities to elect the treasurer
Multiplication rule:
𝑛" × 𝑛! = 22 × 21 = 462 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑡 𝑤𝑎𝑦𝑠
Example 1.12
Sam is going to assemble a computer by himself. He has the choice of chips from two brands, a
hard drive from four, memory from three, and an accessory bundle from five local stores. How
many different ways can Sam order the parts?
Solution:
𝑛" = 2, 𝑛! = 4, 𝑛# = 3, 𝑎𝑛𝑑 𝑛% = 5
𝑛" × 𝑛! × 𝑛# × 𝑛% = 2 × 4 × 3 × 5 = 120
Example 1.12
Consider the three letters a, b, and c. The possible permutations are abc, acb, bac, bca, cab, and
cba. Thus, we see that there are 6 distinct arrangements.
𝑛" 𝑛! 𝑛# = (3)(2)(1) = 6 𝑝𝑒𝑟𝑚𝑢𝑡𝑎𝑡𝑖𝑜𝑛𝑠
Example 1.13
In one year, three awards (research, teaching, and service) will be given to a class of 25
graduate students in a statistics department. If each student can receive at most one award, how
many possible selections are there?
Solution:
!-! !-!
25𝑃# = = 25! 22! = !!! = (25)(24)(23) = 13, 800.
(!-+#)!
Example 1.14
A president and a treasurer are to be chosen from a student club consisting of 50 people.
How many different choices of officers are possible if
(a) there are no restrictions
Solution:
-.!
50𝑃! = = (50)(49)
%/!
Circular permutations
Permutations that occur by arranging objects in a circle
Theorem 3: The number of permutations of n objects arranged in a circle is (n − 1)!
If the letters b and c are both equal to x, then the 6 permutations of the letters a, b, and c
become axx, axx, xax, xax, xxa, and xxa, of which only 3 are distinct. Therefore, with 3 letters, 2
Theorem 4: The number of distinct permutations of 𝒏 things of which 𝒏𝟏 are of one kind, 𝒏𝟐
of a second kind, ... , 𝒏𝒌 of a 𝒌𝒕𝒉 kind is
𝑛!
𝑛" ! 𝑛! ! … 𝑛$ !
Example 1.15
In a college football training session, the defensive coordinator needs to have 10 players
standing in a row. Among these 10 players, there are 1 freshman, 2 sophomores, 4 juniors, and 3
seniors. How many different ways can they be arranged in a row if only their class level will be
distinguished?
Solution:
10!
= 12,600
1! 2! 3! 4!
Often, we are concerned with the number of ways of partitioning a set of n objects into r
subsets called cells.
Theorem 5: The number of ways of partitioning a set of n objects into r cells with n1 elements
in the first cell, n2 elements in the second, and so forth, is
where 𝒏𝟏 + 𝒏𝟐 + ··· + 𝒏𝒓 = 𝒏.
Example 1.16
In how many ways can 7 graduate students be assigned to 1 triple and 2 double hotel
rooms during a conference?
Solution:
We are interested in the number of ways of selecting r objects from n without regard to
order. These selections are called combinations. combination is actually a partition with two
cells, the one cell containing the r objects selected and the other cell containing the (n −r) objects
that are left. The number of such combinations, denoted by
Solution:
The number of ways of selecting 3 cartridges from 10 is
Example 1.17
How many different letter arrangements can be made from the letters in the word STATISTICS?
The probability of an event A is the sum of the weights of all sample points in A. Therefore,
0 ≤ 𝑃(𝐴) ≤ 1, 𝑃(𝜙) = 0, 𝑎𝑛𝑑 𝑃(𝑆) = 1.
Furthermore, if A1, A2, A3, ... is a sequence of mutually exclusive events, then
𝑃(𝐴" ∪ 𝐴! ∪ 𝐴# ∪···) = 𝑃(𝐴" ) + 𝑃(𝐴! ) + 𝑃(𝐴# ) + ··· .
Example 1.18
A coin is tossed twice. What is the probability that at least 1 head occurs?
Solution:
𝑆 = {𝐻𝐻, 𝐻𝑇, 𝑇𝐻, 𝑇𝑇}.
𝐴 = {𝐻𝐻, 𝐻𝑇, 𝑇𝐻} 𝑎𝑛𝑑
1 1 1 3
𝑃(𝐴) = + + =
4 4 4 4
Solution:
S = {1, 2, 3, 4, 5, 6}
E = {1, 2, 3}
1 2 1 4
𝑃(𝐸) = + + =
9 9 9 9
Example 1.20
In Example above, let A be the event that an even number turns up and let B be the event that a
number divisible by 3 occurs. Find P(A ∪ B) and P(A ∩ B). Solution: For the events A = {2, 4,
6} and B = {3, 6}, we have
A ∪ B = {2, 3, 4, 6} and A ∩ B = {6}
By assigning a probability of 1/9 to each odd number and 2/9 to each even number, we have
2 1 2 2 7
𝑃(𝐴 ∪ 𝐵) = + + + =
9 9 9 9 9
2
𝑃(𝐴 ∩ 𝐵) =
9
Rule: If an experiment can result in any one of N different equally likely outcomes, and if exactly
n of these outcomes corresponds to event A, then the probability of event A is
𝑛
𝑃(𝐴) =
𝑁
Example 1.21
A statistics class for engineers consists of 25 industrial, 10 mechanical, 10 electrical, and 8 civil
engineering students. If a person is randomly selected by the instructor to answer a question, find
the probability that the student chosen is
(a) an industrial engineering major
(b) a civil engineering or an electrical engineering major.
Solution:
I – industrial engineering
M – mechanical engineering
E - electrical engineering
C - civil engineering
Solution:
The number of ways of being dealt 2 aces from 4 cards is
If the outcomes of an experiment are not equally likely to occur, the probabilities must be
assigned on the basis of prior knowledge or experimental evidence.
For example, if a coin is not balanced, we could estimate the probabilities of heads and
tails by tossing the coin a large number of times and recording the outcomes.
Solution:
Using the additive rule
𝑃(𝐴 ∪ 𝐵) = 𝑃(𝐴) + 𝑃(𝐵) − 𝑃(𝐴 ∩ 𝐵) = 0.8 + 0.6 − 0.5 = 0.9
Example 1.24
What is the probability of getting a total of 7 or 11 when a pair of fair dice is tossed?
Solution:
1 1 2
𝑃(𝐴 ∪ 𝐵) = 𝑃(𝐴) + 𝑃(𝐵) = + =
6 18 9
Example 1.25
If the probabilities are, respectively, 0.09, 0.15, 0.21, and 0.23 that a person purchasing a
new automobile will choose the color green, white, red, or blue, what is the probability that a
given buyer will purchase a new automobile that comes in one of those colors?
Solution:
Compiled and prepared by: ENGR. K.T. CABANLIG
Let G – Green
W – White
R – Red
B – Blue
𝑃(𝐺 ∪ 𝑊 ∪ 𝑅 ∪ 𝐵) = 𝑃(𝐺) + 𝑃(𝑊) + 𝑃(𝑅) + 𝑃(𝐵)
= 0.09 + 0.15 + 0.21 + 0.23 = 0.68
Example 1.26
If the probabilities that an automobile mechanic will service 3, 4, 5, 6, 7, or 8 or more
cars on any given workday are, respectively, 0.12, 0.19, 0.28, 0.24, 0.10, and 0.07, what is the
probability that he will service at least 5 cars on his next day at work?
Solution:
Let E – event that at least 5 cars are serviced
Solution:
Let M – event that a cable meets specifications
S – event that the cable is too small
L – event that the cable is too large
(a)
𝑃(𝑀) = 0.99
(1 − 0.99)
𝑃(𝑆) = 𝑃(𝐿) = = 0.005
2
using Theorem 9
𝑃(𝑋 ≥ 1990) + 𝑃(𝑋 < 1990) = 1
𝑃(𝑋 ≥ 1990) = 1 − 𝑃(𝑆) = 1 − 0.005 = 0.995
SW:
1. Registrants at a large convention are offered 6 sightseeing tours on each of 3 days. In
how many ways can a person arrange to go on a sightseeing tour planned by this convention?
2. In a fuel economy study, each of 3 race cars is tested using 5 different brands of
gasoline at 7 test sites located in different regions of the country. If 2 drivers are used in the
study, and test runs are made once under each distinct set of conditions, how many test runs are
needed?
3. In how many different ways can a true-false test consisting of 9 questions be
answered?
4. (a) How many distinct permutations can be made from the letters of the word
COLUMNS?
(b) How many of these permutations start with the letter M?
5. (a) How many three-digit numbers can be formed from the digits 0, 1, 2, 3, 4, 5, and 6
if each digit can be used only once?
(b) How many of these are odd numbers?
(c) How many are greater than 330?
6. The probability that an American industry will locate in Shanghai, China, is 0.7, the
probability that it will locate in Beijing, China, is 0.4, and the probability that it will locate in
either Shanghai or Beijing or both is 0.8. What is the probability that the industry will locate (a)
in both cities? (b) in neither city?
7. If 3 books are picked at random from a shelf containing 5 novels, 3 books of poems,
and a dictionary, what is the probability that
(a) the dictionary is selected?
(b) 2 novels and 1 book of poems are selected?
8.A pair of fair dice is tossed. Find the probability of getting
(a) a total of 8;
(b) at most a total of
Example 1.28
Suppose that our sample space S is the population of adults in a small town who have
completed the requirements for a college degree. We shall categorize them according to gender
and employment status.
One of these individuals is to be selected at random for a tour throughout the country to publicize
the advantages of establishing new industries in the town.
Solution:
Let M − a man is chosen
E − the one chosen is employed
P(E ∩ M)
𝑃(𝑀|𝐸) =
𝑃(𝐸)
460 23
P(E ∩ M) = =
900 45
600
𝑃(𝐸) = = 23
900
P(E ∩ M) 23/45 23
𝑃(𝑀|𝐸) = = =
𝑃(𝐸) 2/3 30
OR
460 23
𝑃(𝑀|𝐸) = =
600 30
Example 1.29
The probability that a regularly scheduled flight departs on time is P(D)=0.83; the
probability that it arrives on time is P(A)=0.82; and the probability that it departs and arrives on
time is P(D ∩ A)=0.78. Find the probability that a plane
(a) arrives on time, given that it departed on time,
P(D ∩ A) 0.78
𝑃(𝐴|𝐷) = = = 0.94
𝑃(𝐷) 0.83
Example 1.30
The concept of conditional probability has countless uses in both industrial and
biomedical applications. Consider an industrial process in the textile industry in which strips of a
particular type of cloth are being produced. These strips can be defective in two ways, length and
nature of texture. For the case of the latter, the process of identification is very complicated. It is
known from historical information on the process that 10% of strips fail the length test, 5% fail
the texture test, and only 0.8% fail both tests. If a strip is selected randomly from the process and
a quick measurement identifies it as failing the length test, what is the probability that it is
texture defective?
Solution:
Let L – length defective
T – texture defective
Given that the strip is length defective, the probability that this strip is texture defective is given
by
P(T ∩ L) 0.008
𝑃(𝑇|𝐿) = = = 0.08
𝑃(𝐿) 0.1
Independent Events
Two events A and B are independent if and only if
𝑃(𝐵|𝐴) = 𝑃(𝐵) 𝑜𝑟 𝑃(𝐴|𝐵) = 𝑃(𝐴)
assuming the existences of the conditional probabilities.
Theorem 10: If in an experiment the events A and B can both occur, then
𝑃(𝐴 ∩ 𝐵) = 𝑃(𝐴)𝑃(𝐵|𝐴), provided 𝑃(𝐴) > 0
The probability that both A and B occur is equal to the probability that A occurs
multiplied by the conditional probability that B occurs, given that A occurs.
Example 1.31
Suppose that we have a fuse box containing 20 fuses, of which 5 are defective. If 2 fuses
are selected at random and removed from the box in succession without replacing the first, what
is the probability that both fuses are defective? Solution: We shall let A be the event that the first
fuse is defective and B the event that the second fuse is defective; then we interpret A ∩ B as the
event that A occurs and then B occurs after A has occurred. The probability of first removing a
defective fuse is 1/4; then the probability of removing a second defective fuse from the
remaining 4 is 4/19. Hence,
Solution:
Compiled and prepared by: ENGR. K.T. CABANLIG
1 4 1
𝑃(𝐴 ∩ 𝐵) = { | { | =
4 19 19
Example 1.32
One bag contains 4 white balls and 3 black balls, and a second bag contains 3 white balls
and 5 black balls. One ball is drawn from the first bag and placed unseen in the second bag.
What is the probability that a ball now drawn from the second bag is black?
Solution:
Let 𝐵" – black ball form bag 1
𝐵! – black ball from bag 2
𝑊" − white ball from bag 1
𝑃[(𝐵" ∩ 𝐵! ) 𝑜𝑟 (𝑊" ∩ 𝐵! )]
= 𝑃(𝐵" ∩ 𝐵! ) + 𝑃(𝑊" ∩ 𝐵!
= 𝑃(𝐵" )𝑃(𝐵! |𝐵" ) + 𝑃(𝑊" )𝑃(𝐵! |𝑊" )
3 6 4 5 38
= { |{ | + { |{ | =
7 9 7 9 63
Example 1.33
A small town has one fire engine and one ambulance available for emergencies. The
probability that the fire engine is available when needed is 0.98, and the probability that the
ambulance is available when called is 0.92. In the event of an injury resulting from a burning
Solution:
Let 𝐴 – event that fire engine is available
𝐵 – event that the ambulance is available
𝑃(𝐴 ∩ 𝐵) = 𝑃(𝐴)𝑃(𝐵) = (0.98)(0.92) = 0.9016
Example 1.34
An electrical system consists of four components as illustrated. The system works if
components A and B work and either of the components C or D works. The reliability
(probability of working) of each component is also shown. Find the probability that (a) the entire
system works and (b) the component C does not work, given that the entire system works.
Assume that the four components work independently.
Solution:
Let 𝐴 , 𝐵, 𝐶, 𝐷 – constitute a serial circuit system, whereas the subsystem C and D itself is
parallel circuit system.
Theorem 12: If, in an experiment, the events A1, A2,...,Ak can occur, then
𝑃(𝐴" ∩ 𝐴! ∩···∩ 𝐴$ ) = 𝑃(𝐴" )𝑃(𝐴! |𝐴" )𝑃(𝐴# |𝐴" ∩ 𝐴! ) ··· 𝑃(𝐴$ |𝐴" ∩ 𝐴! ∩···∩ 𝐴$ − 1)
Example 1.35
Three cards are drawn in succession, without replacement, from an ordinary deck of
playing cards. Find the probability that the event A1 ∩ A2 ∩ A3 occurs, where A1 is the event
that the first card is a red ace, A2 is the event that the second card is a 10 or a jack, and A3 is the
event that the third card is greater than 3 but less than 7.
Solution:
Let 𝐴" – the first card is a red ace
𝐴! – the second card is a 10 or a jack
𝐴# − the second card is a 10 or a jack
2 8 12
𝑃(𝐴" ) = , 𝑃(𝐴! |𝐴" ) = , 𝑃(𝐴# |𝐴" ∩ 𝐴! ) = ′
52 51 50
Total Probability
Suppose that our sample space S is the population of adults in a small town who have
completed the requirements for a college degree. We shall categorize them according to gender
and employment status.
One of these individuals is to be selected at random for a tour throughout the country to publicize
the advantages of establishing new industries in the town.
Suppose that we are now given the additional information that 36 of those employed and
12 of those unemployed are members of the Rotary Club. We wish to find the probability of the
event A that the individual selected is a member of the Rotary Club. Referring to Figure below,
we can write A as the union of the two mutually exclusive events 𝐸 ∩ 𝐴 and 𝐸′ ∩ 𝐴. Hence,
Theorem 13: If the events B1, B2,...,Bk constitute a partition of the sample space S such that
P(Bi) = 0 for i = 1, 2,...,k, then for any event A of S,
Proof : Consider the Venn diagram. The event A is seen to be the union of the mutually
exclusive events
𝐵" ∩ 𝐴, 𝐵! ∩ 𝐴, . . . , 𝐵$ ∩ 𝐴
that is,
𝐴 = (𝐵1 ∩ 𝐴) ∪ (𝐵2 ∩ 𝐴) ∪···∪ (𝐵𝑘 ∩ 𝐴)
Using Corollary 1.2 of Theorem 1.7 and Theorem 1.10, we have
𝑃(𝐴) = 𝑃[(𝐵" ∩ 𝐴) ∪ (𝐵! ∩ 𝐴) ∪···∪ (𝐵$ ∩ 𝐴)]
= 𝑃(𝐵" ∩ 𝐴) + 𝑃(𝐵! ∩ 𝐴) + ··· + 𝑃(𝐵$ ∩ 𝐴)
$
€ 𝑃(𝐵" ∩ 𝐴)
56"
$
€ 𝑃(𝐵" )𝑃(𝐴|𝐵5 )
56"
Example 1.35
In a certain assembly plant, three machines, B1, B2, and B3, make 30%, 45%, and 25%,
respectively, of the products. It is known from past experience that 2%, 3%, and 2% of the
products made by each machine, respectively, are defective. Now, suppose that a finished
product is randomly selected. What is the probability that it is defective?
Solution:
Let 𝐴" – the product is defective
𝐵" – the product is made by machine B1
𝐵! – the product is made by machine B2
𝐵! – the product is made by machine B3
Applying the rule of elimination, we can write
𝑃(𝐴) = 𝑃(𝐵1)𝑃(𝐴|𝐵1) + 𝑃(𝐵2)𝑃(𝐴|𝐵2) + 𝑃(𝐵3)𝑃(𝐴|𝐵3)
𝑃(𝐵1)𝑃(𝐴|𝐵1) = (0.3)(0.02) = 0.006,
𝑃(𝐵2)𝑃(𝐴|𝐵2) = (0.45)(0.03) = 0.0135,
Theorem 2.14: (Bayes’ Rule) If the events 𝑩𝟏 , 𝑩𝟐 ,...,Bk constitute a partition of the sample
space S such that 𝑷(𝑩𝒊) ≠ 0 for i = 1, 2,...,k, then for any event A in S such that P(A) = 0
𝑃(𝐵𝑟 ∩ 𝐴) 𝑃(𝐵& )𝑃(𝐴|𝐵& )
𝑃(𝐵& |𝐴) = $ = $ 𝑓𝑜𝑟 1,2, … , 𝑘
∑56" 𝑃(𝐵𝑖 ∩ 𝐴) ∑56" (𝐵5 )𝑃(𝐴|𝐵5 )
Proof : By the definition of conditional probability,
𝑃(𝐵& ∩ 𝐴)
𝑃(𝐵& |𝐴) =
𝑃(𝐴)
and then using Theorem 1.13 in the denominator, we have
𝑃(𝐵& ∩ 𝐴) 𝑃(𝐵& )𝑃(𝐴|𝐵& )
𝑃(𝐵& |𝐴) = $ = $ ) ,
∑56" 𝑃(𝐵𝑖 ∩ 𝐴) ∑56" 𝑃(𝐵5 )𝑃(𝐴|𝐵5 )
which completes the proof.
Instead of asking for 𝑃(𝐴) in Example 1.35, by the rule of elimination, suppose that we
now consider the problem of finding the conditional probability 𝑃(𝐵5 |𝐴). In other words,
suppose that a product was randomly selected and it is defective. What is the probability that this
product was made by machine Bi? Questions of this type can be answered by using the following
theorem, called Bayes’ rule:
If a product was chosen randomly and found to be defective, what is the probability that it was
made by machine B3?
Solution:
Using Bayes’ rule to write
𝑃(𝐵# )𝑃(𝐴|𝐵# )
𝑃(𝐵# |𝐴) =
𝑃(𝐵" )𝑃(𝐴|𝐵" ) + 𝑃(𝐵! )𝑃(𝐴|𝐵! ) + 𝑃(𝐵# )𝑃(𝐴|𝐵# )
0.005 0.005 10
𝑃(𝐵# |𝐴) = = =
0.006 + 0.0135 + 0.005 0.0245 49
In view of the fact that a defective product was selected, this result suggests that it
probably was not made by machine B3.
Solution:
𝑃(𝑃" ) = 0.30, 𝑃(𝑃! ) = 0.20, 𝑎𝑛𝑑 𝑃(𝑃# ) = 0.50,
we must find 𝑃(𝑃7 |𝐷) for j = 1, 2, 3. Bayes’ rule (Theorem 1.14) shows
𝑃(𝑃1)𝑃(𝐷|𝑃1)
𝑃(𝑃" |𝐷) =
𝑃(𝑃1)𝑃(𝐷|𝑃1) + 𝑃(𝑃2)𝑃(𝐷|𝑃2) + 𝑃(𝑃3)𝑃(𝐷|𝑃3)
(0.30)(0.01)
=
(0.3)(0.01) + (0.20)(0.03) + (0.50)(0.02)
0.003
= = 0.158
. 019
(0.03)(0.20)
𝑃(𝑃! |𝐷) = = 0.316
0.019
(0.02)(0.50)
𝑃(𝑃# |𝐷) = = 0.526
0.019
The conditional probability of a defect given plan 3 is the largest of the three; thus a defective for
a random product is most likely the result of the use of plan 3.
Sample mean
The mean is simply a numerical average.
Suppose that the observations in a sample are 𝑥" , 𝑥! , . . . , 𝑥' . The sample mean, denoted by 𝑥†, is
'
𝑥5 𝑥" + 𝑥! + ⋯ + 𝑥'
𝑥̅ = € =
𝑛 𝑛
56"
Sample median
The purpose of the sample median is to reflect the central tendency of the sample in such
a way that it is uninfluenced by extreme values or outliers.
Given that the observations in a sample are 𝑥" , 𝑥! , . . . , 𝑥' , arranged in increasing order of
magnitude, the sample median is
𝑥'8"
𝑥† = 𝑖𝑓 𝑛 𝑖𝑠 𝑜𝑑𝑑
2
1 𝑥' 𝑥'
𝑥† = ‰ + Š 𝑖𝑓 𝑛 𝑖𝑠 𝑒𝑣𝑒𝑛
2 2 2+1
Trimmed means
A trimmed mean is computed by “trimming away” a certain percent of both the largest
and the smallest set of values.
Measures of Variability
Process and product variability is a fact of life in engineering and scientific systems: The
control or reduction of process variability is often a source of major difficulty. More and more
process engineers and managers are learning that product quality and, as a result, profits derived
from manufactured products are very much a function of process variability.
The sample standard deviation, denoted by s, is the positive square root of 𝑠 ! , that is,
𝑠‹𝑠 !
Scatter Plot
Example:
A textile manufacturer who designs an experiment where cloth specimen that contain various
percentages of cotton are produced.
Tensile Strength
The analysis of the data should revolve around a different type of model, one that postulates a
type of structure relating the population mean tensile strength to the cotton concentration. A
model may be written
𝜇<,> = 𝛽. + 𝛽"? + 𝛽! 𝐶 ! ,
where 𝜇<,> is the population mean tensile strength, which varies with the amount of cotton in the
product C. The implication of this model is that for a fixed cotton level, there is a population of
tensile strength measurements and the population mean is μt,c. This type of model, called a
regression model.
Stem-and-Leaf Plot
Statistical data, generated in large masses, can be very useful for studying the behavior of
the distribution if presented in a combined tabular and graphic display called a stem-and-leaf
plot.
Example:
Consider the data on the table, which specifies the “life” of 40 similar car batteries recorded to
the nearest tenth of a year. The batteries are guaranteed to last 3 years.
Histogram
Data are grouped into different classes or intervals which can be constructed by counting
the leaves belonging to each stem and noting that each stem defines a class interval.
Graphical tools such as what we see in the figures above aids in the characterization of
the nature of the population, and one property of population is distribution or probability
distribution.
A distribution is said to be symmetric if it can be folded along a vertical axis so that the
two sides coincide. A distribution that lacks symmetry with respect to a vertical axis is said to be
skewed. The distribution illustrated in Figure 1.8(a) is said to be skewed to the right since it has
a long right tail and a much shorter left tail. In Figure 1.8(b) we see that the distribution is
symmetric, while in Figure 1.8(c) it is skewed to the left.
Examples: 1
𝑥̅ = (3.4 + 2.5 + 4.8 + 2.9 + 3.6 + 2.8
1. The following measurements were 15
recorded for the drying time, in hours, of a + 3.3 + 5.6 + 3.7 + 2.8
certain brand of latex paint. + 4.4 + 4.0 + 5.2 + 3.0
3.4 2.5 4.8 2.9 3.6 2.8 + 4.8)
3.3 5.6 3.7 2.8 4.4 𝑥̅ = 3.787
4.0 5.2 3.0 4.8 (c) Calculate the sample median.
Assume that the measurements are a simple 2.5 2.8 2.8 2.9 3.0 3.3
random sample. 3.4 3.6 3.7 4.0 4.4
(a) What is the sample size for the above 4.8 4.8 5.2 5.6
sample? (d) Plot the data by way of a dot plot.
(b) Calculate the sample mean for these
data.
(c) Calculate the sample median.
(d) Plot the data by way of a dot plot. (e) Compute the 20% trimmed mean for the
(e) Compute the 20% trimmed mean for the above data set.
above data set. 2.9 3.0 3.3 3.4 3.6 3.7
(f) Is the sample mean for these data more or 4.0 4.4 4.8
less descriptive as a center of location than 1
𝑥̅ = (2.9 + 3.0 + 3.3 + 3.4 + 3.6 + 3.7
the trimmed mean? 15
+ 4.0 + 4.4 + 4.8)
Solution: 𝑥̅ = 3.678
(a) What is the sample size for the above (f) Is the sample mean for these data more or
sample? less descriptive as a center of location than
= 15 the trimmed mean? They are about the
(b) Calculate the sample mean for these same.
data. (g) Compute the sample variance and
sample standard deviation.
where H and NH in the table stand for 9. For married couples living in a certain
Hypertension and Non-hypertension, suburb, the probability that the husband will
respectively. If one of these individuals is vote on a bond referendum is 0.21, the
selected at random, find the probability that probability that the wife will vote on the
the person is: referendum is 0.28, and the probability that
Let: A - a person is experiencing both the husband and the wife will vote is
hypertension 0.15. What is the probability that
B – a person is a heavy smoker
C – a person is non-smoker SOLUTION:
LET: H -the husband will vote on the bond
(a) experiencing hypertension, given that the referendum,
person is a heavy smoker; W- the wife will vote on the bond
referendum
30
𝑃(𝐴|𝐵) = 𝑃(𝐻) = 0.21, 𝑃(𝑊) = 0.28,
49
𝑃(𝐻 ∩ 𝑊) = 0.15
(b) a nonsmoker, given that the person is (a) at least one member of a married couple
experiencing no hypertension. will vote?
48 16 𝑃(𝐻 ∪ 𝑊) = 𝑃(𝐻) + 𝑃(𝑊) − 𝑃(𝐻 ∩ 𝑊)
𝑃(𝐶|𝐴′) = = 0.21 + 0.28 − 0.15 = 0.34
93 31
8. In the senior year of a high school (b) a wife will vote, given that her husband
graduating class of 100 students, 42 studied will vote?
mathematics, 68 studied psychology, 54 𝑃(𝐻 ∩ 𝑊) 0.15 5
𝑃(𝑊|𝐻) = = =
studied history, 22 studied both mathematics 𝑃(𝐻) 0.21 7
and history, 25 studied both mathematics
and psychology, 7 studied history but (c) a husband will vote, given that his wife
neither mathematics nor psychology, 10 will not vote?
studied all three subjects, and 8 did not take 𝑃(𝐻 ∩ 𝑊′)
any of the three. Randomly select a student 𝑃(𝐻|𝑊′) =
𝑃(𝑊′)
from the class and find the probabilities of 𝑃(𝑀 ∩ 𝑊′) = 𝑃(𝑀) − 𝑃(𝑀 ∩ 𝑊)
the following events. = 0.21 − 0.15 = 0.06
𝑃(𝑊 D ) = 1 − 𝑃(𝑊) = 1 − 0.28 = 0.72
Solution:
(a) A person enrolled in psychology takes all 𝑃(𝐻 ∩ 𝑊′)
three subjects. 𝑃(𝐻|𝑊′) =
10 𝑃(𝑊′)
𝑃(𝑀 ∩ 𝑃 ∩ 𝐻) = 0.06
68 = 0.083
0.72
Example 2.
If a sample space contains a finite A shipment of 20 similar laptop computers
number of possibilities or an unending to a retail outlet contains 3 that are
sequence with as many elements as there are defective. If a school makes a random
whole numbers, it is called a discrete purchase of 2 of these computers, find the
sample space. probability distribution for the number of
A discrete random variable is one defectives.
which may take on only a countable number
Compiled and prepared by: ENGR. K.T. CABANLIG
Solution: Thus, the probability distribution of X is
Let X be a random variable whose values x
are the possible numbers of defective
computers purchased by the school. Then x
can only take the numbers 0, 1, and 2.
The cumulative distribution function 𝐹(𝑥)
Following a hypergeometric distribution (a of a discrete random variable X with
discrete probability distribution that probability distribution 𝑓(𝑥) is
describes the probability of "𝒌" successes 𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) = € 𝑓(𝑡)
[random draws for which the object drawn
<I;
has a specific feature] in "𝒏" draws, without 𝑓𝑜𝑟 − ∞ < 𝑥 < ∞.
replacement, from a finite population of size
"𝑵" that contain exactly "𝑲" the objects Example 2.
with that feature, wherein each draw is If a car agency sells 50% of its inventory of
either a success or a failure). In contrast, a certain foreign car equipped with side
the binomial distribution describes the airbags, find a formula for the probability
probability of "𝒌" successes in "𝒏" draws distribution of the number of cars with side
with replacement. airbags among the next 4 cars sold by the
𝑋=𝑥 agency.
Solution:
The probability of selling an automobile
with side airbags = 0.5
where:
Points in the sample space = 2 × 2 × 4 =
16
Probability Histogram
Example 3:
Suppose that the error in the reaction
temperature, in °𝐶, for a controlled
laboratory experiment is a continuous 2 1
random variable X having the probability 𝑃(0 < 𝑋 ≤ 1) = 𝐹(1) − 𝐹(0) = −
9 9
density function 1
=
9
Example:
(a) Verify that f(x) is a density function. 5. 1 Classify the following random variables
J ! !
𝑥 𝑑𝑥 𝑥 # ! 8 1 as discrete or continuous:
ž 𝑓(𝑥)𝑑𝑥 = ž = |+" = + X: the number of automobile accidents per
+J +" 3 9 9 9 year in Virginia. − 𝐷𝑖𝑠𝑐𝑟𝑒𝑡𝑒
=1 Y : the length of time to play 18 holes of
golf. −𝐶𝑜𝑛𝑡𝑖𝑛𝑢𝑜𝑢𝑠
(b) Find P(0 < X ≤ 1). M: the amount of milk produced yearly by a
" particular cow. −𝐶𝑜𝑛𝑡𝑖𝑛𝑢𝑜𝑢𝑠
𝑥 ! 𝑑𝑥 𝑥 # " 1 N: the number of eggs laid each month by a
𝑃(0 < 𝑋 ≤ 1) = ž = |. =
. 3 9 9 hen. −𝐷𝑖𝑠𝑐𝑟𝑒𝑡𝑒
P: the number of building permits issued
each month in a certain city. −𝐷𝑖𝑠𝑐𝑟𝑒𝑡𝑒
Q: the weight of grain produced per acre.
−𝐶𝑜𝑛𝑡𝑖𝑛𝑢𝑜𝑢𝑠
Compiled and prepared by: ENGR. K.T. CABANLIG
5. An overseas shipment of 5 foreign 𝑠 = {𝐻𝐻𝐻, 𝑇𝐻𝐻𝐻
automobiles contains 2 that have slight paint ,HTHHH,TTHHH,TTTHHH,HTT,HHH,TH
blemishes. If an agency receives 3 of these THHH,HHTHHH, …}
automobiles at random, list the elements of − 𝐶𝑜𝑛𝑡𝑖𝑛𝑢𝑜𝑢𝑠
the sample space S, using the letters B and N
for blemished and non-blemished, 8. Determine the value c so that each of the
respectively; then to each sample point following functions can serve as a
assign a value x of the random variable X probability distribution of the discrete
representing the number of automobiles with random variable X:
paint blemishes purchased by the agency. (a) 𝑓(𝑥) = 𝑐(𝑥 ! + 4), 𝑓𝑜𝑟 𝑥 = 0, 1, 2, 3;
Solution: Solution:
∑; 𝑓(𝑥) = 1
#
€ 𝑐(𝑥 ! + 4) = 1
;6.
= 𝑐 [(0! + 4) + (1! + 4) + (2! + 4)
+ (3 + 4)] = 1
= 𝑐[4 + 5 + 8 + 13] = 1
= 30𝑐 = 1
1
𝑐=
6. Let W be a random variable giving the 30
number of heads minus the number of tails (b)
in three tosses of a coin. List the elements of
the sample space S for the three tosses of the 𝑓𝑜𝑟 𝑥 = 0, 1, 2
coin and to each sample point assign a value !
w of W. € 𝑓(𝑥) = 1
;6.
Solution:
2! 3!
𝑐 { |{ |
0! − (2 − 0!)! 3! (3 − 3)!
2! 3!
+{ |{ |
1! − (2 − 1!)! 2! (3 − 2)!
2! 3!
+{ |{ |¡ = 1
2! − (2 − 2!)! 1! (3 − 1)!
1
𝑐=
10
7. A coin is flipped until 3 heads in 10. The shelf life, in days, for bottles of a
succession occur. List only those elements certain prescribed medicine is a random
of the sample space that require 6 or less variable having the density function
tosses. Is this a discrete sample space?
Explain.
Solution:
Compiled and prepared by: ENGR. K.T. CABANLIG
Find the probability that a bottle of this (a) less than 120 hours;
medicine will have a shell life of 120
(a) at least 200 days 𝑃 {𝑋 < | = 𝑃(𝑋 < 1.2)
100
".!
Solution:
Let X = values 0, 1, and 2
Find the probability that over a period of one
year, a family runs their vacuum cleaner
If 𝑋 = 1
5𝐶! × 2𝐶"
𝑃(𝑋 = 1) =
7𝐶#
5! 2!
{ ( |×{ ( | Probability Histogram
2! 5 − 2)! 1! 2 − 1)!
=
7!
{ (7 | 13. Find the cumulative distribution function
3! − 3)!
10 × 2 of the random variable X representing the
= number of defectives in Exercise 12. Then
35
4 using F(x), find
= = 0.571 𝐹(𝑥) = 𝑃(𝑋 = 𝑥) =
7
0, 𝑥 < 0
If 𝑋 = 2 𝑓(0), 0 ≤ 𝑥 < 1
5𝐶" × 2𝐶! 𝑓(0) + 𝑓(1), 1 ≤ 𝑥 < 2
𝑃(𝑋 = 3) =
7𝐶# 𝑓(0) + 𝑓(1) + 𝑓(2), 2 ≤ 𝑥
5! 2!
{ ( ) |×{ ( |
1! 5 − 1 ! 2! 2 − 2)! if 𝐹(0 < 𝑥) = 0
=
7!
{ (7 | if 𝐹(0)
3! − 3)!
5×1 1 5!
=
35
= = 0.143
7 5𝐶# {3! (5 − 3)!|
= 𝑓(0) = =
7𝐶# { 7!
|
3! (7 − 3)!
2
=
7
if 𝐹(1)
= 𝑓(0) + 𝑓(1)
5𝐶# 5𝐶! × 2𝐶"
= +
7𝐶# 7𝐶#
5! 2!
2 {2! (5 − 2)!| × {1! (2 − 1)! |
= +
7 7!
{ (7 |
3! − 3)!
2 4 6
= + =
7 7 7
if 𝐹(2)
𝑓(𝑥, 𝑦)
𝑓(𝑥|𝑦) = , 𝑝𝑟𝑜𝑣𝑖𝑑𝑒𝑑 ℎ(𝑦) > 0.
ℎ(𝑦)
probability distribution f(x, y) and marginal = ž ž ž 𝑒 +;! +;" +;# 𝑑𝑥" 𝑑𝑥! 𝑑𝑥#
! " .
distributions g(x) and h(y), respectively. The
random variables X and Y are said to be = (1 − 𝑒 +! )(𝑒 +" − 𝑒 +# )𝑒 +! = 0.0372.
statistically independent if and only if
𝑓(𝑥, 𝑦) = 𝑔(𝑥)ℎ(𝑦) Example 20
for all (x, y) within their range. A coin is biased such that a head is three
times as likely to occur as a tail. Find the
Let 𝑋" , 𝑋! , . . . , 𝑋' be n random expected number of tails when this coin is
variables, discrete or continuous, with joint tossed twice.
probability distribution 𝑓(𝑥" , 𝑥! , . . . , 𝑥' ) and
marginal distribution
3 3 9
𝑃(𝑋 = 0) = 𝑃(𝐻𝐻) = × =
4 4 16
Find the expected value of X.
𝑃(𝑋 = 1) = 𝑃(𝐻𝑇) + 𝑃(𝑇𝐻)
3 1 3 1 3 Solution:
= × + × = 4 " 𝑥 ln 4
4 4 4 4 8 𝐸(𝑋) = ž 𝑑𝑥 =
𝜋 . 1+𝑥 ! 𝜋
1 1 1
𝑃(𝑋 = 2) = 𝑃(𝑇𝑇) = × =
4 4 16 Example 23
If a dealer’s profit, in units of $5000, on a
The probability Distribution: new automobile can be looked upon as a
random variable X having the density
function
9 3
𝜇 = 𝐸(𝑋) = {0 × | + {1 × |
16 8 find the average profit per automobile.
1 1
+ {2 × | =
16 2 Solution:
"
1
Example 21 𝑓(𝑥) = ž 2𝑥(1 − 𝑥) 𝑑𝑥 =
. 3
A private pilot wishes to insure his airplane
1
for $200,000. The insurance company 𝐸(𝑋) = × ($5,000) = $1,667.67
estimates that a total loss will occur with 3
probability 0.002, a 50% loss with
Example 24
probability 0.01, and a 25% loss with
Suppose that you are inspecting a lot of
probability 0.1. Ignoring all other partial
1000 light bulbs, among which 20 are
losses, what premium should the insurance
defectives. You choose two light bulbs
company charge each year to realize an
randomly from the lot without replacement.
average profit of $500?
Let
Solution:
𝐸(𝑋) = (200,000) × (0.002) Find the probability that at least one light
+ (100,000) × (0.01) bulb chosen is defective
+ (50,000) × (0.1)
+ (0) × (0.888) Solution:
= $6,400 𝑃(𝑋" + 𝑋! ≥ 1)
𝜇P (𝑋) = € 𝑔(𝑥)𝑓(𝑥) 2 3
= (0 − 1) × + (0 − 2) ×
; 70 70
1 1 3
= {25 × | + {169 × | + (1 − 0) ×
6 2 70
1 9
+ {361 × | = 209 + (1 − 2) ×
3 70
9
Example 26 + (2 − 0) ×
70
From a sack of fruit containing 3 oranges, 2 18
apples, and 3 bananas, a random sample of 4 + (2 − 1) ×
70
pieces of fruit is selected. If X is the number 3
of oranges and Y is the number of apples in + (3 − 0) ×
70
the sample, 2
+ (3 − 1) ×
the joint probability distribution of X and Y 70
+ (3 − 2) × 0
−2 − 6 + 3 − 9 + 18 + 18 + 9 + 4
=
70
𝑥 = 0,1,2,3 𝑦 = 0,1,2
1
=
2
Example 28
The proportion of the budget for a certain
type of industrial company that is allotted to
environmental and pollution control is
coming under scrutiny. A data collection
project determines that the distribution of
these proportions is given by
(b) Variability of continuous observations about the
Example 2: mean.
Find the mean and variance of the binomial
random variable of Example 1, and then use
Chebyshev’s theorem to interpret the
interval μ ± 2σ.
Chebyshev’s Theorem
If a random variable has a small
variance or standard deviation, we would
expect most of the values to be grouped
around the mean. Therefore, the probability
that the random variable assumes a value
within a certain interval about the mean is Variability of discrete observations about the mean.
Interpretation: Example 3:
Chebyshev’s theorem states that the number The complexity of arrivals and
of recoveries among 15 patients who departures of planes at an airport is such that
contracted the disease has a probability of at computer simulation is often used to model
least 3/4 of falling between 2.206 and 9.794 the “ideal” conditions. For a certain airport
or, because the data are discrete, between 2 with three runways, it is known that in the
and 10 inclusive. ideal set ting the following are the
probabilities that the individual runways are
Multinomial Experiments and the accessed by a randomly arriving commercial
Multinomial Distribution jet:
The binomial experiment becomes a Runway 1: 𝑝" = 2/9,
multinomial experiment if we let each trial Runway 2: 𝑝! = 1/6,
have more than two possible outcomes. Runway 3: 𝑝# = 11/18.
In general, if a given trial can result What is the probability that 6 randomly
in any one of k possible outcomes arriving airplanes are distributed in the
𝐸" , 𝐸! , . . . , 𝐸$ with probabilities following fashion?
𝑝" , 𝑝! , . . . , 𝑝$ , then the multinomial Runway 1: 2 airplanes,
distribution will give the probability that Runway 2: 1 airplane,
𝐸" occurs 𝑥" times, 𝐸! occurs 𝑥! times, ... , Runway 3: 3 airplanes
Example 4:
Lots of 40 components each are
6! 2! 1 11# deemed unacceptable if they contain 3 or
= × ! × × # = 0.1127
2! 1! 3! 9 6 18 more defectives. The procedure for sampling
a lot is to select 5 components at random
II. Hypergeometric Distribution and to reject the lot if a defective is found.
Hypergeometric distribution does not What is the probability that exactly 1
require independence and is based on defective is found in the sample if there are
sampling done without replacement. 3 defectives in the entire lot?
Solution: Using the hypergeometric
The total number of samples of size distribution with 𝑛 = 5, 𝑁 = 40, 𝑘 =
n chosen from N items is . These 3, 𝑎𝑛𝑑 𝑥 = 1, we find the probability of
samples are assumed to be equally likely. obtaining 1 defective to be
There are
Solution:
ways of selecting x successes from the ℎ(1; 40, 5, 3)
𝑘 that are available, and for each of these
ways we can choose the 𝑛 − 𝑥 failures in
Example 5:
Find the mean and variance of the random
𝑚𝑎𝑥{0, 𝑛 − (𝑁 − 𝑘)} ≤ 𝑥 ≤ 𝑚𝑖𝑛{𝑛, 𝑘}
variable of Example 4 and then use
Chebyshev’s theorem to interpret the
The range of 𝑥 can be determined by
interval μ ± 2σ.
the three binomial coefficients in the
definition, where 𝑥 and 𝑛 − 𝑥 are no more
Solution:
than 𝑘 and 𝑁 − 𝑘, respectively, and both of
𝑁 = 40, 𝑛 = 5, 𝑎𝑛𝑑 𝑘 = 3,
them cannot be less than 0. Usually, when (5)(3) 3
both k (the number of successes) and 𝑁 − 𝜇 = = = 0.375
𝑘 (the number of failures) are larger than 40 8
Solution:
a. In Figure (a)
𝐴 = 1 − 𝑎𝑟𝑒𝑎 𝑡𝑜 𝑡ℎ𝑒 𝑙𝑒𝑓𝑡
𝐴 = 1 − 0.3015
= 0.0.6985 (𝑎𝑟𝑒𝑎 𝑡𝑜 𝑡ℎ𝑒 𝑙𝑒𝑓𝑡)
Solution: From Table A.3
a. 𝑘 = 0.52
𝑧 = 1.84
𝐴 = 1 − 𝑎𝑟𝑒𝑎 𝑡𝑜 𝑡ℎ𝑒 𝑙𝑒𝑓𝑡 𝑜𝑓 𝑍 b. In figure b
𝐴 = 1 − 0.9671 = 0.0329 −0.18 = 0.4286
b. 𝐴 = 0.4286 − 0.4197 = 0.0089
𝑧 = −1.97 𝑎𝑛𝑑 𝑧 = 0.86 𝑘 = −2.37
𝐴 = 0.8051 − 0.0244 = 0.7807
Example 3:
Example 2: Given a random variable X having a normal
(a) 𝑃(𝑍 > 𝑘) = 0.3015 and distribution with μ = 50 and σ = 10, find the
(b) 𝑃(𝑘 < 𝑍 < −0.18) = 0.4197. probability that X assumes a value between
45 and 62.
Solution:
45 − 50
𝑧" = = −0.5
10
62 − 50
𝑧! = = 1.2
10
𝑃(45 < 𝑋 < 62) = 𝑃(−0.5 < 𝑍 < 1.2)
= 𝑃(𝑍 < 1.2) − 𝑃(𝑍 < −0.5)
= 0.8849 − 0.3085 = 0.5764
(𝜇 − 2𝜎) − 𝜇
𝑧" = = −2
𝜎
(𝜇 + 2𝜎) − 𝜇
𝑧! = = 2 Solution:
𝜎
Hence, 778 − 800
𝑃(𝜇 − 2𝜎 < 𝑋 < 𝜇 + 2𝜎) = 𝑃(𝑍 𝑧" = = −0.55
40
< 2) − 𝑃(𝑍 < −2)
= 0.9772 − 0.0228 = 0.9544 834 − 800
𝑧! = = 0.85
which is a much stronger statement than that 40
given by Chebyshev’s theorem. 𝑃(778 < 𝑋 < 834)
= 𝑃(−0.55 < 𝑍 < 0.85)
Example 5: = 𝑃(𝑍 < 0.85) − 𝑃(𝑍 < −0.55)
A certain type of storage battery lasts, on
average, 3.0 years with a standard deviation Example 8:
of 0.5 year. Assuming that battery life is Gauges are used to reject all components for
normally distributed, find the probability which a certain dimension is not within the
that a given battery will last less than 2.3 specification 1.50 ± d. It is known that this
years. measurement is normally distributed with
mean 1.50 and standard deviation 0.2.
Assuming 𝑥 = 4
using binomial distribution
𝑃(𝑋 = 4) = 𝑏(4; 15, 0.4) = 0.1268
The shaded region under the normal curve between the two ordinates 𝑥" =
3.5 𝑎𝑛𝑑 𝑥! = 4.5
3.5 − 6
𝑧" = = −1.32
1.897
4.5 − 6
𝑧2 = = −0.79
1.897
𝑃(𝑋 = 4)
= 𝑏(4; 15, 0.4)
≈ 𝑃(−1.32 < 𝑍 < −0.79)
= 𝑃(𝑍 < −0.79) − 𝑃(𝑍 < −1.32)
= 0.2148 − 0.0934 = 0.1214.
Note: The normal approximation is most useful in calculating binomial sums for large values of
n.
EXAMPLES:
12. A soft-drink machine is regulated so that
it discharges an average of 200 milliliters
per cup. If the amount of drink is normally
distributed with a standard deviation equal
𝑃(𝑋 < 30) ≈ 𝑃(𝑍 < −2.14) to 15 milliliters,
= 0.0162. (a) what fraction of the cups will contain
more than 224 milliliters?
Example 11:
A multiple-choice quiz has 200 questions, Solution:
each with 4 possible answers of which only 𝑥 − 𝜇 224 − 200
1 is correct. What is the probability that 𝑧= = 1.6
𝜎 15
sheer guesswork yields from 25 to 30
correct answers for the 80 of the 200 𝑃(𝑥 > 224) = 𝑃(𝑍 > 1.6)
problems about which the student has no = 1 − 𝑃(𝑍 < 1.6)
knowledge? 1 − 0.9452 = 0.0548
𝑃(−0.6 ≤ 𝑋 ≤ 0.6) (b) If the office opens at 9:00 A.M. and the
= 𝑃(−0.6 ≤ 𝑍 ≤ 0.6) lawyer leaves his house at 8:45 A.M. daily,
= 𝑃(𝑍 < 0.6) − 𝑃(𝑍 > −0.6) what percentage of the time is he late for
= 0.7257 − 0.2743 = 0.4514 work?
(c) how many cups will probably overflow if
230- milliliter cups are used for the next Solution:
1000 drinks? 𝑃(𝑋 > 15) = 1 − 𝑃(𝑋 ≤ 15)
15 − 24
Solution: = 1 − 𝑃 {𝑋 < |
3.8
𝑥 − 𝜇 230 − 200 = 1 − 𝑃(𝑍 ≤ −2.37)
𝑧= = =2
𝜎 15 = 1 − 0.0089 = 0.9911 = 99.11%
𝑃(𝑋 > 230) = 𝑃(𝑍 > 2) (c) If he leaves the house at 8:35 A.M. and
= 1 − 𝑃(𝑍 < 2) = 1 − 0.9772 = 0.0228 coffee is served at the office from 8:50 A.M.
until 9:00 A.M., what is the probability that
𝐸(𝑋) = 𝑛 × 𝑝 = 1000 × 0.0228 he misses coffee?
= 22.8 = 23
Solution:
(d) below what value do we get the smallest 𝑃(𝑋 > 25) = 1 − 𝑃(𝑋 ≤ 25)
25% of the drinks? 25 − 24
= 1 − 𝑃 {𝑋 < |
3.8
Solution: = 1 − 𝑃(𝑍 ≤ 0.23)
𝑃(𝑋, 𝑥) = 0.25 1 − 0.6026 = 0.3974
𝑥−𝜇
𝑃 ‰𝑍 < Š = .25
𝜎 (d) Find the length of time above which we
𝑃(𝑍 < −0.68) = 0.25 find the slowest 15% of the trips.
𝑥−𝜇
= −0.68
𝜎 Solution:
𝑥 = (−0.68 × 𝜎) + 𝜇
𝑃(𝑋 > 𝑥) = 0.15
𝑥 = (−0.67 × 15) + 200
1 − 𝑃(𝑋 > 𝑥) = 0.15
𝑥 = 189.95
𝑃(𝑋 < 𝑥) = 1 − 0.15 = 0.85
13. 5 A lawyer commutes daily from his 𝑥−𝜇
suburban home to his midtown office. The 𝑃 ‰𝑍 ≤ Š = 0.85
𝜎
average time for a one-way trip is 24 𝑃(𝑍 ≤ 1.04) = 0.85
minutes, with a standard deviation of 3.8 𝑥−𝜇
minutes. Assume the distribution of trip = 1.04
𝜎
times to be normally distributed. 𝑥 = (1.04 × 𝜎) + 𝜇
(a) What is the probability that a trip will 𝑥 = (1.04 × 3.8) + 24
take at least 1/2 hour? 𝑥 = 27.952
The following is the definition of the gamma The mean and variance of the exponential
distribution distribution are
The continuous random variable X has a 𝜇 = 𝛽 𝑎𝑛𝑑 𝜎 ! = 𝛽 !
gamma distribution, with parameters α and
β, if its density function is given by Relationship to the Poisson Process
The Poisson process allows for the
GAMMA DISTRIBUTION use of the discrete distribution. The Poisson
The continuous random variable X distribution is used to compute the
has a gamma distribution, with parameters α probability of specific numbers of “events”
and β, if its density function is given by during a particular period of time or span of
1 ;
+ space
𝑓(𝑥; 𝛼, 𝛽) = [ 𝑥 [+" 𝑒 \
𝛽 𝛤(𝛼)
if x> 0 𝑃(𝑋 > 𝑥) = 𝑒 +V;
Example 6:
In a certain city, the daily
consumption of electric power, in millions
of kilowatt-hours, is a random variable X
having a gamma distribution with mean μ =
6 and variance σ2 = 12.
(a) Find the values of α and β.
𝛼𝛽 ! = 12
Compiled and prepared by: ENGR. K.T. CABANLIG
Compiled and prepared by: ENGR. K.T. CABANLIG
Compiled and prepared by: ENGR. K.T. CABANLIG
FUNDAMENTAL SAMPLING An Observation
DISTRIBUTIONS AND DATA • Each observation in a population is
DESCRIPTION a value of a random variable X having some
Random Sampling probability distribution f(x).
The outcome of a statistical • For example, if one is inspecting
experiment may be recorded either as a items coming off an assembly line for
numerical value or as a descriptive detect, then each observation in the
representation. population might be a value 0 or 1 of the
In this chapter, we focus on sampling Bernoulli random variable X with
from distributions or populations and study probability distribution
such important quantities as the sample
mean and sample variance. 𝑏(𝑥; 1, 𝑝) = 𝑝 ; 𝑞"+; , 𝑥 = 0, 1
where 0 indicates a non-defective item and 1
Population indicates a defective one. p is the probability
Consists of the totality of the of any item being defective and 𝑞 = 1 − 𝑝.
observations with which we are concerned. • When we refer to the population
• The totality of observations, f(x), i.e, binomial or normal distributions,
whether their number be finite or infinite, we mean a population whose observations
constitutes what we call a population. are values of a random variable having the
• The word population previously probability distribution f(x).
referred to observations obtained from
statistical studies about people.
• Today, statisticians use the term to Sample
refer to observations relevant to anything of A subset of a population.
interest, whether it be groups of people, • In the statistical inference,
animals, or all possible outcomes from some statisticians are interested in arriving at
complicated biological or engineering conclusions concerning a population when it
system. is impossible or impractical to observe the
entire set of observations that make up the
Populations: Examples population.
• If there 600 students in the school • We must depend on a subset of
whom we classified according to blood type, observations from the population to help us
we say that we have a population of size make inferences concerning that same
600. population.
• The number of the cards in a deck, • If our inferences are to be valid, we
the heights of residents in a city, and the must obtain samples that are representative
lengths of cars in a parking lot are examples of the population. • Any sampling procedure
of populations with finite number. The total that produces inferences that consistently
number of observations is also a finite overestimate or consistently under-estimate
number. some characteristic of the population is said
• The observations obtained by to be biased.
measuring the atmospheric pressure every
day or all measurements of the depth of a Random sample
lake are examples of populations whose Let 𝑋" , 𝑋! , . . . , 𝑋' be n independent
sizes are infinite. random variables, each having the same
probability distribution 𝑓(𝑥). Define
Example 2:
Find the variance of the data 3, 4, 5, Central Limit Theorem
6, 6, and 7, representing the number of trout The Central Limit Theorem states
caught by a random sample of 6 fishermen that the sampling distribution of the sample
on June 19, 1996, at Lake Muskoka. means approaches a normal distribution as
the sample size gets larger — no matter
Solution: what the shape of the population
1 13 distribution. This fact holds especially true
𝑠! = [(6)(171) − (31)! ] = for sample sizes over 30. All this is saying is
(6)(5) 6
that as you take more samples, especially
13 large ones, your graph of the sample
𝑠 = Â = 1.47 means will look more like a normal
6
distribution.
𝑅 =7−3=4
Theorem 2. If X is the mean of a random
sample of size n taken from a population
Sampling Distribution of Mean
with mean µ and variance 𝝈𝟐 , then the
The probability distribution of a
limiting form of the distribution of Z
statistic is called a sampling distribution.
• Since a statistic is a random
𝑋Ž − 𝜇
variable that depends only on the observed 𝑍= 𝜎
samples, it must have a probability -
distribution. √𝑛
as n → ∞, is the standard normal
• The sampling distribution of a
distribution N(0,1).
statistic depends on the distribution of the
• If n ≥ 30, the normal approximation
population, the size of the samples, and the
will be satisfactory regardless of population
method of choosing the samples.
shape.
• If n < 30, the approximation is
Sample Mean
good only if the population is not too
• Suppose a random sample of n
different from a normal distribution.
observations, 𝑋" , 𝑋! , . . . , 𝑋' , is taken from a
• If the population is known to be
normal distribution with mean µ and
normal, the sampling distribution of 𝑋Ž is
variance 𝜎 ! , 𝑖. 𝑒. 𝑋5 ∼ 𝑁(µ, 𝜎 ! ), 𝑖 =
normal for any size of n.
1, . . . , 𝑛.
• The sample mean
'
1 1
𝑋Ž = (𝑋" + 𝑋! +··· +𝑋' ) = € 𝑋5
𝑛 𝑛
56"
t-Distribution
Its applications revolve around
inferences on a population mean or the
difference between two population means.
Use of the Central Limit Theorem and the
normal distribution is certainly helpful in
this context. However, it was assumed that
the population standard deviation is known.
Compiled and prepared by: ENGR. K.T. CABANLIG
𝑡...- leaves an area of 0.05 to the right, and
−𝑡...!- leaves an area of 0.025 to the left,
we find a total area of
1 − 0.05 − 0.025 = 0.925
𝑏𝑒𝑡𝑤𝑒𝑒𝑛 − 𝑡...!- 𝑎𝑛𝑑 𝑡...-
Hence
𝑃(−𝑡...!- < 𝑇 < 𝑡...- ) = 0.925
F-Distribution
The F-distribution finds enormous
The t-distribution curves for v = 2, 5, and ∞. application in comparing sample variances.
Applications of the F-distribution are found
in problems involving two or more samples.
Let U and V be two independent
random variables having chi-squared
distributions with v1 and v2 degrees of
freedom, respectively. Then the distribution
of the random variable
𝑈
𝑣"
𝐹 =
𝑉
𝑣!
is given by the density function
Symmetry property (about 0) of the t- S!
𝑣 + 𝑣! 𝑣" b ! c
distribution. 𝛤Í " Ή Š
2 𝑣!
ℎ(𝑓) = 𝑣" 𝑣
𝛤 ‰ 2 Š 𝛤 ‰ 2! Š
The t-distribution is used extensively S!
in problems that deal with inference about 𝑓 ! +"
− S! 8S"
the population mean. or in problems that 1 + 𝑣" 𝑓 !
involve comparative samples (i.e., in cases { 2 |
where one is trying to determine if means 𝐼𝑓 𝑓 > 0,
from two samples are significantly
different). ℎ(𝑓) = 0, 𝐼𝑓 𝑓 ≤ 0
Example 4: This is known as the F-distribution with v1
The t-value with v = 14 degrees of freedom and v2 degrees of freedom (d.f.).
that leaves an area of 0.025 to the left, and
therefore an area of 0.975 to the right, is
𝑡..FE- = −𝑡...!- = −2.145
Example 5:
Find 𝑃(−𝑡...!- < 𝑇 < 𝑡...- )
Solution:
Compiled and prepared by: ENGR. K.T. CABANLIG
The lengths of time, in minutes, that 10
patients waited in a doctor’s office before
receiving treatment were recorded as
follows: 5, 11, 9, 5, 10, 15, 6, 10, 5, and 10.
Treating the data as a random sample, find
(a) the mean;
5 + 11 + 9 + 5 + 10 + 15 + 6 + 10 + 5 + 10
=
10
= 8.6 𝑚𝑖𝑛
(b) the median;
= 5, 5, 5, 6, 9, 10, 10, 10, 11, 15
Typical F-distributions 9 + 10
= = 9.5 𝑚𝑖𝑛
2
(c) the mode.
5 ( 3 times)& 10 ( 3 times)
Example 2:
The reaction times for a random sample of 9
subjects to a stimulant were recorded as 2.5,
3.6, 3.1, 4.3, 2.9. 2.3, 2.6, 4.1, and 3.4
seconds. find
(a) the range;
𝑅𝑎𝑛𝑔𝑒 = 4.3 − 2.3 = 2
(b) the standard deviation.
Illustration of the fα for the Fdistribution.
'
1
Theorem 4: Writing 𝒇𝜶(𝒗𝟏 , 𝒗𝟐 ) for fα with 𝑆! = €(𝑋𝑖 − 𝑋Ž )!
𝑛−1
𝒗𝟏 𝒂𝒏𝒅 𝒗𝟐 degrees of freedom, we obtain 56"
1 (2.5 − 3.2)! + (3.6 − 3.2)! + (3.1 − 3.2)!
𝑓" − 𝛼(𝑣" , 𝑣! ) = (4.3 − 3.2)! + (2.9 − 3.2)! + (2.3 − 3.2)! +
𝑓[ (𝑣! , 𝑣" )
(2.6 − 3.2)! + (4.1 − 3.2)! + (3.4 − 3.2)!
=
The F-Distribution with Two Sample 9−1
Variances = 0.4975
𝜎 = √0.4975 = 0.7053
Theorem 5: If 𝑺𝟐𝟏 𝒂𝒏𝒅 𝑺𝟐𝟐 are the variances
of independent random samples of size
𝒏𝟏 𝒂𝒏𝒅 𝒏𝟐 taken from normal populations Example 3:
with variances 𝝈𝟐𝟏 𝒂𝒏𝒅 𝝈𝟐𝟐 , respectively, The numbers of incorrect answers on a true-
then false competency test for a random sample
𝑆"! /𝜎"! 𝜎!! 𝑆"! of 15 students were recorded as follows: 2,
𝐹 = ! ! = ! ! 1, 3, 0, 1, 3, 6, 0, 3, 3, 5, 2, 1, 4, and 2.
𝑆! /𝜎! 𝑆! 𝜎"
calculate the variance using the formula
has an F-distribution with v1 = n1 − 1 and
v2 = n2 − 1 degrees of freedom Solution:
𝑥̅ = 2.4
Example 1: (a) of form
𝑃(𝑋Ž < 775) = 𝑃(𝑍 < −2.5) = 0.0062 (d) Find 𝑃(𝑇 > −𝑡...!- )
= 1 − 0.025 = 0.975
Example 5:
For a chi-squared distribution, find Example 8:
! A manufacturing firm claims that the
(a) 𝜒....- when v = 5;
batteries used in their electronic games will
= 16.750
last an average of 30 hours. To maintain this
Solution:
0.6 + 0.7 + 0.7 + 0.3 + 0.4 + 0.5
𝑥̅ = 0.4 + 0.2
8
= 0.475
1
𝑠! = [(0.6 − 0.475)!
8−1
+ (0.7 − 0.475)!
+ (0.7 − 0.475)!
+ (0.3 − 0.475)!
+ (0.4 − 0.475)!
+ (0.5 − 0.475)!
+ (0.4 − 0.475)!
+ (0.2 − 0.475)! = 0.0336
𝑠 = 0.183
0.475 − 0.5
𝑡= = −0.386
0.183
√8