Probability
Probability
It is a truth very certain that, when it is not in our power to determine what is true, we ought to follow what is most probable
- Rene Descartes
Probability Models
Concerned with the study of random (or chance) phenomena Random experiment: An experiment that can result in different outcomes, even though it is repeated in the same manner every
time
Managerial Implication: Due to this random nature additional capacities for any setup are required
Although random, certain statistical regularities are to be captured to form the model Model: An abstraction of real world problem
Sample Space
A set of all possible outcomes of a random experiment Denoted by S Examples:
Speed of a train at a given time, with no other additional information: S = {any value between 0 to infinity}
4
Out of the previous examples, which ones are discrete sample spaces???
Events
A collection of certain sample points A subset of the sample space Denoted by E Examples:
Getting an odd number in dice throwing experiment S = {1, 2, 3, 4, 5, 6}; E = {1, 3, 5} Getting a defective part within the first three inspected parts S = {d, gd, ggd, gggd, ggggd, }; E = {d, gd, ggd} Event of a component not failing before time t: S = [0, ); E = [t, )
Events (cont)
Complement of an event E1: E1 = S E1 Intersection of two events: E3 = E1 Union of two events: E4 = E1 U E2 U Two events A and B are mutually exclusive if: A B= U E2
B = and A U B = S
S B U
B
S A
AUB
B D
Defective
NotDefective
Head
Tail
Defective
NotDefective
Head
Tail Head
Tail
Defective
NotDefective
Probability
Probability of an event represents the relative likelihood that performance of the experiment will result in occurrence of that event
P(A) = 0.5
B = Getting two heads in two successive tosses of a coin P(B) = 0.25
10
The objective of inferential statistics is to make an inference about a population of interest based on the information obtained from the sample from that population
11
Probability
Population
12
Relative Frequency Approach: probability based on long term relative frequency --- requires a large amount of historical data
Subjective Probability: probability based on ones judgement (e.g. how may days will it take for a consignment to reach from Indore to Delhi?)
13
14
Addition Rules
If A and B are any two events then, P(A U B) = P(A) + P(B) P(A U B)
U U
B
C)
15
An Example
16
An Example
What is the sample space? Which are the simple events? What is the probability of each simple event?
17
An Example
You have pulled out a slice of pizza that has tomato on it. What is the probability that your slice will have mushrooms?
18
Conditional Probability
The conditional probability of A given B is:
P(A|B) = P(A
B) / P(B)
19
Multiplication Rule
The conditional probability rule can be rewritten to get the multiplication rule: U P(A B) = P(A|B) x P(B) = P(B|A) x P(A)
20
Example
A consulting firm is bidding for two jobs, one with each of two large multinational corporations. The company executives estimate that the probability of obtaining the consulting job with
21
Example
In a standard government run lottery in Europe and North America, you choose 6 out of 49 numbers (1 through 49). You win the biggest prize if these 6 are drawn. (The prize money is divided
between all those who choose the lucky numbers. If no one wins,
then most of the prize money is put back into next week's lottery). You are offered two tickets A = (1,2,3,4,5,6) or B = (39,36,32,21,14, and 3). Do you prefer A, B, or are you indifferent between the two?
22
An Example
23
Independence
Two events are said to be independent if occurrence of one has no effect on the chances for the occurrence of the other.
24
Independence
Two events A and B are independent if any one of the following equivalent statements are true: P(A|B) = P(A)
P(B|A) = P(B)
U P(A B) = P(A) x P(B)
25
A couple has two girls. What is the chance of delivering a baby girl
for the third time?
26
P(Loan 3 defaults) = p3
If the three loans are from three different companies, but in the same business, example suppliers of auto parts. Loan repayment problem with one of the customers is very likely an indication of similar problems with remaining two customers. Is the assumption of independence valid in such a case?
27
28
independent.
29
First three conditions simply assert that any two events are independent, a property known as pairwise independence. The fourth condition is also important and does not automatically follow from the first three. Conversely, the fourth condition does not imply the first three
31
Education
Poor (<$25K) 10.51 10.93 6.36 2.23 0.62 0.16 0.09 30.9
Lower Middle ($25K to ($50K to $50K) $75K) 4.26 10.57 8.36 4.16 1.26 0.22 0.21 29.04 1.32 5.84 5.84 4.15 1.36 0.29 0.29 19.09
Upper (>$75K) 0.68 3.87 5.23 6.57 2.84 0.99 0.80 20.98
Row total
No HS diploma High school grad. Some college Bachelors deg. Masters deg. Professional deg. Doctorate Column total
All cells show percentages Sample size is 98 million households Source: Statistics for Business Decision Making and Analysis by Stine and Foster 33
In general, Given k mutually exclusive and exhaustive events S1, S2, Sk and any other event A, the probability of event A can be expressed as:
P(A) = P(A
S1) + P(A
P(A) = P(A
B) + P(A
Bc)
S2) + + P(A
Sk )
34
35
Bayes Rule
36
Bayes Rule
P(Si) is called as the prior probability of Si because it does not take into account any other behaviour and is based on historical data
In practice, Bayes Rule can be applied to re-evaluate (or update) the prior belief of probabilities of S1, S2, , Sk, when new information is obtained (as a result of event A happening)
37
output, and that in City 2 the remaining 40%. The central quality
assurance reported that 5% of the bottles produced in City 1 plant were defective. Likewise, 10% of the bottles from the City 2 plant were defective. The Controller of Operations for ABC Plastics wants to apportion
the cost of poor quality fairly between the two plants. How can that be
done?
38
60%.
Usually in this kind of process, after making an initial presentation to the potential client by the vendor, the client organization most often asks the vendor to make a repeat presentation to the higher authorities from the
client side. Historically suppose for this particular client, 70% of the
successful vendors had been asked to do a second presentation. However, 50% of the unsuccessful vendors were also asked to do the second presentation. Suppose the startup firm in question has been asked to do a second presentation. What is the revised estimate of the probability of selling the application using the additional information in the previous paragraph?
39
3. It is slightly more likely than not, that patient does have the strep throat.
4. It is very much more likely than not, that patient does have the strep throat.
40
sand. Behind the third door, there is the prize, a car. Once the contestant
has picked one of the doors, the host of the game show (who knows where the prize is) opens one of remaining doors and shows that there is sand behind it. The host then gives the contestant a chance to switch to
the remaining unopened door. What should the contestant do, continue
with the same door or switch to the remaining door?
41
Additional Examples
Samples of emission from three suppliers are classified for conformance to air-quality specifications. The results from 100 samples are summarized as follows:
Confirms
Supplier 1 Supplier 2 Supplier 3 22 25 30
Not confirms
8 5 10
Let A denote the event that the sample is from supplier 2 and let B denote the event that the sample confirms the specifications. If a sample is selected at random, determine the following probabilities:
(a) P(A),
(b) P(B),
B), (e) P(A U B),
(c) P(A),
(f) P(A U B)
(d) P(A
42
Additional Examples
A lot contains 15 castings from a local supplier and 25 castings from a supplier from the next state. Two castings are selected randomly without replacement from the lot of 40. Let event A be
that the first casting is selected from the local supplier and the
event B be that the second casting is selected from the local supplier. Determine the following:
P(A
P(A U B)
P(A) P(B|A) B)
43
Additional Examples
Software to detect fraud in customer phone cards tracks the number of metropolitan areas where the calls originate each day. It is found that 1% of legitimate callers originate calls from more
44
Additional Examples
A business group owns three five-star hotels (say, A, B, and C) in India. By studying the past behavior of the revenue obtained from the three hotels month by month, it has been observed that the probability of
45