02 Probability
https://www.visualcapitalist.com/measuring-perceptions-of-uncertainty/
Probability notation
Where do probabilities come from?
Careful counting
Data
Subjective judgment
Other probabilities
Example: draw a card at random from a standard 52-card deck.
What is $P(\text{diamond})$?
$P = \frac{\#\text{ outcomes of interest}}{\text{total } \#\text{ of outcomes}} = \frac{Y}{N}$
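The counting rule above can be checked by listing every card and counting the diamonds. This is a minimal sketch; the names `ranks`, `suits`, `deck`, and `p_diamond` are illustrative choices, not part of the slides.

```python
from fractions import Fraction
from itertools import product

# Build a standard 52-card deck as (rank, suit) pairs.
ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suits = ["clubs", "diamonds", "hearts", "spades"]
deck = list(product(ranks, suits))

# Y = outcomes of interest, N = total outcomes.
diamonds = [card for card in deck if card[1] == "diamonds"]
p_diamond = Fraction(len(diamonds), len(deck))

print(len(diamonds), len(deck), p_diamond)  # 13 52 1/4
```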
What is $P(S \geq 9)$?
$P(S \geq 9) = \frac{10}{36} \approx 0.28$
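The 10 and the 36 can be reproduced by enumeration. The sketch below assumes that $S$ is the sum of two fair six-sided dice, which is consistent with the 36 total outcomes shown but is not stated explicitly here.

```python
from fractions import Fraction
from itertools import product

# Assumption: S is the sum of two fair six-sided dice (36 equally likely rolls).
rolls = list(product(range(1, 7), repeat=2))
favorable = [roll for roll in rolls if sum(roll) >= 9]

p_s_ge_9 = Fraction(len(favorable), len(rolls))
print(len(favorable), len(rolls), round(float(p_s_ge_9), 2))  # 10 36 0.28
```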
$P(\text{newborn baby is a girl}) = \frac{100}{206} \approx 0.485$
$P(\text{Joe will be in a car accident this year}) \approx 0.009$
Emma Pierson and Stanford colleagues analyzed racial bias across 100 million traffic stops.
The data set incorporates 21 state patrol agencies and 35 municipal police departments over 10 years.
Each panel shows a different time for the onset of dusk (end of civil twilight).
$P(\text{Williams wins another Wimbledon}) = 0.3$? $= 0.2$?
One person believes $P = 0.2$; the other believes $P = 0.3$.
Give me $10.
If Serena wins another Wimbledon,
I’ll repay your $10…and I’ll pay you
another $30.
Expected payoff (using $P = 0.2$):
$E = 10 \times 0.8 - 30 \times 0.2 = 8 - 6 = 2$
Expected payoff (using $P = 0.3$):
$E = 30 \times 0.3 - 10 \times 0.7 = 9 - 7 = 2$
OK!
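Both sides of the bet can be checked with one small function. This is a sketch only: `expected_payoff` and its parameter names are hypothetical helpers, and exact fractions are used to avoid floating-point rounding.

```python
from fractions import Fraction

def expected_payoff(p_win, payoff_if_win, payoff_if_lose):
    """Expected payoff of a bet, given a personal probability that Serena wins."""
    return p_win * payoff_if_win + (1 - p_win) * payoff_if_lose

# Person offering the bet believes P = 0.2:
# keeps the $10 if Serena loses, pays out $30 if she wins.
e_offer = expected_payoff(Fraction("0.2"), payoff_if_win=-30, payoff_if_lose=10)

# Person accepting the bet believes P = 0.3:
# gains $30 if Serena wins, loses the $10 otherwise.
e_accept = expected_payoff(Fraction("0.3"), payoff_if_win=30, payoff_if_lose=-10)

print(e_offer, e_accept)  # 2 2
```

Each person expects to gain $2 under their own subjective probability, which is why both are willing to take the bet.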
$P(\text{Apple stock up next year}) = 0.7$? (Buy!)
Negation Rule
$P(\text{not } A) = 1 - P(A)$
Addition Rule
For any two events $A$ and $B$, $P(A \text{ or } B) = P(A) + P(B) - P(A, B)$
Combining probability rules
Hogwarts has 280 students: 70 each in Gryffindor, Ravenclaw, Hufflepuff,
and Slytherin. There are 40 first-year students, 10 in each house. A
student is randomly selected for the next Tri-Wizard Tournament.
What is 𝑃(𝑛𝑒𝑖𝑡ℎ𝑒𝑟 𝑌𝑒𝑎𝑟 1 𝑛𝑜𝑟 𝑅𝑎𝑣𝑒𝑛𝑐𝑙𝑎𝑤)?
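One way to work this example is to apply the addition rule to get $P(\text{Year 1 or Ravenclaw})$ and then the negation rule. The sketch below uses only the counts given in the problem; the variable names are illustrative.

```python
from fractions import Fraction

total = 280
p_year1 = Fraction(40, total)       # 40 first-year students
p_ravenclaw = Fraction(70, total)   # 70 Ravenclaw students
p_both = Fraction(10, total)        # 10 first-years in Ravenclaw

# Addition rule: P(Year 1 or Ravenclaw).
p_either = p_year1 + p_ravenclaw - p_both

# Negation rule: P(neither) = 1 - P(either).
p_neither = 1 - p_either

# Cross-check by direct counting: 3 other houses, 60 non-first-years in each.
p_neither_by_count = Fraction(3 * 60, total)

print(p_either, p_neither, round(float(p_neither), 3))  # 5/14 9/14 0.643
```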
What questions are there?
Conditional Probability
Perhaps the single most important fact to remember about conditional probabilities: in general,
$P(A \mid B) \neq P(B \mid A)$
Moral of the story:
Always be specific about what's on the left-hand side of the bar and what's on the right-hand side.
$P(\text{cancer}) = 15/200$
$P(\text{die}, \text{cancer}) = 3/200$
$P(\text{die} \mid \text{cancer}) = 3/15$
In general, we can estimate $P(A \mid B)$ as:
$P(A \mid B) = \frac{P(A, B)}{P(B)}$
Multiplication Rule
$P(A, B) = P(A \mid B) \times P(B)$
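Example: Mammograms
$P(\text{cancer}) = 15/200$
$P(\text{die}, \text{cancer}) = 3/200$
$P(\text{die} \mid \text{cancer}) = 3/15$
Using the multiplication rule:
$P(\text{die} \mid \text{cancer}) = \frac{P(\text{die}, \text{cancer})}{P(\text{cancer})} = \frac{3/200}{15/200} = \frac{3}{15} = 0.2$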
Or the re-arranged multiplication rule:
$P(\text{die}, \text{cancer}) = P(\text{die} \mid \text{cancer}) \times P(\text{cancer}) = \frac{3}{15} \times \frac{15}{200} = \frac{3}{200}$
Probability and Contingency Tables

                          Liked Saving Private Ryan    Didn't like it
Liked Band of Brothers               56                      6
Didn't like it                       14                     24

$P(\text{Likes SPR} \mid \text{Likes BB}) = \frac{56}{56 + 6} \approx 0.9$
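The same table also gives the conditional probability in the other direction, which makes a concrete case of $P(A \mid B) \neq P(B \mid A)$. A minimal sketch using the four counts above, with illustrative variable names:

```python
from fractions import Fraction

# Counts from the table: rows are Band of Brothers (BB),
# columns are Saving Private Ryan (SPR).
liked_bb_liked_spr = 56
liked_bb_disliked_spr = 6
disliked_bb_liked_spr = 14
disliked_bb_disliked_spr = 24

# Condition on the row: among people who liked BB, how many liked SPR?
p_spr_given_bb = Fraction(liked_bb_liked_spr,
                          liked_bb_liked_spr + liked_bb_disliked_spr)

# Condition on the column: among people who liked SPR, how many liked BB?
p_bb_given_spr = Fraction(liked_bb_liked_spr,
                          liked_bb_liked_spr + disliked_bb_liked_spr)

print(round(float(p_spr_given_bb), 3))  # 0.903
print(round(float(p_bb_given_spr), 3))  # 0.8
```

The numerator is the same cell in both cases; only the denominator changes, because the conditioning event changes.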
Some numbers
$130 million for one season of 10 episodes
7,000 individual period costumes
One $35,000 royal

Network TV
$400 million commissioning pilots
113 shows
13 make it to a second season
Jimmy Cliff 0 1 0 1 0
Pretty Lights 1 1 0 1 0
Lila Downs 0 0 0 0 1
Rebelution 1 1 0 1 0
Brand New 0 0 0 1 0
Frank Turner 1 0 1 0 0
Local Natives 0 0 0 1 0
Probability and Contingency Tables
Columns: Didn't play Lolla, Played Lolla
$P(\text{played ACL}, \text{played Lolla}) = \frac{77}{719 + 361 + 81 + 77} \approx 0.062$
$P(\text{played ACL} \mid \text{played Lolla}) = \frac{77}{361 + 77} \approx 0.176$
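The two festival calculations differ only in the denominator: the joint probability divides by all bands, the conditional probability divides by bands that played Lolla. The sketch below uses the four counts from the formulas; which of 719 and 81 belongs to which remaining cell is not recoverable here, so they are only used in the total.

```python
from fractions import Fraction

n_acl_and_lolla = 77        # played both ACL and Lolla
n_lolla_not_acl = 361       # played Lolla but not ACL
n_did_not_play_lolla = 719 + 81  # remaining two cells (assignment not needed here)

n_total = n_acl_and_lolla + n_lolla_not_acl + n_did_not_play_lolla
n_lolla = n_acl_and_lolla + n_lolla_not_acl

# Joint probability: out of all bands.
p_joint = Fraction(n_acl_and_lolla, n_total)

# Conditional probability: out of bands that played Lolla.
p_acl_given_lolla = Fraction(n_acl_and_lolla, n_lolla)

print(round(float(p_joint), 3))            # 0.062
print(round(float(p_acl_given_lolla), 3))  # 0.176
```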
Let’s try this out together with the music festivals data!