Common: Statistical Distributions
Common: Statistical Distributions
Common: Statistical Distributions
STATISTICAL
DISTRIBUTIONS
AN OVERVIEW
What?
In Statistics, a distribution is simply a way to understand how a
set of data points are spread over some given range of values.
Set of
data-points
Probability
or
Frequency
In other words, it shows how likely is it that any player from the
NBA is of a certain height. Most players are around the
mean/average height, fewer are much taller, or much shorter.
Mean height of
NBA Players
Probability
of any randomly
selected player
being of height x
Height (x)
Normal Distribution 2
The spread of the values in our population is measured using a
metric called standard deviation.
- 1 Mean + 1
SD Height SD
- 2 Mean + 2
SD Height SD
Say we know that for all players in the NBA, the mean height is 200cm
and the standard deviation is 7cm.
LeBron is 6cm taller than the mean (206cm - 200cm). Since the standard
deviation is 7cm, he is 0.86 standard deviations (6cm / 7cm) above the mean.
Mean LeBron
200cm 206cm
Our value of 0.86 standard deviations is called the z-score. This can be
converted to a percentile using the probability density function (or a lookup
table) giving us our answer. LeBron James is taller than 80.5% of
players in the NBA!
t-Distribution
Just like a normal distribution, a t-distribution is symmetrical around the
mean, and the breadth is based around the deviation within the data.
t-Distribution
t-Distribution (1 DF)
(5 DF)
Height
As the sample size, and thus the degrees of freedom gets larger, the t-
distribution tends towards a normal distribution - as with a larger sample
we’re more certain around estimating the true population statistics.
Binomial Distribution
A Binomial Distribution can end up looking a lot like the shape of a normal
distribution. The main difference is that instead of plotting continuous data,
it instead plots a distribution of two possible discrete outcomes for
example, the results from flipping a coin.
Imagine flipping a coin 10 times, and from those 10 flips, noting down how
many were "Heads". It could be any number between 1 and 10.
24.6%
20.5% 20.5%
Probability
of 11.7% 11.7%
outcome
4.4% 4.4%
1% 1%
0.1% 0.1%
0 1 2 3 4 5 6 7 8 9 10
Number of "Heads"
If the coin we are using is indeed fair (not biased to heads or tails) then the
distribution of outcomes should start to look the plot above. In the vast
majority of cases we get 4, 5, or 6 "heads" from each set of 10 flips, and the
likelihood of getting more extreme results is much more rare!
Bernoulli Distribution
The Bernoulli Distribution is a special case of the Binomial Distribution. It
considers only two possible outcomes, success or failure, true or false.
83.3%
Probability
of
outcome
16.7%
Yes No
Rolled a 6
Uniform Distribution
A Uniform Distribution is a distribution in which all events are equally
likely to occur.
Below, we’re looking at the results from rolling a die many, many times.
We’re looking at which number we got on each roll and tallying these up.
If we roll the die enough times (and the die is fair) we should end up with a
completely uniform probability where the chance of getting any outcome is
exactly the same
Probability
of 16.7% 16.7% 16.7% 16.7% 16.7% 16.7%
outcome
1 2 3 4 5 6
Result from dice roll
Poisson Distribution
A Poisson Distribution is a discrete distribution similar to the Binomial
Distribution (in that we’re plotting the probability of whole numbered
outcomes)
Unlike the other distributions we have seen however, this one is not
symmetrical - it is instead bounded between 0 and infinity
22.3%
20.9%
17.8%
Probability 13.0%
11.4%
of
outcome
6.1%
4.1%
2.8%
1.1%
0.4% 0.1% 0.01%
0 1 2 3 4 5 6 7 8 9 10 >>>>> Inf
A curriculum based on
input from hundreds of
leaders, hiring managers,
and recruiters
https://data-science-infinity.teachable.com
BUILD YOUR
PORTFOLIO
https://data-science-infinity.teachable.com
EARN THE
CERTIFICATION
https://data-science-infinity.teachable.com
LAND AN
AMAZING ROLE
https://data-science-infinity.teachable.com
Taught by former Amazon
& Sony PlayStation Data
Scientist Andrew Jones
What do DSI
students say?
"I had over 40 interviews without an offer.
After DSI I quickly got 7 offers including
one at KPMG and my amazing new role
at Deloitte!"
- Ritesh
- Keith
"I'm now at University, and my Data
Science related subjects are a piece of
cake after completing this course!
https://data-science-infinity.teachable.com