Week 5-8 Short Notes

The document provides comprehensive notes on continuous random variables, covering their functions, expected values, variances, and various inequalities. It also discusses joint densities, independence, empirical distributions, sample statistics, and limit theorems including the Central Limit Theorem. Additionally, it introduces specific distributions such as Gamma, Beta, and Cauchy, along with important results related to normal distributions and their properties.


Statistics for Data Science - 2

Week 5 Notes

1. Functions of continuous random variable:


Suppose X is a continuous random variable with CDF FX and PDF fX and suppose
g : R → R is a (reasonable) function. Then, Y = g(X) is a random variable with CDF
FY determined as follows:

• FY (y) = P (Y ≤ y) = P (g(X) ≤ y) = P (X ∈ {x : g(x) ≤ y})


• To evaluate the above probability
– Convert the subset Ay = {x : g(x) ≤ y} into intervals on the real line.
– Find the probability that X falls in those intervals.
– FY (y) = P (X ∈ Ay) = ∫_{Ay} fX (x) dx
• If FY has no jumps, you may be able to differentiate and find a PDF.

2. Theorem: Monotonic differentiable function


Suppose X is a continuous random variable with PDF fX . Let g(x) be monotonic for
x ∈ supp(X) with derivative g′(x) = dg(x)/dx. Then, the PDF of Y = g(X) is

fY (y) = fX (g⁻¹(y)) / |g′(g⁻¹(y))|
• Translation: Y = X + a
fY (y) = fX (y − a)
• Scaling: Y = aX
fY (y) = (1/|a|) fX (y/a)
• Affine: Y = aX + b
fY (y) = (1/|a|) fX ((y − b)/a)
• Affine transformation of a normal random variable is normal.
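As a quick numerical check of the change-of-variables formula above, here is a minimal Python sketch (assuming NumPy is available; the example Y = X² with X ∼ Exp(1) and the helper name f_Y are illustrative choices, not part of the notes). It compares the formula fY (y) = fX (g⁻¹(y))/|g′(g⁻¹(y))| with a Monte Carlo density estimate.

import numpy as np

rng = np.random.default_rng(0)

# X ~ Exp(1), Y = g(X) = X^2; g is monotonic on supp(X) = (0, inf)
x = rng.exponential(scale=1.0, size=1_000_000)
y = x ** 2

def f_Y(y):
    # Change-of-variables formula with g^{-1}(y) = sqrt(y), g'(x) = 2x,
    # and f_X(x) = exp(-x) for x > 0.
    x_inv = np.sqrt(y)
    return np.exp(-x_inv) / (2 * x_inv)

# Compare the formula with a Monte Carlo estimate of the density near y0
for y0 in [0.5, 1.0, 2.0]:
    h = 0.05
    mc_density = np.mean((y > y0 - h) & (y < y0 + h)) / (2 * h)
    print(f"y={y0}: formula {f_Y(y0):.4f}, Monte Carlo {mc_density:.4f}")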

3. Expected value of function of continuous random variable:


Let X be a continuous random variable with density fX (x). Let g : R → R be a function.
The expected value of g(X), denoted E[g(X)], is given by
E[g(X)] = ∫_{−∞}^{∞} g(x) fX (x) dx

whenever the above integral exists.

• The integral may diverge to ±∞ or may not exist in some cases.

4. Expected value (mean) of a continuous random variable:


Mean, denoted E[X] or µX or simply µ, is given by

E[X] = ∫_{−∞}^{∞} x fX (x) dx

5. Variance of a continuous random variable:


Variance, denoted Var[X] or σX² or simply σ², is given by

Var(X) = E[(X − E[X])²] = ∫_{−∞}^{∞} (x − µ)² fX (x) dx

• Variance is a measure of spread of X about its mean.


• Var(X) = E[X²] − (E[X])²

X                  E[X]          Var(X)
Uniform[a, b]      (a + b)/2     (b − a)²/12
Exp(λ)             1/λ           1/λ²
Normal(µ, σ²)      µ             σ²
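The rows of this table can be checked by direct integration. A minimal sketch, assuming SciPy and NumPy are available, verifies the Exp(λ) row numerically (the choice λ = 2 is arbitrary):

import numpy as np
from scipy.integrate import quad

lam = 2.0
f = lambda x: lam * np.exp(-lam * x)                 # Exp(lambda) density on x > 0

mean, _ = quad(lambda x: x * f(x), 0, np.inf)        # E[X]
second, _ = quad(lambda x: x ** 2 * f(x), 0, np.inf)
var = second - mean ** 2                             # Var(X) = E[X^2] - E[X]^2

print(mean, 1 / lam)       # both ~ 0.5  : E[X] = 1/lambda
print(var, 1 / lam ** 2)   # both ~ 0.25 : Var(X) = 1/lambda^2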

6. Markov’s inequality:
If X is a continuous random variable with mean µ and non-negative supp(X) (i.e. P (X <
0) = 0), then
P (X > c) ≤ µ/c
7. Chebyshev’s inequality:
If X is a continuous random variable with mean µ and variance σ², then

P (|X − µ| ≥ kσ) ≤ 1/k²
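Both inequalities are upper bounds and are often loose. A small simulation sketch, assuming NumPy, compares them with the actual probabilities for X ∼ Exp(1) (so µ = 1 and σ = 1); the values of c and k below are arbitrary:

import numpy as np

rng = np.random.default_rng(1)
x = rng.exponential(scale=1.0, size=1_000_000)   # Exp(1): mu = 1, sigma = 1
mu, sigma = 1.0, 1.0

# Markov: P(X > c) <= mu / c
for c in [2, 4, 8]:
    print(f"c={c}: P(X > c) ~ {np.mean(x > c):.4f}  <=  Markov bound {mu / c:.4f}")

# Chebyshev: P(|X - mu| >= k*sigma) <= 1 / k^2
for k in [2, 3, 4]:
    actual = np.mean(np.abs(x - mu) >= k * sigma)
    print(f"k={k}: P(|X-mu| >= k sigma) ~ {actual:.4f}  <=  Chebyshev bound {1 / k ** 2:.4f}")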

8. Marginal density: Let (X, Y ) be jointly distributed where X is discrete with range
TX and PMF pX (x).
For each x ∈ TX , we have a continuous random variable Yx with density fYx (y).
fYx (y) : conditional density of Y given X = x, denoted fY |X=x (y).

• Marginal density of Y
  – fY (y) = Σ_{x∈TX} pX (x) fY |X=x (y)

9. Conditional probability of discrete given continuous: Suppose X and Y are
jointly distributed with X ∈ TX being discrete with PMF pX (x) and conditional densi-
ties fY |X=x (y) for x ∈ TX . The conditional probability of X given Y = y0 ∈ supp(Y ) is
defined as

• P (X = x | Y = y0 ) = pX (x) fY |X=x (y0 ) / fY (y0 )
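A minimal worked sketch of this formula, assuming SciPy is available, with a hypothetical setup in which X takes the values 0 and 1 and Y given X = x is Normal(x, 1):

from scipy.stats import norm

# Hypothetical model: P(X=0) = 0.7, P(X=1) = 0.3, and Y | X = x ~ Normal(x, 1),
# so f_{Y|X=x}(y) = norm.pdf(y, loc=x, scale=1).
p_X = {0: 0.7, 1: 0.3}
y0 = 0.8                                            # observed value of Y

# f_Y(y0) = sum over x of p_X(x) * f_{Y|X=x}(y0)    (marginal density of Y)
f_Y = sum(p * norm.pdf(y0, loc=x, scale=1.0) for x, p in p_X.items())

# P(X = x | Y = y0) = p_X(x) * f_{Y|X=x}(y0) / f_Y(y0)
posterior = {x: p * norm.pdf(y0, loc=x, scale=1.0) / f_Y for x, p in p_X.items()}
print(posterior)                                    # the two probabilities sum to 1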

Statistics for Data Science - 2
Week 6 Notes
Continuous Random Variables

1. Joint density: A function f (x, y) is said to be a joint density function if


• f (x, y) ≥ 0, i.e. f is non-negative.
• ∫_{−∞}^{∞} ∫_{−∞}^{∞} f (x, y) dx dy = 1

2. 2D uniform distribution: Fix some (reasonable) region D in R² with total area |D|.
We say that (X, Y ) ∼ Uniform(D) if they have the joint density

fXY (x, y) = 1/|D| for (x, y) ∈ D, and fXY (x, y) = 0 otherwise.
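One common way to draw from Uniform(D) is rejection sampling from a bounding box. The sketch below, assuming NumPy, uses the unit disk as an illustrative choice of D; the helper name sample_uniform_disk is hypothetical:

import numpy as np

rng = np.random.default_rng(2)

def sample_uniform_disk(n):
    # Draw uniformly from the bounding square [-1, 1]^2 and keep only the
    # points that fall inside the unit disk D = {(x, y) : x^2 + y^2 <= 1}.
    pts = np.empty((0, 2))
    while len(pts) < n:
        cand = rng.uniform(-1, 1, size=(n, 2))
        keep = cand[cand[:, 0] ** 2 + cand[:, 1] ** 2 <= 1]
        pts = np.vstack([pts, keep])
    return pts[:n]

pts = sample_uniform_disk(100_000)
# Under Uniform(D), P((X, Y) in A) = area(A) / |D|; for the quarter disk
# A = {x > 0, y > 0} this is (pi/4) / pi = 1/4.
print(np.mean((pts[:, 0] > 0) & (pts[:, 1] > 0)))   # ~ 0.25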

3. Marginal density: Suppose (X, Y ) have joint density fXY (x, y). Then,
• X has the marginal density fX (x) = ∫_{y=−∞}^{∞} fXY (x, y) dy.
• Y has the marginal density fY (y) = ∫_{x=−∞}^{∞} fXY (x, y) dx.
  – In general, the marginals do not determine the joint density.
4. Independence: (X, Y ) with joint density fXY (x, y) are independent if
• fXY (x, y) = fX (x)fY (y)
– If independent, the marginals determine the joint density.
5. Conditional density: Let (X, Y ) be random variables with joint density fXY (x, y).
Let fX (x) and fY (y) be the marginal densities.
• For a such that fX (a) > 0, the conditional density of Y given X = a, denoted as
fY |X=a (y), is defined as

fY |X=a (y) = fXY (a, y) / fX (a)

• For b such that fY (b) > 0, the conditional density of X given Y = b, denoted as
fX|Y =b (x), is defined as

fX|Y =b (x) = fXY (x, b) / fY (b)
6. Properties of conditional density: Joint = Marginal × Conditional, for x = a and
y = b such that fX (a) > 0 and fY (b) > 0.

• fXY (a, b) = fX (a)fY |X=a (b) = fY (b)fX|Y =b (a)

Statistics for Data Science - 2
Week 7 Notes
Statistics from samples and Limit theorems

1. Empirical distribution:
Let X1 , X2 , . . . , Xn ∼ X be i.i.d. samples. Let #(Xi = t) denote the number of times
t occurs in the samples. The empirical distribution is the discrete distribution with
PMF
p(t) = #(Xi = t) / n
• The empirical distribution is random because it depends on the actual sample
instances.
• Descriptive statistics: Properties of empirical distribution. Examples:
– Mean of the distribution
– Variance of the distribution
– Probability of an event
• As the number of samples increases, the properties of the empirical distribution
should become close to those of the original distribution.
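For concreteness, a short sketch (assuming NumPy) builds the empirical PMF and its descriptive statistics from simulated rolls of a fair die; the die example and sample size are illustrative only:

import numpy as np
from collections import Counter

rng = np.random.default_rng(3)
samples = rng.integers(1, 7, size=1000)          # i.i.d. rolls of a fair six-sided die

# Empirical PMF: p(t) = #(X_i = t) / n
counts = Counter(samples.tolist())
n = len(samples)
empirical_pmf = {t: counts[t] / n for t in sorted(counts)}
print(empirical_pmf)                              # each value is close to 1/6

# Descriptive statistics of the empirical distribution
print("empirical mean:", samples.mean())          # close to 3.5
print("empirical variance:", samples.var())       # close to 35/12 ~ 2.92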

2. Sample mean:
Let X1 , X2 , . . . , Xn ∼ X be i.i.d. samples. The sample mean, denoted X, is defined to
be the random variable
X = (X1 + X2 + . . . + Xn ) / n
• Given a sampling x1 , . . . , xn , the value taken by the sample mean X is
  x = (x1 + x2 + . . . + xn )/n. Often, X and x are both called the sample mean.

3. Expected value and variance of sample mean:


Let X1 , X2 , . . . , Xn be i.i.d. samples whose distribution has a finite mean µ and variance
σ². The sample mean X has expected value and variance given by

E[X] = µ,    Var(X) = σ²/n
• Expected value of sample mean equals the expected value or mean of the distri-
bution.
• Variance of sample mean decreases with n.
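A quick simulation sketch, assuming NumPy, illustrates both facts for Exp(1) samples (µ = 1, σ² = 1); the replication count of 10,000 is arbitrary:

import numpy as np

rng = np.random.default_rng(4)
mu, sigma2 = 1.0, 1.0                             # Exp(1): mean 1, variance 1

for n in [10, 100, 1000]:
    # 10,000 independent sample means, each computed from n samples
    xbar = rng.exponential(scale=1.0, size=(10_000, n)).mean(axis=1)
    print(f"n={n}: E[Xbar] ~ {xbar.mean():.4f}, Var(Xbar) ~ {xbar.var():.5f}, "
          f"sigma^2/n = {sigma2 / n:.5f}")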
4. Sample variance:
Let X1 , X2 , . . . , Xn ∼ X be i.i.d. samples. The sample variance, denoted S², is defined
to be the random variable

S² = [(X1 − X)² + (X2 − X)² + . . . + (Xn − X)²] / (n − 1),

where X is the sample mean.

5. Expected value of sample variance:


Let X1 , X2 , . . . , Xn be i.i.d. samples whose distribution has a finite variance σ². The
sample variance S² = [(X1 − X)² + (X2 − X)² + . . . + (Xn − X)²]/(n − 1) has expected
value given by

E[S²] = σ²

• Values of the sample variance, on average, equal the variance of the distribution.


• Variance of sample variance will decrease with number of samples (in most cases).
• As n increases, sample variance takes values close to distribution variance.
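The n − 1 denominator is exactly what makes E[S²] = σ². A small sketch, assuming NumPy, compares the n − 1 and n denominators for Normal(0, 4) samples (an illustrative choice):

import numpy as np

rng = np.random.default_rng(5)
sigma2 = 4.0                                      # Normal(0, 4): variance 4
n = 10

data = rng.normal(0.0, np.sqrt(sigma2), size=(100_000, n))
s2_unbiased = data.var(axis=1, ddof=1)            # divide by n - 1 (the sample variance S^2)
s2_biased = data.var(axis=1, ddof=0)              # divide by n

print("average of S^2 (n-1 denominator):", s2_unbiased.mean())   # ~ 4.0 = sigma^2
print("average with n denominator:      ", s2_biased.mean())     # ~ 4 * (n-1)/n = 3.6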

6. Sample proportion:
The sample proportion of A, denoted S(A), is defined as

S(A) = (number of Xi for which A is true) / n
• As n increases, values of S(A) will be close to P (A).
• Mean of S(A) equals P (A).
• Variance of S(A) tends to 0.

7. Weak law of large numbers:


Let X1 , X2 , . . . , Xn ∼ i.i.d. X with E[X] = µ, Var(X) = σ².
Define the sample mean X = (X1 + X2 + . . . + Xn )/n. Then,

P (|X − µ| > δ) ≤ σ²/(nδ²)

Statistics for Data Science - 2
Week 8 Notes
Statistics from samples and Limit theorems

1. Moment generating function (MGF):


Let X be a zero-mean random variable (E[X] = 0). The MGF of X, denoted MX (λ),
is a function from R to R defined as
MX (λ) = E[e^{λX}]

MX (λ) = E[e^{λX}]
       = E[1 + λX + λ²X²/2! + λ³X³/3! + . . .]
       = 1 + λE[X] + (λ²/2!) E[X²] + (λ³/3!) E[X³] + . . .

That is, the coefficient of λᵏ/k! in the MGF of X gives the kth moment of X.
• If X ∼ Normal(0, σ²), then MX (λ) = e^{λ²σ²/2}
• Let X1 , X2 , . . . , Xn ∼ i.i.d. X and let S = X1 + X2 + . . . + Xn . Then
  MS (λ) = (E[e^{λX}])ⁿ = [MX (λ)]ⁿ
  That is, the MGF of a sum of independent random variables is the product of the
  individual MGFs.

2. Central limit theorem: Let X1 , X2 , . . . , Xn ∼ i.i.d. X with E[X] = µ, Var(X) = σ².

Define Y = X1 + X2 + . . . + Xn . Then,

(Y − nµ)/(σ√n) ≈ Normal(0, 1).
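A simulation sketch, assuming NumPy, illustrates the statement with Uniform[0, 1] samples (mean 1/2, variance 1/12); the sample sizes below are arbitrary:

import numpy as np

rng = np.random.default_rng(6)
n = 100
mu, sigma = 0.5, np.sqrt(1 / 12)                  # Uniform[0, 1]: mean 1/2, sd 1/sqrt(12)

# 50,000 standardized sums (Y - n*mu) / (sigma * sqrt(n))
y = rng.uniform(0, 1, size=(50_000, n)).sum(axis=1)
z = (y - n * mu) / (sigma * np.sqrt(n))

print("mean ~ 0:", z.mean())
print("variance ~ 1:", z.var())
# A tail probability close to the standard normal value P(Z > 1.96) ~ 0.025
print("P(Z > 1.96) ~", np.mean(z > 1.96))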

3. Gamma distribution:
X ∼ Gamma(α, β) if PDF fX (x) ∝ x^(α−1) e^(−βx) , x > 0
• α > 0 is a shape parameter.
• β > 0 is a rate parameter.
• θ = 1/β is a scale parameter.
• Mean, E[X] = α/β
• Variance, Var(X) = α/β²
4. Beta distribution:
X ∼ Beta(α, β) if PDF fX (x) ∝ x^(α−1) (1 − x)^(β−1) , 0 < x < 1

• α > 0, β > 0 are the shape parameters.


• Mean, E[X] = α/(α + β)
• Variance, Var(X) = αβ/[(α + β)²(α + β + 1)]

5. Cauchy distribution:
X ∼ Cauchy(θ, α²) if PDF fX (x) ∝ (1/π) · α/(α² + (x − θ)²)

• θ is a location parameter.
• α > 0 is a scale parameter.
• Mean and variance are undefined.
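The Gamma and Beta moment formulas above can be cross-checked against SciPy, keeping in mind that scipy.stats.gamma is parameterized by the scale θ = 1/β rather than the rate β; the parameter values below are arbitrary:

from scipy import stats

alpha, beta = 3.0, 2.0

# Gamma(alpha, beta) with beta a rate: pass scale = 1/beta to scipy
g = stats.gamma(a=alpha, scale=1 / beta)
print(g.mean(), alpha / beta)              # both 1.5  : E[X] = alpha/beta
print(g.var(), alpha / beta ** 2)          # both 0.75 : Var(X) = alpha/beta^2

# Beta(alpha, beta)
b = stats.beta(alpha, beta)
print(b.mean(), alpha / (alpha + beta))                                     # both 0.6
print(b.var(), alpha * beta / ((alpha + beta) ** 2 * (alpha + beta + 1)))   # both 0.04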

6. Some important results:

• Let Xi ∼ Normal(µi , σi²) be independent and let Y = a1 X1 + a2 X2 + . . . + an Xn .
  Then
  Y ∼ Normal(µ, σ²)
  where µ = a1 µ1 + a2 µ2 + . . . + an µn and σ² = a1²σ1² + a2²σ2² + . . . + an²σn².
  That is, a linear combination of independent normal random variables is again
  normally distributed.

• Sum of n i.i.d. Exp(β) is Gamma(n, β).


 
• Square of Normal(0, σ²) is Gamma(1/2, 1/(2σ²)).
• Suppose X, Y ∼ i.i.d. Normal(0, σ²). Then, X/Y ∼ Cauchy(0, 1).

• Suppose X ∼ Gamma(α, k), Y ∼ Gamma(β, k) are independent random variables,
  then X/(X + Y ) ∼ Beta(α, β) (checked numerically in the sketch after this list).

• Sum of n independent Gamma(α, β) is Gamma(nα, β).


 
• If X1 , X2 , . . . , Xn ∼ i.i.d. Normal(0, σ²), then
  X1² + X2² + . . . + Xn² ∼ Gamma(n/2, 1/(2σ²)).

 
• Gamma(n/2, 1/2) is called the Chi-square distribution with n degrees of freedom,
  denoted χ²ₙ.

• Suppose X1 , X2 , . . . , Xn ∼ i.i.d. Normal(µ, σ²). Suppose that X and S² denote
  the sample mean and sample variance, respectively. Then
  (i) (n − 1)S²/σ² ∼ χ²ₙ₋₁
  (ii) X and S² are independent.
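The following simulation sketch, assuming NumPy, numerically checks two of the results above (the sum of i.i.d. exponentials and the Gamma ratio giving a Beta) by comparing simulated means and variances with the closed-form values; the parameter choices are arbitrary:

import numpy as np

rng = np.random.default_rng(7)

# Sum of n i.i.d. Exp(beta) is Gamma(n, beta): compare mean and variance
n, beta = 5, 2.0
s = rng.exponential(scale=1 / beta, size=(200_000, n)).sum(axis=1)
print(s.mean(), n / beta)            # Gamma(n, beta) mean     = n / beta   = 2.5
print(s.var(), n / beta ** 2)        # Gamma(n, beta) variance = n / beta^2 = 1.25

# X ~ Gamma(a, k), Y ~ Gamma(b, k) independent  =>  X/(X + Y) ~ Beta(a, b)
a, b, k = 2.0, 3.0, 1.0
x = rng.gamma(shape=a, scale=1 / k, size=200_000)
y = rng.gamma(shape=b, scale=1 / k, size=200_000)
r = x / (x + y)
print(r.mean(), a / (a + b))                                   # both ~ 0.4
print(r.var(), a * b / ((a + b) ** 2 * (a + b + 1)))           # both ~ 0.04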

