
Lecture 13. EM Algorithm (After-class)

Notes: As we saw before, many estimation problems require maximization of the probability
distribution with respect to an unknown parameter, for example when computing ML estimates of the
parameters or MAP estimates of the hidden random variables. For many interesting problems,
differentiating the probability distribution with respect to the parameter of interest and setting the
derivative to zero results in a nonlinear equation that does not have a closed-form solution. In such
cases, we have to resort to numerical optimization.

Maximum likelihood estimation with unknown parameters


Example: Let w be a Bernoulli r.v. with w = 1 w.p. δ and w = 0 w.p. 1 − δ, and let y (a binary r.v.) be a noisy observation of w:

P_{Y|W}(y|w) = ϵ if y ≠ w, and 1 − ϵ if y = w.

Let w = [w_1, ⋯, w_n]^T be n i.i.d. samples of w, and y = [y_1, ⋯, y_n]^T the corresponding observations.

We do not observe w, but observe y.

Our goal is to estimate δ and ϵ.

ML: P_Y(y; ϵ, δ) = ∏_{i=1}^n P_{Y_i}(y_i; ϵ, δ)
= ∏_{i=1}^n [P_{Y_i|W_i}(y_i | w_i = 0; ϵ, δ) P_{W_i}(0; ϵ, δ) + P_{Y_i|W_i}(y_i | w_i = 1; ϵ, δ) P_{W_i}(1; ϵ, δ)],
where each w_i is a latent r.v.
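For concreteness, each factor of this product works out to a two-term mixture over the hidden value of w_i:

P_{Y_i}(1; ϵ, δ) = (1 − ϵ)δ + ϵ(1 − δ),  P_{Y_i}(0; ϵ, δ) = ϵδ + (1 − ϵ)(1 − δ).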

(ϵ*, δ*) = arg max_{ϵ,δ} P_Y(y; ϵ, δ)

There is no closed-form solution for the optimal ϵ, δ.



An algorithm to compute the optimal parameters in the presence of hidden variables ⇒ the EM Algorithm.

General setup
Assume the complete data z is generated by P_Z(⋅; x), where x is the parameter to estimate.

We can only observe y = g(z), the observed data, where g is a deterministic function.


In our example: z = [w, y], y = y, x = [ϵ, δ].

The goal is to find x* = arg max_x P_Y(y; x) = arg max_x log P_Y(y; x).

Note that P_Z(z; x) = ∑_y P_{Z|Y}(z|y; x) · P_Y(y; x) = P_{Z|Y}(z|g(z); x) · P_Y(g(z); x), since each term of the sum is the joint P_{Z,Y}(z, y; x), which is nonzero only when y = g(z).


⇒ Letting y = g(z): log P_Y(y; x) = log P_Z(z; x) − log P_{Z|Y}(z|y; x)

Take the expectation of both sides with respect to P_{Z|Y}(z|y; x′):


LHS = ∑_{z′} P_{Z|Y}(z′|y; x′) log P_Y(y; x) = (∑_{z′} P_{Z|Y}(z′|y; x′)) log P_Y(y; x) = log P_Y(y; x)

RHS = E[log P_Z(z; x) | Y = y, x = x′] − E[log P_{Z|Y}(z|y; x) | Y = y, x = x′]

where we define
U(x, x′) ≜ ∑_{z′} P_{Z|Y}(z′|y; x′) log P_Z(z′; x) and V(x, x′) ≜ −∑_{z′} P_{Z|Y}(z′|y; x′) log P_{Z|Y}(z′|y; x)

⇒ log P_Y(y; x) = U(x, x′) + V(x, x′), ∀x′.

Lemma: V(x, x′) ≥ V(x′, x′)

Proof: V(x, x′) − V(x′, x′) = ∑_z P_{Z|Y}(z|y; x′) log [P_{Z|Y}(z|y; x′) / P_{Z|Y}(z|y; x)] = D(P_{Z|Y; x′} ∥ P_{Z|Y; x}) ≥ 0

⇒ If we can find x ≠ x′ such that U(x, x′) ≥ U(x′, x′), then
log P_Y(y; x) = U(x, x′) + V(x, x′) ≥ U(x′, x′) + V(x′, x′) = log P_Y(y; x′)



EM Algorithm:

1. Initialization: choose an initial estimate x̂^(0).

2. Repeat until convergence:

   E-step: given the previous estimate x̂^(n), compute
   U(x, x̂^(n)) = E[log P_Z(z; x) | Y = y, x = x̂^(n)]

   M-step: find x̂^(n+1) maximizing U(·, x̂^(n)):
   x̂^(n+1) = arg max_x U(x, x̂^(n))

⇒ U(x̂^(n+1), x̂^(n)) ≥ U(x̂^(n), x̂^(n))

We therefore obtain a sequence x̂^(0), x̂^(1), ⋯ such that

P_Y(y; x̂^(0)) ≤ P_Y(y; x̂^(1)) ≤ ⋯

Since P_Y(y; x) ≤ 1, this is a non-decreasing bounded sequence ⇒ it must converge.

The EM Algorithm converges to a stationary point of the likelihood function, i.e., if x* is the convergence point, then ∂P_Y(y; x)/∂x |_{x=x*} = 0.
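As a concrete illustration, below is a minimal numerical sketch of these two steps applied to the running example (estimating ϵ and δ from the noisy observations y). The E-step and M-step update formulas in the code are derived from the model as stated above but are not spelled out in the notes, so read them as an illustrative sketch rather than the lecture's own derivation.

```python
import numpy as np

def em_bernoulli_noise(y, eps0=0.3, delta0=0.4, n_iter=100):
    """EM sketch for the noisy-Bernoulli example: w_i ~ Bern(delta) is hidden,
    y_i equals w_i with probability 1 - eps and is flipped with probability eps."""
    eps, delta = eps0, delta0
    for _ in range(n_iter):
        # E-step: posterior gamma_i = P(w_i = 1 | y_i; eps, delta)
        p_y_given_w1 = np.where(y == 1, 1 - eps, eps)
        p_y_given_w0 = np.where(y == 0, 1 - eps, eps)
        gamma = delta * p_y_given_w1 / (delta * p_y_given_w1 + (1 - delta) * p_y_given_w0)
        # M-step: maximize the expected complete-data log-likelihood U
        delta = gamma.mean()                          # expected fraction of w_i = 1
        mismatch = gamma * (1 - y) + (1 - gamma) * y  # expected P(w_i != y_i | y_i)
        eps = mismatch.mean()
    return eps, delta

# Toy usage: data generated from the model itself
rng = np.random.default_rng(0)
w = (rng.random(10_000) < 0.7).astype(int)        # hidden w_i, true delta = 0.7
y = np.where(rng.random(10_000) < 0.1, 1 - w, w)  # observed y_i, true eps = 0.1
print(em_bernoulli_noise(y))
```

Note that in this particular toy model the likelihood depends on (ϵ, δ) only through P(y_i = 1) = δ(1 − ϵ) + (1 − δ)ϵ, so the maximizer is not unique; EM still converges to a stationary point, as stated above, but the limit depends on the initialization.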

EM Algorithm for mixture model


Mixture model:
Assume the data are generated by the following process:

1. Sample l_i ∈ {1, ⋯, k}, i = 1, ⋯, m, with l_i ∼ multinomial(ϕ), where ϕ = [ϕ_1, ⋯, ϕ_k] and ∑_{j=1}^k ϕ_j = 1.

2. Sample the observation y_i from some distribution P(l_i, y_i), where P(l_i, y_i) = P(l_i) · P(y_i | l_i).

Mixture Gaussian model: y_i | l_i = j ∼ N(μ_j, Σ_j).
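As a quick illustration of this two-step generative process, the following sketch draws m samples from a k = 2 component Gaussian mixture (the parameter values are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative mixture parameters (k = 2 components in d = 2 dimensions)
phi = np.array([0.3, 0.7])                      # mixing weights, sum to 1
mu = np.array([[0.0, 0.0], [3.0, 3.0]])         # component means mu_j
Sigma = np.array([np.eye(2), 0.5 * np.eye(2)])  # component covariances Sigma_j

m = 500
# Step 1: sample labels l_i ~ multinomial(phi)
labels = rng.choice(len(phi), size=m, p=phi)
# Step 2: sample y_i | l_i = j ~ N(mu_j, Sigma_j)
y = np.array([rng.multivariate_normal(mu[j], Sigma[j]) for j in labels])
```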



In our notation,

x = [ϕ, μ, Σ], where ϕ = [ϕ_1, ⋯, ϕ_k], μ = [μ_1, ⋯, μ_k], Σ = [Σ_1, ⋯, Σ_k]

z = [l_1, ⋯, l_m, y_1, ⋯, y_m]

y = [y_1, ⋯, y_m]

The EM Algorithm:

E-step:

U(x, x̂^(n)) = E_{P_{Z|Y}(·|y; x̂^(n))}[log P_Z(z; x) | Y = y, x = x̂^(n)]
= ∑_{i=1}^m ∑_{l_i=1}^k P(l_i | y_i; x̂^(n)) log P(l_i, y_i; x)

M-step:

x̂^(n+1) = arg max_x U(x, x̂^(n))

The Mixture Gaussian:

E-step:

P(l_i = j | y_i; x̂^(n))
= [P(y_i | l_i = j; x̂^(n)) · P(l_i = j; x̂^(n))] / P(y_i; x̂^(n))
= [P(y_i | l_i = j; x̂^(n)) · P(l_i = j; x̂^(n))] / [∑_{j′=1}^k P(y_i | l_i = j′; x̂^(n)) · P(l_i = j′; x̂^(n))]
= [(2π)^{−d/2} |Σ̂_j^(n)|^{−1/2} exp(−½ (y_i − μ̂_j^(n))^T (Σ̂_j^(n))^{−1} (y_i − μ̂_j^(n))) · ϕ̂_j^(n)] / [∑_{j′=1}^k (2π)^{−d/2} |Σ̂_{j′}^(n)|^{−1/2} exp(−½ (y_i − μ̂_{j′}^(n))^T (Σ̂_{j′}^(n))^{−1} (y_i − μ̂_{j′}^(n))) · ϕ̂_{j′}^(n)]
≜ w_ij
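A direct transcription of this E-step into code might look as follows (a sketch; the function and variable names are mine, and the Gaussian density is evaluated with plain numpy):

```python
import numpy as np

def gaussian_pdf(y, mu, Sigma):
    """Multivariate normal density N(y; mu, Sigma)."""
    d = len(mu)
    diff = y - mu
    norm = (2 * np.pi) ** (-d / 2) * np.linalg.det(Sigma) ** (-0.5)
    return norm * np.exp(-0.5 * diff @ np.linalg.solve(Sigma, diff))

def e_step(y, phi, mu, Sigma):
    """Responsibilities w[i, j] = P(l_i = j | y_i; current parameter estimates)."""
    m, k = len(y), len(phi)
    w = np.zeros((m, k))
    for i in range(m):
        for j in range(k):
            w[i, j] = gaussian_pdf(y[i], mu[j], Sigma[j]) * phi[j]
        w[i] /= w[i].sum()   # normalize over j, as in the formula above
    return w
```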

⇒ Compute U(x, x̂^(n)) for all x:

U(x, x̂^(n)) = ∑_{i=1}^m ∑_{j=1}^k w_ij log P(l_i = j, y_i; x)
= ∑_{i=1}^m ∑_{j=1}^k w_ij (log P(l_i = j; x) + log P(y_i | l_i = j; x))
= ∑_{i=1}^m ∑_{j=1}^k w_ij (log ϕ_j − ½ log((2π)^d |Σ_j|) − ½ (y_i − μ_j)^T Σ_j^{−1} (y_i − μ_j))

Find x* = arg max_x U(x, x̂^(n)).
M-step: updating the parameters.

Take the derivative with respect to ϕ_j. Note that ∑_{j=1}^k ϕ_j = 1, so use the Lagrange multiplier method:

∂/∂ϕ_j [U(x, x̂^(n)) − λ(∑_{j=1}^k ϕ_j − 1)] |_{ϕ_j = ϕ̂_j^(n+1)} = (∑_{i=1}^m w_ij) / ϕ̂_j^(n+1) − λ = 0

By ∑_{j=1}^k ϕ_j = 1, λ = ∑_{i=1}^m ∑_{j=1}^k w_ij = m, and then

ϕ̂_j^(n+1) = (1/m) ∑_{i=1}^m w_ij
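For instance, with m = 3 samples and k = 2 components this update is simply the column average of the responsibility matrix (a tiny numeric sketch with made-up responsibilities):

```python
import numpy as np

w = np.array([[0.9, 0.1],
              [0.2, 0.8],
              [0.6, 0.4]])   # w[i, j] from the E-step (illustrative values)
phi_new = w.mean(axis=0)     # phi_j^(n+1) = (1/m) * sum_i w_ij
print(phi_new)               # [0.56666667 0.43333333]
```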

Take the derivative with respect to μ_j:

∂U(x, x̂^(n))/∂μ_j |_{μ_j = μ̂_j^(n+1)} = ∑_{i=1}^m w_ij Σ_j^{−1} (y_i − μ̂_j^(n+1)) = 0

Then, we have

μ̂_j^(n+1) = (∑_{i=1}^m w_ij · y_i) / (∑_{i=1}^m w_ij)

Take the derivative with respect to Σ_j:

∂U(x, x̂^(n))/∂Σ_j |_{Σ_j = Σ̂_j^(n+1), μ_j = μ̂_j^(n+1)}
= −½ ∑_{i=1}^m w_ij ((Σ̂_j^(n+1))^{−1} − (Σ̂_j^(n+1))^{−1} (y_i − μ̂_j^(n+1))(y_i − μ̂_j^(n+1))^T (Σ̂_j^(n+1))^{−1}) = 0

⇒ ∑_{i=1}^m w_ij (Σ̂_j^(n+1) − (y_i − μ̂_j^(n+1))(y_i − μ̂_j^(n+1))^T) = 0

Then, we have

Σ̂_j^(n+1) = (∑_{i=1}^m w_ij (y_i − μ̂_j^(n+1))(y_i − μ̂_j^(n+1))^T) / (∑_{i=1}^m w_ij)
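Putting the three updates together, here is a sketch of the M-step and the full EM loop, building on the e_step function sketched above (again, the function names are illustrative, not from the notes):

```python
import numpy as np

def m_step(y, w):
    """Update phi, mu, Sigma from the responsibilities w[i, j]."""
    m, k = w.shape
    Nj = w.sum(axis=0)                # effective sample count per component
    phi = Nj / m                      # phi_j^(n+1)
    mu = (w.T @ y) / Nj[:, None]      # mu_j^(n+1): responsibility-weighted mean
    d = y.shape[1]
    Sigma = np.zeros((k, d, d))
    for j in range(k):
        diff = y - mu[j]
        # Sigma_j^(n+1): responsibility-weighted scatter around the new mean
        Sigma[j] = (w[:, j, None] * diff).T @ diff / Nj[j]
    return phi, mu, Sigma

def em_gmm(y, phi, mu, Sigma, n_iter=50):
    """Alternate the E-step and M-step for a fixed iteration budget."""
    for _ in range(n_iter):
        w = e_step(y, phi, mu, Sigma)   # e_step from the earlier sketch
        phi, mu, Sigma = m_step(y, w)
    return phi, mu, Sigma
```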

