EM Algorithm

This document uses the Expectation-Maximization (EM) algorithm to fit a mixture of two normal distributions to simulated data. In the first example, about a quarter of the points are drawn from N(1,1) and the rest from N(7,1); in the second, the larger component is N(4,1) instead. Each EM iteration estimates the latent class memberships (E-step) and then updates the mixing proportions and component means (M-step). In both examples the estimates settle close to the true parameters, and the second fit is compared with normalmixEM().

YIK LUN, KEI

library(lattice)  ## provides densityplot()

set.seed(123)
tau_1_true <- 0.25  ## true probability of drawing from the first component
x <- y <- rep(0, 1000)
for (i in 1:1000) {
  if (runif(1) < tau_1_true) {
    x[i] <- rnorm(1, mean = 1); y[i] <- "heads"  ## component 1: N(1, 1)
  } else {
    x[i] <- rnorm(1, mean = 7); y[i] <- "tails"  ## component 2: N(7, 1)
  }
}
densityplot(~x, par.settings = list(plot.symbol = list(col = as.factor(y))))

[Figure: density plot of x (vertical axis: Density, horizontal axis: x)]

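The simulation loop above draws one observation at a time. As a minimal sketch, the same mixture could also be simulated in vectorized form. The names `n` and `is_heads` are assumptions introduced here, and because the random numbers are consumed in a different order, the sample will not be identical to the one produced by the loop.

```r
set.seed(123)
n <- 1000            # sample size (hypothetical name, not in the original)
tau_1_true <- 0.25   # true probability of the first component

# Latent "coin flips": TRUE means the point comes from the first component
is_heads <- runif(n) < tau_1_true
y <- ifelse(is_heads, "heads", "tails")

# rnorm() accepts a vector of means, one per observation
x <- rnorm(n, mean = ifelse(is_heads, 1, 7))
```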
## initial guesses for the distribution parameters
mu_1 <- 0
mu_2 <- 1
## latent variable parameters (mixing proportions)
tau_1 <- 0.5
tau_2 <- 0.5

for (i in 1:10) {
  ## Given the observed data and distribution parameters, what are the latent variables?
  ## dnorm() uses its default sd = 1, so both standard deviations are held fixed at 1.
  T_1 <- tau_1 * dnorm(x, mu_1)
  T_2 <- tau_2 * dnorm(x, mu_2)
  P_1 <- T_1 / (T_1 + T_2)
  P_2 <- T_2 / (T_1 + T_2)  ## note: P_2 = 1 - P_1
  tau_1 <- mean(P_1)
  tau_2 <- mean(P_2)
  ## Given the observed data, as well as the latent variables, what are the population parameters?
  mu_1 <- sum(P_1 * x) / sum(P_1)
  mu_2 <- sum(P_2 * x) / sum(P_2)
  ## each printed row is: mu_1, mu_2, tau_1
  print(c(mu_1, mu_2, mean(P_1)))
}
## [1] 0.5045618 6.1011529 0.1002794
## [1] 0.8546336 6.9403680 0.2301181
## [1] 0.9732251 7.0006108 0.2423406
## [1] 0.9853947 7.0054109 0.2434347
## [1] 0.9864849 7.0058260 0.2435309
## [1] 0.9865811 7.0058624 0.2435394
## [1] 0.9865895 7.0058656 0.2435401
## [1] 0.9865903 7.0058659 0.2435402
## [1] 0.9865903 7.0058660 0.2435402
## [1] 0.9865904 7.0058660 0.2435402
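
For reference, the updates performed inside the loop can be written out as equations. This is a restatement of the code above rather than anything additional from the source: φ denotes the standard normal density (both standard deviations are fixed at 1), n is the number of observations, and the index notation P_{k,i} is introduced here for illustration.

```latex
% E-step: posterior probability that x_i belongs to component k
P_{1,i} = \frac{\tau_1\,\phi(x_i - \mu_1)}
               {\tau_1\,\phi(x_i - \mu_1) + \tau_2\,\phi(x_i - \mu_2)},
\qquad
P_{2,i} = 1 - P_{1,i}

% M-step: update the mixing proportions and the component means
\tau_k = \frac{1}{n}\sum_{i=1}^{n} P_{k,i},
\qquad
\mu_k = \frac{\sum_{i=1}^{n} P_{k,i}\,x_i}{\sum_{i=1}^{n} P_{k,i}},
\qquad k = 1, 2
```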

set.seed(123)
tau_true <- 0.25
x <- y <- rep(0, 1000)
for (i in 1:1000) {
  if (runif(1) < tau_true) {
    x[i] <- rnorm(1, mean = 1); y[i] <- "heads"
  } else {
    x[i] <- rnorm(1, mean = 4); y[i] <- "tails"
  }
}
densityplot(~x, par.settings = list(plot.symbol = list(col = as.factor(y))))

[Figure: density plot of x (vertical axis: Density, horizontal axis: x)]

mu_1 <- 0
mu_2 <- 1
tau_1 <- 0.5
tau_2 <- 0.5
for (i in 1:30) {
  ## Given the observed data and the distribution parameters, what are the latent variables?
  ## dnorm() uses its default sd = 1, so both standard deviations are held fixed at 1.
  T_1 <- tau_1 * dnorm(x, mu_1)
  T_2 <- tau_2 * dnorm(x, mu_2)
  P_1 <- T_1 / (T_1 + T_2)
  P_2 <- T_2 / (T_1 + T_2)  ## note: P_2 = 1 - P_1
  tau_1 <- mean(P_1)
  tau_2 <- mean(P_2)
  ## Given the observed data and the latent variables, what are the population parameters?
  mu_1 <- sum(P_1 * x) / sum(P_1)
  mu_2 <- sum(P_2 * x) / sum(P_2)
  ## each printed row is: mu_1, mu_2, tau_1
  print(c(mu_1, mu_2, mean(P_1)))
}
## [1] 1.0835357 3.6048714 0.1320495
## [1] 0.6797230 3.8663167 0.1865272
## [1] 0.7320122 3.9306341 0.2059336
## [1] 0.7910984 3.9574819 0.2165093
## [1] 0.8298998 3.9730967 0.2230743
## [1] 0.8545108 3.9827182 0.2272189
## [1] 0.8701122 3.9887344 0.2298464
## [1] 0.8800221 3.9925240 0.2315159
## [1] 0.8863270 3.9949222 0.2325783
## [1] 0.8903429 3.9964445 0.2332551
## [1] 0.8929026 3.9974127 0.2336866
## [1] 0.8945350 3.9980293 0.2339618
## [1] 0.8955764 3.9984223 0.2341373
## [1] 0.8962408 3.9986729 0.2342493
## [1] 0.8966648 3.9988327 0.2343208
## [1] 0.8969354 3.9989347 0.2343664
## [1] 0.8971081 3.9989998 0.2343955
## [1] 0.8972184 3.9990414 0.2344141
## [1] 0.8972887 3.9990679 0.2344260
## [1] 0.8973336 3.9990848 0.2344335
## [1] 0.8973623 3.9990956 0.2344384
## [1] 0.8973806 3.9991025 0.2344414
## [1] 0.8973922 3.9991069 0.2344434
## [1] 0.8973997 3.9991097 0.2344447
## [1] 0.8974045 3.9991115 0.2344455
## [1] 0.8974075 3.9991126 0.2344460
## [1] 0.8974094 3.9991134 0.2344463
## [1] 0.8974107 3.9991138 0.2344465
## [1] 0.8974115 3.9991141 0.2344466
## [1] 0.8974120 3.9991143 0.2344467
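
The loops above run for a fixed number of iterations (10 and 30). One possible alternative, sketched here under stated assumptions, is to stop once the observed-data log-likelihood stops improving. The function name `em_two_normals` and the arguments `tol` and `max_iter` are hypothetical and do not appear in the original. The data are re-simulated in vectorized form so the block is self-contained, which means the sample differs from the one used above.

```r
# Sketch: EM for a two-component normal mixture with both sds fixed at 1,
# stopping when the log-likelihood gain falls below `tol`.
# `em_two_normals`, `tol` and `max_iter` are illustrative names only.
em_two_normals <- function(x, mu = c(0, 1), tau = c(0.5, 0.5),
                           tol = 1e-8, max_iter = 500) {
  loglik_old <- -Inf
  for (iter in seq_len(max_iter)) {
    # E-step: weighted component densities and posterior memberships
    T_1 <- tau[1] * dnorm(x, mu[1])
    T_2 <- tau[2] * dnorm(x, mu[2])
    P_1 <- T_1 / (T_1 + T_2)
    P_2 <- 1 - P_1

    # Observed-data log-likelihood at the current parameters
    loglik <- sum(log(T_1 + T_2))
    if (loglik - loglik_old < tol) break
    loglik_old <- loglik

    # M-step: update mixing proportions and means
    tau <- c(mean(P_1), mean(P_2))
    mu  <- c(sum(P_1 * x) / sum(P_1), sum(P_2 * x) / sum(P_2))
  }
  list(mu = mu, tau = tau, loglik = loglik, iterations = iter)
}

# Self-contained test data: 25% from N(1, 1), 75% from N(4, 1)
set.seed(123)
z <- runif(1000) < 0.25
x_sim <- rnorm(1000, mean = ifelse(z, 1, 4))

fit <- em_two_normals(x_sim)
print(fit$mu)
print(fit$tau)
print(fit$iterations)
```

Monitoring the log-likelihood is a common choice because EM never decreases it, so a small gain is a reasonable sign that the iteration has stalled; comparing successive parameter values would be another option.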

library(mixtools)  ## provides normalmixEM()
myEM <- normalmixEM( x, mu = c(0,1), sigma=c(1,1), sd.constr=c(1,1) )

## number of iterations= 21
myEM$mu ## the means of the two distributions

## [1] 0.8974058 3.9991120

myEM$lambda ## the mixing probabilities

## [1] 0.2344461 0.7655539
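
Once the parameters are estimated, the same E-step formula gives the posterior probability that any given point belongs to the first component. The following is a minimal sketch that plugs in the estimates printed above; `posterior_1` and `x_star` are hypothetical names introduced for illustration. With equal standard deviations, the point where both components are equally likely has a closed form, which the sketch also computes.

```r
# Estimates copied from the normalmixEM() output printed above
mu  <- c(0.8974058, 3.9991120)   # component means
tau <- c(0.2344461, 0.7655539)   # mixing probabilities

# Posterior probability of component 1 (both sds fixed at 1)
posterior_1 <- function(x) {
  T_1 <- tau[1] * dnorm(x, mu[1])
  T_2 <- tau[2] * dnorm(x, mu[2])
  T_1 / (T_1 + T_2)
}

# Point where the two weighted densities are equal (posterior = 0.5):
# the midpoint of the means, shifted toward the less probable component
x_star <- (mu[1] + mu[2]) / 2 + log(tau[1] / tau[2]) / (mu[2] - mu[1])

print(x_star)
print(posterior_1(c(0, x_star, 5)))
```

Points below `x_star` would be assigned to the first component and points above it to the second, if a hard classification is wanted.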
