Tutorial: Gaussian Mixture Models and the EM algorithm
Radek Danecek
Gaussian Mixture Model
• Unsupervised method
• Fits multimodal data as a mixture of Gaussian distributions
Formal Definition
• The model is described as: p(x) = Σ_{k=1..K} π_k N(x | μ_k, Σ_k), with π_k ≥ 0 and Σ_k π_k = 1
• Apply MLE: fit the parameters π, μ, Σ by maximizing the likelihood of the data
• Maximize: log p(X | π, μ, Σ) = Σ_{n=1..N} log Σ_{k=1..K} π_k N(x_n | μ_k, Σ_k)
• Algorithm:
• Initialize model parameters (randomly):
• Iterate until convergence:
• E-step
• Assign cluster probabilities (“soft labels”) to each sample
• M-step
• Solve the MLE using the soft labels
Initialization
• Initialize model parameters (randomly)
• Covariances: spherical, set according to the empirical variance of the data
E-step
• For each data point x_n and each cluster k, compute the probability γ_nk that x_n belongs to cluster k (given current model parameters)
• γ_nk = π_k N(x_n | μ_k, Σ_k) / Σ_{j=1..K} π_j N(x_n | μ_j, Σ_j)
• These “soft labels” form an N × K matrix: row n holds the probabilities of point n belonging to clusters 1…K (they sum up to 1)
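The E-step is just a normalized table of weighted densities. A small sketch, computed in log space for numerical stability (an implementation detail not discussed on the slide; uses SciPy's `multivariate_normal`):

```python
import numpy as np
from scipy.stats import multivariate_normal

def e_step(X, priors, means, covs):
    """Soft labels gamma[n, k] = p(cluster k | x_n), computed in log space."""
    K = len(priors)
    log_dens = np.stack(
        [np.log(priors[k]) + multivariate_normal.logpdf(X, means[k], covs[k])
         for k in range(K)], axis=1)                       # shape (N, K)
    # log-sum-exp normalization: each row of gamma sums to 1
    log_norm = np.logaddexp.reduce(log_dens, axis=1, keepdims=True)
    return np.exp(log_dens - log_norm)
```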
M-step
• Now we have “soft labels” for the data -> fall back to supervised MLE
• Differentiate w.r.t. the parameters μ_k, Σ_k, and π_k (subject to Σ_k π_k = 1)
M-step
• Update model parameters (with effective cluster size N_k = Σ_{n=1..N} γ_nk):
• μ_k = (1 / N_k) Σ_n γ_nk x_n
• Σ_k = (1 / N_k) Σ_n γ_nk (x_n − μ_k)(x_n − μ_k)^T
• Update prior for each cluster: π_k = N_k / N
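The three updates can be written as a few weighted averages. A minimal sketch (the function name `m_step` is an assumption):

```python
import numpy as np

def m_step(X, gamma):
    """Weighted-MLE parameter updates from soft labels gamma of shape (N, K)."""
    N, d = X.shape
    Nk = gamma.sum(axis=0)                    # effective cluster sizes N_k
    means = (gamma.T @ X) / Nk[:, None]       # mu_k = (1/N_k) sum_n gamma_nk x_n
    covs = np.empty((len(Nk), d, d))
    for k in range(len(Nk)):
        diff = X - means[k]                   # Sigma_k = weighted scatter around mu_k
        covs[k] = (gamma[:, k, None] * diff).T @ diff / Nk[k]
    priors = Nk / N                           # pi_k = N_k / N
    return priors, means, covs
```

With hard (one-hot) labels this reduces exactly to the per-cluster sample mean, sample covariance, and class proportion, which is the "fall back to supervised MLE" intuition.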
Practical Example – Color segmentation
• Input image segmented with GMMs of 3, 4, 5, 7, 8, 9, 10, 15, and 20 clusters
(figures: input image and segmentation results omitted)
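A color-segmentation run like the one shown can be reproduced with scikit-learn's `GaussianMixture`. This is a sketch under the assumption that pixel RGB values are clustered and each pixel is repainted with its cluster's mean color; `segment_colors` is a hypothetical helper name.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def segment_colors(image, n_clusters, seed=0):
    """Cluster pixel colors with a GMM; `image` is an (H, W, 3) float array."""
    h, w, c = image.shape
    pixels = image.reshape(-1, c)              # one row per pixel
    gmm = GaussianMixture(n_components=n_clusters, random_state=seed).fit(pixels)
    labels = gmm.predict(pixels)               # hard cluster assignment per pixel
    # paint every pixel with the mean color of its cluster
    return gmm.means_[labels].reshape(h, w, c)
```

Increasing `n_clusters` reproduces the progression in the slides: few clusters give coarse color regions, many clusters approach the original image.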
EM Algorithm for GMMs
• Idea: optimize the log-likelihood indirectly, since the sum inside the log prevents a closed-form MLE
• Objective function: log p(X | π, μ, Σ) = Σ_n log Σ_k π_k N(x_n | μ_k, Σ_k)
• Split optimization of the objective into two parts
• Algorithm:
• Initialize model parameters (randomly):
• Iterate until convergence:
• E-step
• Assign cluster probabilities (“soft labels”) to each sample
• M-step
• Find optimal parameters given the soft labels
Generalized EM
• Idea: the same E/M alternation works for any latent-variable model, not just GMMs
• Objective function: a lower bound on log p(X | θ), tightened in the E-step and maximized over θ in the M-step
• Split optimization of the objective into two parts
• Algorithm:
• Initialize model parameters (randomly):
• Iterate until convergence:
• E-step
• Assign cluster probabilities (“soft labels”) to each sample
• M-step
• Find optimal parameters given the soft labels
Generalized M-step
• What is the objective function?
• GMM: Q = Σ_n Σ_k γ_nk [log π_k + log N(x_n | μ_k, Σ_k)]
• General: Q(θ, θ_old) = E_{Z ~ p(Z | X, θ_old)} [log p(X, Z | θ)]
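Why maximizing this objective increases the likelihood follows from a standard decomposition (a reconstruction of the usual textbook argument, not taken verbatim from the slides):

```latex
% For any distribution q(Z) over the latent variables:
\log p(X \mid \theta)
  = \underbrace{\sum_Z q(Z) \log \frac{p(X, Z \mid \theta)}{q(Z)}}_{\mathcal{L}(q,\,\theta)}
  + \underbrace{\mathrm{KL}\!\left( q(Z) \,\|\, p(Z \mid X, \theta) \right)}_{\ge\, 0}

% E-step: set q(Z) = p(Z | X, theta_old), making the bound tight (KL = 0).
% M-step: maximize L(q, theta) over theta, which equals maximizing
% Q(theta, theta_old) = E_q[ log p(X, Z | theta) ] up to a term constant in theta.
```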
Exercise
• Consider a mixture of K multivariate Bernoulli distributions with parameters μ_1, …, μ_K, where μ_k ∈ [0, 1]^D and x ∈ {0, 1}^D
• EM objective: Q = Σ_n Σ_k γ_nk [log π_k + log p(x_n | μ_k)]
Exercise
• Multivariate Bernoulli distribution: p(x | μ_k) = Π_{d=1..D} μ_kd^{x_d} (1 − μ_kd)^{1 − x_d}
• EM objective: Q = Σ_n Σ_k γ_nk [log π_k + Σ_d (x_nd log μ_kd + (1 − x_nd) log(1 − μ_kd))]
Exercise
• Question 3: Write down the M-step update
• Differentiate w.r.t. μ_k and set the derivative to zero: μ_k = (Σ_n γ_nk x_n) / (Σ_n γ_nk)
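The exercise's EM loop, including the M-step update from Question 3, can be sketched in NumPy (an illustrative solution under the definitions above, not the official one; `bernoulli_mixture_em` is a hypothetical name):

```python
import numpy as np

def bernoulli_mixture_em(X, K, n_iter=50, seed=0):
    """EM for a mixture of K multivariate Bernoullis; X is a binary (N, D) array."""
    rng = np.random.default_rng(seed)
    N, D = X.shape
    mu = rng.uniform(0.25, 0.75, size=(K, D))   # per-cluster success probabilities
    pi = np.full(K, 1.0 / K)
    for _ in range(n_iter):
        # E-step: log p(x_n | mu_k) = sum_d [x log mu + (1 - x) log(1 - mu)]
        log_p = X @ np.log(mu).T + (1 - X) @ np.log(1 - mu).T   # shape (N, K)
        log_p += np.log(pi)
        log_norm = np.logaddexp.reduce(log_p, axis=1, keepdims=True)
        gamma = np.exp(log_p - log_norm)        # soft labels, rows sum to 1
        # M-step (Question 3): mu_k = sum_n gamma_nk x_n / sum_n gamma_nk
        Nk = gamma.sum(axis=0)
        mu = np.clip((gamma.T @ X) / Nk[:, None], 1e-6, 1 - 1e-6)
        pi = Nk / N
    return pi, mu, gamma
```

The clipping keeps `mu` away from exactly 0 or 1 so the log terms stay finite, a practical detail the derivation on the slide does not need.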
Summary
• The EM algorithm is useful for fitting GMMs (and other mixture models) in an
unsupervised setting
Source: https://scikit-learn.org/stable/modules/clustering.html
Alternative for density estimation
• Kernel density estimation
Source: https://scikit-learn.org/stable/auto_examples/neighbors/plot_species_kde.html
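A minimal kernel density estimation sketch using SciPy's `gaussian_kde` (the slide's linked example uses scikit-learn's `KernelDensity` instead; either works):

```python
import numpy as np
from scipy.stats import gaussian_kde

# Non-parametric density estimate: place a kernel on every sample and sum them,
# instead of fitting a fixed number of Gaussian components with EM.
samples = np.concatenate([np.random.default_rng(0).normal(-2, 0.5, 300),
                          np.random.default_rng(1).normal(3, 1.0, 300)])
kde = gaussian_kde(samples)              # bandwidth chosen by Scott's rule
density = kde(np.linspace(-5, 6, 200))   # evaluate the estimate on a grid
```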
References
• Lecture slides/videos
• https://www.python-course.eu/expectation_maximization_and_gaussian_mixture_models.php
• https://scikit-learn.org/stable/modules/clustering.html