MA5232 Modeling and Numerical Simulations: Iterative Methods For Mixture-Model Segmentation 8 Apr 2015
Last time
PCA reduces the dimensionality of a data set while retaining as much of the data variation as possible.
Statistical view: the leading PCs are given by the leading eigenvectors of the covariance matrix.
Geometric view: fitting a d-dimensional subspace model via the SVD.
Extensions of PCA
Probabilistic PCA via MLE
Kernel PCA via kernel functions and kernel matrices
National University of Singapore
4/7/2015
This lecture
Review basic iterative algorithms for central clustering.
Formulation of the subspace segmentation problem.
Segmentation by Clustering
From: Object Recognition as Machine Translation, Duygulu, Barnard, de Freitas, Forsyth, ECCV02
Example 4.1
Euclidean distance-based clustering is not invariant to linear transformations of the data.
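To make Example 4.1 concrete, here is a small numerical sketch (the point and centroids are made up for illustration) showing that an invertible linear map can change which centroid is nearest in Euclidean distance:

```python
import numpy as np

# Hypothetical point p and two centroids a, b (made up for illustration).
p = np.array([1.4, 0.5])
a = np.array([0.0, 0.0])
b = np.array([2.0, 2.0])

def nearest(p, a, b):
    """Return the label of the centroid closer to p in Euclidean distance."""
    return "a" if ((p - a) ** 2).sum() < ((p - b) ** 2).sum() else "b"

print(nearest(p, a, b))              # -> a

# An invertible linear map (shrink the y-axis by 10x) applied to everything:
S = np.diag([1.0, 0.1])
print(nearest(S @ p, S @ a, S @ b))  # -> b: the cluster assignment flips
```

Since squared distances to the two centroids change by different amounts under the anisotropic scaling, the nearest-centroid assignment (and hence any Euclidean k-means segmentation) can change.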
Central Clustering
Assume the data are sampled from a mixture of Gaussians.
Algorithm
A chicken-and-egg view
Two-Step Iteration
Example
http://util.io/k-means
Feature Space
Source: K. Grauman
[Figure: input image segmented by clustering — clusters on intensity vs. clusters on color]
Characteristics of K-Means
It is a greedy algorithm and does not guarantee convergence to the global optimum.
Given fixed initial clusters / Gaussian models, the iterative process is deterministic.
Results may be improved by running k-means multiple times with different starting conditions.
The segmentation-estimation process can be treated as a generalized expectation-maximization (EM) algorithm.
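The two-step (segmentation/estimation) iteration above can be sketched as follows. This is a minimal NumPy illustration, not the handout's implementation; following the point about starting conditions, it runs several random restarts and keeps the lowest-distortion result:

```python
import numpy as np

def kmeans(X, k, iters=100, restarts=5, seed=0):
    """Two-step k-means with random restarts; keeps the lowest-distortion run."""
    rng = np.random.default_rng(seed)
    best_cost, best_centers, best_labels = np.inf, None, None
    for _ in range(restarts):
        centers = X[rng.choice(len(X), k, replace=False)]  # random data points
        for _ in range(iters):
            # Step 1 (segmentation): assign each point to its nearest center.
            d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
            labels = d2.argmin(1)
            # Step 2 (estimation): recompute each center as the mean of its group.
            new = np.array([X[labels == j].mean(0) if (labels == j).any()
                            else centers[j] for j in range(k)])
            if np.allclose(new, centers):
                break
            centers = new
        cost = d2.min(1).sum()
        if cost < best_cost:
            best_cost, best_centers, best_labels = cost, centers, labels
    return best_cost, best_centers, best_labels

# Two well-separated synthetic blobs:
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 0.1, (50, 2)), rng.normal(5, 0.1, (50, 2))])
cost, centers, labels = kmeans(X, k=2)
```

Each single run is greedy and can stall at a local optimum, which is exactly why the restart loop is worth its cost on small problems.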
Update step: assuming the membership estimate is fixed, maximize the expected complete log-likelihood with respect to the model parameters.
Exercise 4.2
EM Algorithm
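The EM algorithm for a Gaussian mixture can be sketched as below. Assumptions not prescribed by the handout: isotropic (spherical) covariances and a simple farthest-point initialization of the means, chosen to keep the sketch short and deterministic:

```python
import numpy as np

def em_gmm(X, k, iters=100):
    """EM for a mixture of spherical Gaussians (minimal sketch).
    E step: posterior responsibilities. M step: weighted parameter updates."""
    n, d = X.shape
    # Farthest-point initialization of the means (illustrative choice).
    mu = [X[0]]
    for _ in range(1, k):
        dist2 = np.min([((X - m) ** 2).sum(1) for m in mu], axis=0)
        mu.append(X[dist2.argmax()])
    mu = np.array(mu)
    var = np.full(k, X.var())          # one isotropic variance per component
    pi = np.full(k, 1.0 / k)           # mixing weights
    for _ in range(iters):
        # E step: r[i, j] proportional to pi_j * N(x_i | mu_j, var_j * I)
        d2 = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(-1)
        log_r = np.log(pi) - 0.5 * d * np.log(var) - d2 / (2 * var)
        log_r -= log_r.max(1, keepdims=True)
        r = np.exp(log_r)
        r /= r.sum(1, keepdims=True)
        # M step: maximize the expected complete log-likelihood given r.
        nk = r.sum(0) + 1e-12
        pi = nk / n
        mu = (r.T @ X) / nk[:, None]
        d2 = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(-1)
        var = (r * d2).sum(0) / (d * nk) + 1e-12
    return pi, mu, var

rng = np.random.default_rng(2)
X = np.vstack([rng.normal(-3, 0.5, (100, 2)), rng.normal(3, 0.5, (100, 2))])
pi, mu, var = em_gmm(X, k=2)
```

Note how the E step mirrors the segmentation step of k-means (soft assignments instead of hard ones) and the M step mirrors the estimation step.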
Visual example of EM
[Figure sequence: two mixture components w1 and w2 re-fitted over successive EM iterations]
Potential Problems
Incorrect number of Mixture Components
Singularities
Singularities
A minority of the data can have a
disproportionate effect on the model
likelihood.
For example:
[Figure: GMM example]
Singularities
When a mixture component collapses onto a single data point, its mean becomes that point and its variance goes to zero.
Consider the likelihood function as the covariance goes to zero: the likelihood approaches infinity.
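A quick numerical check of this singularity: the density of a one-dimensional component centered exactly on a data point is 1/(√(2π)·σ), which grows without bound as σ shrinks, so the mixture likelihood of the data set is unbounded above:

```python
import numpy as np

# A component collapsed onto a data point x0: its density at x0 is
# 1 / (sqrt(2*pi) * sigma), which diverges as sigma -> 0.
densities = [1.0 / (np.sqrt(2 * np.pi) * s) for s in (1.0, 0.1, 0.01, 0.001)]
print(densities)  # each value is 10x the previous one
```

This is why practical EM implementations floor or regularize the covariance estimates.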
K-means vs. EM
So far
K-means
Expectation Maximization
Next up
Multiple-Subspace Segmentation
K-subspaces
EM for Subspaces
Multiple-Subspace Segmentation
K-subspaces
With noise, we minimize the total squared distance from each point to its assigned subspace: alternate between assigning every point to its nearest subspace and re-fitting each subspace to its assigned points via PCA/SVD, until the assignments stop changing.
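A minimal K-subspaces sketch. The deterministic farthest-residual initialization and the assumption that every group stays nonempty are illustrative choices, not part of the algorithm as stated in the handout:

```python
import numpy as np

def fit_subspace(X, dim):
    """Fit a dim-dimensional affine subspace (mean + basis) via SVD."""
    mu = X.mean(0)
    _, _, Vt = np.linalg.svd(X - mu, full_matrices=False)
    return mu, Vt[:dim].T                      # basis is D x dim, orthonormal

def residual2(X, mu, U):
    """Squared distance from each row of X to the subspace (mu, U)."""
    Y = X - mu
    return (Y ** 2).sum(1) - ((Y @ U) ** 2).sum(1)

def k_subspaces(X, k, dim, iters=30):
    """Alternate nearest-subspace assignment and per-group PCA refits."""
    # Illustrative deterministic init: seed subspace 0 with the first dim+1
    # points, then seed each next subspace with the current worst-fit points.
    models = [fit_subspace(X[:dim + 1], dim)]
    for _ in range(1, k):
        d2 = np.min([residual2(X, mu, U) for mu, U in models], axis=0)
        models.append(fit_subspace(X[np.argsort(d2)[-(dim + 1):]], dim))
    labels = np.full(len(X), -1)
    for _ in range(iters):
        d2 = np.stack([residual2(X, mu, U) for mu, U in models], axis=1)
        new_labels = d2.argmin(1)              # assignment (segmentation) step
        if (new_labels == labels).all():
            break
        labels = new_labels
        models = [fit_subspace(X[labels == j], dim)  # refit (estimation) step
                  for j in range(k)]
    return labels, models, d2.min(1).sum()

# Two parallel 1-D affine subspaces (lines) in the plane:
t = np.linspace(1, 2, 50)
X = np.vstack([np.c_[t, np.zeros(50)], np.c_[t, 5 * np.ones(50)]])
labels, models, cost = k_subspaces(X, k=2, dim=1)
```

On this clean toy data the total residual drops to (numerically) zero once the two lines are recovered; like k-means, the iteration is greedy and only guarantees a local optimum.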
EM for Subspaces
Model each subspace probabilistically, as a nearly degenerate Gaussian concentrated around the subspace. In the E step, compute each point's posterior membership probabilities. In the M step, update each subspace's mean, basis, and noise variance by maximizing the expected complete log-likelihood given those memberships.
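A rough EM-for-subspaces sketch, under the assumption that each group is an affine subspace (dim < D) plus isotropic Gaussian noise off the subspace — one common probabilistic model; the handout's exact formulation may differ. The contiguous-block initialization is an illustrative choice:

```python
import numpy as np

def em_subspaces(X, k, dim, iters=50):
    """Soft (EM-style) subspace segmentation sketch: each component is an
    affine subspace plus isotropic Gaussian noise off the subspace."""
    n, D = X.shape                             # assumes dim < D
    r = np.eye(k)[np.arange(n) * k // n]       # one-hot init: contiguous blocks
    for _ in range(iters):
        params = []
        for j in range(k):                     # M step, one component at a time
            w = r[:, j]
            mu = (w @ X) / w.sum()             # weighted mean
            _, _, Vt = np.linalg.svd((X - mu) * np.sqrt(w)[:, None],
                                     full_matrices=False)
            U = Vt[:dim].T                     # weighted-PCA basis
            res = ((X - mu) ** 2).sum(1) - (((X - mu) @ U) ** 2).sum(1)
            s2 = max((w @ res) / (w.sum() * (D - dim)), 1e-9)  # noise variance
            params.append((w.sum() / n, mu, U, s2))
        # E step: posterior memberships from the off-subspace residuals.
        log_r = np.empty((n, k))
        for j, (pi, mu, U, s2) in enumerate(params):
            res = ((X - mu) ** 2).sum(1) - (((X - mu) @ U) ** 2).sum(1)
            log_r[:, j] = (np.log(pi) - 0.5 * (D - dim) * np.log(s2)
                           - res / (2 * s2))
        log_r -= log_r.max(1, keepdims=True)
        r = np.exp(log_r)
        r /= r.sum(1, keepdims=True)
    return r.argmax(1), params

# Same toy data as for K-subspaces: two parallel lines in the plane.
t = np.linspace(1, 2, 50)
X = np.vstack([np.c_[t, np.zeros(50)], np.c_[t, 5 * np.ones(50)]])
labels, params = em_subspaces(X, k=2, dim=1)
```

Compared with K-subspaces, the hard assignment step is replaced by posterior weights, and each subspace refit becomes a weighted PCA; the structure of the iteration is otherwise the same.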
Homework
Read the handout, Chapter 4, "Iterative Methods for Multiple-Subspace Segmentation".
Complete Exercise 4.2 (page 111) of the handout.