ML Mod 5
ML Mod 5
ML Mod 5
EM ALGORITHM
Ans
1. Initialize the model parameters and cluster assignments. This can be done
randomly or using a heuristic method.
2. E-step (Expectation step): Calculate the expected probability of each
data point belonging to each cluster, given the current model parameters
and cluster assignments.
3. M-step (Maximization step): Update the model parameters to maximize
the expected log-likelihood of the data, given the current cluster
assignments.
4. Repeat steps 2 and 3 until the model parameters and cluster assignments
converge.
Q2. Density based clustering
Ans
Density-based clustering is a type of clustering algorithm that groups
data points based on their density in the data space. The most popular
density-based clustering algorithm is DBSCAN. DBSCAN stands for
Density-Based Spatial Clustering of Applications with Noise.
1. For each data point, find all other data points within a certain
distance (ε-neighborhood).
2. If a data point has at least a minimum number of neighbors
(MinPts) within its ε-neighborhood, then it is considered a core point.
3. Clusters are formed by connecting all core points that are directly or
indirectly connected to each other through a chain of core points.
4. Data points that are not core points and are not reachable from any
core point are considered noise.
Step-3: Assign each data point to their closest centroid, which will form the
predefined K clusters.
Step-4: Calculate the variance and place a new centroid of each cluster.
Step-5: Repeat the third steps, which means assign each datapoint to the new
closest centroid of each cluster.