Week 15 - Clustering
Fundamentals
• Learning
• Classification and Clustering
Clustering
• K-Means Clustering
Fundamentals – Learning
• Supervised
• Unsupervised
Classification and Clustering
Classification and clustering are two methods
of pattern identification used in machine learning.
Although the two techniques have certain similarities,
they differ in one key respect: classification
assigns objects to predefined classes, while
clustering has no predefined classes and instead
groups objects by the characteristics they share,
which also set them apart from other groups of
objects. These groups are known as "clusters". [1]
[1] https://blog.bismart.com/en/classification-vs.-clustering-a-practical-explanation
K-Means Clustering – What is it?
Suppose you have data that you need to put into three clusters.
Step 1
Select the number K of clusters you want to identify in your data. For example, K = 3.
Step 2
Randomly select K = 3 distinct data points. These are the initial cluster centers.
Step 3
Measure the distance between the 1st point and each of the three initial cluster centers.
Step 4
Assign the 1st point to the nearest cluster. In this case, the nearest cluster is the blue one.
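Step 5
Once every point has been assigned in the same way, calculate the mean of each cluster.
Then repeat the process using the mean values as the new cluster centers. Since the
clustering does not change in this iteration, the process ends.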
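
The steps above can be sketched in Python as follows. This is only a minimal sketch under stated assumptions, not a reference implementation: it assumes 2-D points stored as tuples and Euclidean distance. The function name `kmeans`, its `seed` and `max_iter` parameters, and the sample data are hypothetical and do not come from the slides.

```python
import math
import random


def kmeans(points, k, seed=0, max_iter=100):
    """Group points into k clusters; returns (centers, labels).

    Hypothetical helper for illustration. Assumes Euclidean distance
    and that `points` is a list of equal-length numeric tuples.
    """
    rng = random.Random(seed)
    # Step 2: randomly pick k data points as the initial cluster centers.
    centers = rng.sample(points, k)
    labels = None
    for _ in range(max_iter):
        # Steps 3-4: measure the distance from each point to every center
        # and assign the point to the nearest one.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centers[j]))
            for p in points
        ]
        # Stop when the clustering no longer changes between iterations.
        if new_labels == labels:
            break
        labels = new_labels
        # Replace each center with the mean of the points assigned to it.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old center if a cluster ends up empty
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centers, labels


# Made-up sample data: nine 2-D points.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5),
          (8.0, 8.0), (8.5, 9.0), (9.0, 8.0),
          (1.0, 9.0), (1.5, 8.5), (2.0, 9.5)]
centers, labels = kmeans(points, k=3)
for p, lab in zip(points, labels):
    print(p, "-> cluster", lab)
```

Because the initial centers are chosen at random, different `seed` values can lead to different final clusters, so the result depends on the starting points.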