Hierarchical Clustering in Machine Learning
Hierarchical clustering is another unsupervised machine learning algorithm, which is used to group unlabeled datasets into clusters. It is also known as hierarchical cluster analysis (HCA).
In this algorithm, we develop the hierarchy of clusters in the form of a tree, and this tree-shaped structure is known as the
dendrogram.
Sometimes the results of K-means clustering and hierarchical clustering may look similar, but they differ in how they work. For example, in hierarchical clustering there is no requirement to predetermine the number of clusters, as we did in the K-means algorithm.
The hierarchical clustering technique has two approaches:
1. Agglomerative: Agglomerative is a bottom-up approach, in which the algorithm starts by taking all data points as single clusters and merging them until one cluster is left.
2. Divisive: The divisive algorithm is the reverse of the agglomerative algorithm, as it is a top-down approach.
How Does Agglomerative Hierarchical Clustering Work?
The working of the AHC algorithm can be explained using the below steps:
◦ Step-1: Create each data point as a single cluster. Let's say there are N data points, so the number of clusters will also be N.
◦ Step-2: Take the two closest data points or clusters and merge them to form one cluster. There will now be N-1 clusters.
◦ Step-3: Again, take the two closest clusters and merge them together to form one cluster. There will be N-2 clusters.
◦ Step-4: Repeat Step-3 until only one cluster is left (a toy run of this merging loop is sketched after this list).
◦ Step-5: Once all the clusters are combined into one big cluster, develop the dendrogram to divide the clusters as per the problem.
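To make these steps concrete, below is a toy sketch of the merging loop in plain Python/NumPy; the 1-D sample points, the single-linkage distance, and the helper name cluster_distance are illustrative assumptions, not the tutorial's own code:

import numpy as np

points = np.array([1.0, 1.5, 5.0, 5.2, 9.0])   # N = 5 data points
clusters = [[i] for i in range(len(points))]   # Step-1: N single-point clusters

def cluster_distance(a, b):
    # Single-linkage distance: the closest pair of points across two clusters
    return min(abs(points[i] - points[j]) for i in a for j in b)

# Steps 2-4: repeatedly merge the two closest clusters until one is left
while len(clusters) > 1:
    i, j = min(((i, j) for i in range(len(clusters))
                for j in range(i + 1, len(clusters))),
               key=lambda pair: cluster_distance(clusters[pair[0]], clusters[pair[1]]))
    print("Merging", clusters[i], "and", clusters[j])
    clusters[i] = clusters[i] + clusters[j]
    del clusters[j]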
Note: To better understand hierarchical clustering, it is advised to have a look at k-means clustering first.
As we have seen, the distance between the two closest clusters is crucial for hierarchical clustering. There are various ways to calculate the distance between two clusters, and these ways decide the rule for clustering. These measures are called linkage methods. Some of the popular linkage methods are given below:
1. Single Linkage: It is the shortest distance between the closest points of the two clusters. It can be sensitive to noise.
2. Complete Linkage: It is the farthest distance between two points of two different clusters. It is one of the popular linkage methods, as it forms tighter clusters than single linkage and is more robust to noise.
3. Average Linkage: It is the linkage method in which the distance between each pair of points (one from each cluster) is added up and then divided by the total number of pairs to calculate the average distance between two clusters. It is also one of the most popular linkage methods and is fairly robust.
4. Centroid Linkage: It is the linkage method in which the distance between the centroids of the clusters is calculated.
We can apply any of the above approaches according to the type of problem or business requirement; a quick comparison sketch is given below.
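As an illustration, this sketch builds the merge tree under each of the four linkage methods using scipy; the random 2-D data and the printed statistic are assumptions for demonstration only:

import numpy as np
from scipy.cluster.hierarchy import linkage

rng = np.random.default_rng(0)
data = rng.random((10, 2))           # 10 random 2-D points, for illustration only

# scipy's names for the linkage methods described above
for method in ['single', 'complete', 'average', 'centroid']:
    z = linkage(data, method=method)
    # z[-1, 2] is the distance at which the last two clusters merge
    print(method, '->', round(z[-1, 2], 3))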
The dendrogram is a tree-like structure that is mainly used to record each merge step that the HC algorithm performs. In the dendrogram plot, the Y-axis shows the Euclidean distances between the data points, and the X-axis shows all the data points of the given dataset.
The working of the dendrogram can be explained using the below diagram:
In the above diagram, the left part is showing how clusters are created in agglomerative clustering, and the right part is showing
the corresponding dendrogram.
◦ As we have discussed above, firstly, the data points P2 and P3 combine to form a cluster; correspondingly, a dendrogram is created, which connects P2 and P3 with a rectangular shape. The height is decided according to the Euclidean distance between the data points.
◦ In the next step, P5 and P6 form a cluster, and the corresponding dendrogram is created. It is higher than the previous one, as the Euclidean distance between P5 and P6 is a little bit greater than that between P2 and P3.
◦ Again, two new dendrograms are created that combine P1, P2, and P3 in one dendrogram, and P4, P5, and P6 in another dendrogram.
◦ At last, the final dendrogram is created that combines all the data points together.
We can cut the dendrogram tree structure at any level as per our requirement, as sketched below.
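A minimal sketch of cutting the tree at a chosen number of clusters with scipy's fcluster; the random data and the choice of 3 clusters are assumptions for illustration:

import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(1)
data = rng.random((12, 2))                        # illustrative 2-D points

z = linkage(data, method='ward')                  # build the full merge tree
labels = fcluster(z, t=3, criterion='maxclust')   # cut the tree into 3 clusters
print(labels)                                     # cluster id (1..3) for each point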
Python Implementation of Agglomerative Hierarchical Clustering
The dataset contains the information of customers who have visited a mall for shopping. The mall owner wants to find some patterns or some particular behavior of his customers using the dataset information.
The steps for implementation will be the same as for k-means clustering, except for some changes, such as the method used to find the number of clusters. Below are the steps:
1. Data Pre-processing
2. Finding the optimal number of clusters using the Dendrogram
3. Training the hierarchical clustering model
4. Visualizing the clusters
In this step, we will import the libraries and datasets for our model.
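A minimal sketch of the import block the next paragraph describes:

# Importing the libraries
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd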
The above lines of code are used to import the libraries to perform specific tasks, such as numpy for mathematical operations, matplotlib for drawing graphs or scatter plots, and pandas for importing the dataset.
◦ Importing the dataset
As discussed above, we have imported the same dataset of Mall_Customers_data.csv, as we did in k-means clustering.
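A sketch of the dataset import, continuing the script above; the file is assumed to sit in the working directory:

# Importing the dataset
dataset = pd.read_csv('Mall_Customers_data.csv')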
Here we will extract only the matrix of features as we don't have any further information about the dependent variable. Code is given
below:
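A sketch of the feature extraction, using the column indices explained just below:

x = dataset.iloc[:, [3, 4]].values   # columns 3 and 4: Annual Income, Spending Score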
Here we have extracted only columns 3 and 4, as we will use a 2D plot to see the clusters. So, we are considering the Annual Income and Spending Score as the matrix of features.
Now we will find the optimal number of clusters using the dendrogram for our model. For this, we are going to use the scipy library, as it provides a function that will directly return the dendrogram for our code. Consider the below lines of code:
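A sketch of the dendrogram code the next paragraph walks through; the figure size and label strings are assumptions:

import scipy.cluster.hierarchy as shc

plt.figure(figsize=(10, 7))
plt.title('Dendrogram Plot')
# linkage() builds the merge tree with Ward's method; dendrogram() draws it
dendro = shc.dendrogram(shc.linkage(x, method='ward'))
plt.ylabel('Euclidean Distances')
plt.xlabel('Customers')
plt.show()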
In the above lines of code, we have imported the hierarchy module of the scipy library. This module provides us with a method shc.dendrogram(), which takes the output of linkage() as a parameter. The linkage function is used to define the distance between two clusters, so here we have passed x (the matrix of features) and the method "ward," a popular linkage method in hierarchical clustering.
The remaining lines of code describe the labels for the dendrogram plot.
Output:
By executing the above lines of code, we will get the below output:
Using this dendrogram, we will now determine the optimal number of clusters for our model. For this, we will find the maximum vertical distance that does not cut any horizontal bar. Consider the below diagram:
In the above diagram, we have shown the vertical distances that are not cutting their horizontal bars. As we can visualize, the 4th distance is looking the maximum, so according to this, the number of clusters will be 5 (the vertical lines in this range). We can also take the 2nd number, as it approximately equals the 4th distance, but we will consider 5 clusters because that is the same number we calculated in the K-means algorithm.
So, the optimal number of clusters will be 5, and we will train the model in the next step, using the same.
As we know the required optimal number of clusters, we can now train our model. The code is given below:
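A sketch of the training code described below; note that the parameter name metric replaced affinity in recent scikit-learn releases:

from sklearn.cluster import AgglomerativeClustering

# metric='euclidean' in recent scikit-learn (older versions used affinity='euclidean')
hc = AgglomerativeClustering(n_clusters=5, metric='euclidean', linkage='ward')
y_pred = hc.fit_predict(x)   # trains the model and returns each point's cluster label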
In the above code, we have imported the AgglomerativeClustering class from the cluster module of the scikit-learn library.
Then we have created an object of this class named hc. The AgglomerativeClustering class takes the following parameters:
◦ n_clusters=5: It defines the number of clusters, and we have taken 5 here because it is the optimal number of clusters.
◦ linkage='ward': It defines the linkage criterion; here we have used the "ward" linkage. This is the popular linkage method that we have already used for creating the dendrogram. It reduces the variance within each cluster.
In the last line, we have created the dependent variable y_pred to fit or train the model. It not only trains the model but also returns the clusters to which each data point belongs.
After executing the above lines of code, if we go through the variable explorer option in our Spyder IDE, we can check the y_pred variable. We can compare the original dataset with the y_pred variable.
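Outside Spyder, a quick way to inspect the labels is to print a slice; the slice length of 10 is arbitrary:

print(y_pred[:10])   # e.g. a leading 4 means the first customer is in the 5th cluster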
As we can see, y_pred shows the cluster values, which means that customer id 1 belongs to the 5th cluster (as indexing starts from 0, 4 means the 5th cluster), customer id 2 belongs to the 4th cluster, and so on.
As we have trained our model successfully, now we can visualize the clusters corresponding to the dataset.
Here we will use the same lines of code as we did in k-means clustering, except for one change: here we will not plot the centroids as we did in k-means, because here we have used the dendrogram to determine the optimal number of clusters. The code is given below:
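A sketch of the visualization, continuing the script above; the color choices and axis labels are assumptions patterned on the k-means chapter:

# Visualizing the five clusters (no centroid plotting, unlike k-means)
colors = ['blue', 'green', 'red', 'cyan', 'magenta']
for i in range(5):
    plt.scatter(x[y_pred == i, 0], x[y_pred == i, 1],
                s=100, c=colors[i], label=f'Cluster {i + 1}')
plt.title('Clusters of Customers')
plt.xlabel('Annual Income (k$)')
plt.ylabel('Spending Score (1-100)')
plt.legend()
plt.show()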
Output: By executing the above lines of code, we will get the below output: