Week 15 - Clustering

This document discusses clustering and k-means clustering. It begins with an overview of clustering and how it differs from classification by identifying similarities between objects rather than assigning them to predefined classes. It then focuses on k-means clustering, explaining that it is an unsupervised machine learning algorithm that groups objects into k clusters based on their characteristics. The document walks through the steps of how k-means clustering works, including randomly selecting initial clusters, assigning data points to the nearest cluster, recalculating cluster means, and repeating the process until cluster assignments no longer change. It also discusses how to determine the optimal number of clusters k.

Uploaded by

Royer Rojano

Clustering

Nallig Leal Narváez

PhD in Systems Engineering and Computing

Universidad Autónoma del Caribe

CONTENT

Fundamentals

• Learning
• Classification and Clustering

Clustering

• K-Means Clustering

Fundamentals – Learning

[Figure: supervised vs. unsupervised learning]

Classification and Clustering

Classification and clustering are two pattern-identification
methods used in machine learning. Although the techniques
share certain similarities, they differ in one key respect:
classification assigns objects to predefined classes, while
clustering identifies similarities between objects and groups
them according to the characteristics they have in common
and that differentiate them from other groups of objects.
These groups are known as "clusters". [1]

[1] https://blog.bismart.com/en/classification-vs.-clustering-a-practical-explanation
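
The contrast can be illustrated with a small Python sketch. It is only an illustration under stated assumptions: the data values, the labels "small" and "large", and the helper names `classify` and `cluster_by_gap` are hypothetical, and the two techniques used (1-nearest-neighbor classification and grouping by gaps) are stand-ins chosen for brevity, not methods taken from the slides.

```python
# Classification: the classes ("small", "large") are predefined, and
# labeled examples show what each class looks like. (Hypothetical data.)
labeled = [(1.0, "small"), (1.5, "small"), (9.0, "large"), (10.0, "large")]


def classify(x, examples):
    """Give x the label of its closest labeled example (1-nearest neighbor)."""
    return min(examples, key=lambda ex: abs(x - ex[0]))[1]


# Clustering: no labels are given; groups emerge from similarity alone.
def cluster_by_gap(values, max_gap):
    """Group sorted values, starting a new group when a gap exceeds max_gap."""
    groups = []
    for v in sorted(values):
        if groups and v - groups[-1][-1] <= max_gap:
            groups[-1].append(v)
        else:
            groups.append([v])
    return groups


print(classify(2.0, labeled))                              # small
print(cluster_by_gap([10.0, 1.0, 9.0, 1.5], max_gap=2.0))  # [[1.0, 1.5], [9.0, 10.0]]
```

The classifier returns one of the predefined class names, while the clustering function returns unnamed groups whose meaning has to be interpreted afterwards.
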
K-Means Clustering – What is it?

The K-Means algorithm is an unsupervised machine learning clustering algorithm that groups objects into K groups based on their characteristics.

Graphics were taken from StatQuest by Josh Starmer.

K-Means Clustering – How does it work?

Suppose you have data that you need to put into three clusters.

In this case the data form three relatively obvious clusters.

Step 1

Select the number K of clusters you want to identify in your data, for example K = 3.

Step 2

Randomly select 3 distinct data points. These are the initial cluster centers.

Step 3

Measure the distance between the 1st point and each of the three initial cluster centers.

Step 4

Assign the 1st point to the nearest cluster. In this case the nearest cluster is the blue one.

Then, do the same for the other points.

Step 5

Calculate the mean of each cluster. Then repeat the process, measuring distances to the mean values instead of the initial points. Since the clustering does not change in this iteration, the process ends.
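
The steps above can be sketched in Python as follows. This is a minimal sketch, assuming one-dimensional data and absolute difference as the distance (the slides do not name a distance measure). The function name `k_means_1d` and the parameters `initial_centers`, `seed`, and `max_iter` are illustrative, not from the slides; `initial_centers` exists only so that the example is reproducible.

```python
import random


def k_means_1d(points, k, initial_centers=None, seed=0, max_iter=100):
    """Cluster 1-D points into k groups; returns (centers, assignment)."""
    rng = random.Random(seed)
    # Step 2: pick k distinct data points as the initial cluster centers
    # (or use the caller-supplied ones, an assumption for reproducibility).
    if initial_centers is not None:
        centers = list(initial_centers)
    else:
        centers = rng.sample(sorted(set(points)), k)

    assignment = None
    for _ in range(max_iter):
        # Steps 3 and 4: measure the distance from every point to every
        # center and assign the point to the nearest one.
        new_assignment = [
            min(range(k), key=lambda c: abs(p - centers[c])) for p in points
        ]
        # Stop when no point changes cluster.
        if new_assignment == assignment:
            break
        assignment = new_assignment
        # Step 5: recompute each center as the mean of its cluster.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # an empty cluster keeps its previous center
                centers[c] = sum(members) / len(members)
    return centers, assignment


points = [1.0, 1.5, 2.0, 9.0, 9.5, 10.0, 20.0, 21.0, 22.0]
centers, labels = k_means_1d(points, k=3, initial_centers=[1.0, 9.0, 20.0])
print(centers)  # [1.5, 9.5, 21.0]
print(labels)   # [0, 0, 0, 1, 1, 1, 2, 2, 2]
```

Because the initial points are chosen at random, a different start can converge to a different, worse grouping. Starting this example from `[1.0, 1.5, 2.0]`, for instance, leaves the two right-hand groups merged into one cluster.
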

K-Means Clustering – How many clusters?
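
One common heuristic for choosing K, assumed here because the text does not name a method, is the elbow method: run K-Means for K = 1, 2, 3, ... and record the total within-cluster variation (the sum of squared distances from each point to its cluster mean). The variation always tends to shrink as K grows, so the K after which the improvement becomes small (the "elbow") is taken as a reasonable choice. The sketch below uses hypothetical 1-D data and helper names (`total_variation`, `fit_centers`, `variation_by_k`). To stay deterministic it tries every possible choice of initial points, which is feasible only for tiny data sets; in practice a handful of random restarts is used instead.

```python
from itertools import combinations


def total_variation(points, centers):
    """Sum of squared distances from each point to its nearest center."""
    return sum(min((p - c) ** 2 for c in centers) for p in points)


def fit_centers(points, start, max_iter=100):
    """Run the assign/recompute loop of K-Means from the given start."""
    centers = list(start)
    for _ in range(max_iter):
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            groups[nearest].append(p)
        # New center = mean of the group; an empty group keeps its center.
        new_centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
        if new_centers == centers:
            break
        centers = new_centers
    return centers


def variation_by_k(points, max_k):
    """Best total variation found for each K from 1 to max_k."""
    distinct = sorted(set(points))
    return {
        k: min(
            total_variation(points, fit_centers(points, start))
            for start in combinations(distinct, k)
        )
        for k in range(1, max_k + 1)
    }


points = [1.0, 1.5, 2.0, 9.0, 9.5, 10.0, 20.0, 21.0, 22.0]
for k, variation in variation_by_k(points, max_k=5).items():
    print(k, round(variation, 2))
```

For this data the variation falls sharply up to K = 3 and only slightly afterwards, which matches the three visible groups.
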

K-Means Clustering in two dimensions
