0% found this document useful (0 votes)

76 views

KNN Algorithm

The K-Nearest Neighbors (K-NN) algorithm is a simple supervised machine learning algorithm that stores all available data and classifies new data based on similarity. It finds the K closest training examples in the feature space and assigns the new data to the most common class among its K neighbors. K-NN is a non-parametric algorithm that makes no assumptions about the distribution of data and can be used for both classification and regression tasks.

Uploaded by

Megha Sahu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

76 views

KNN Algorithm

Uploaded by

Megha Sahu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

o K-Nearest Neighbour is one of the simplest Machine Learning algorithms

based on Supervised Learning technique.

o K-NN algorithm assumes the similarity between the new case/data and

available cases and put the new case into the category that is most similar to
the available categories.

o K-NN algorithm stores all the available data and classifies a new data point

based on the similarity. This means when new data appears then it can be
easily classified into a well suite category by using K- NN algorithm.

o K-NN algorithm can be used for Regression as well as for Classification but

mostly it is used for the Classification problems.

o K-NN is a non-parametric algorithm, which means it does not make any

assumption on underlying data.

o It is also called a lazy learner algorithm because it does not learn from the

training set immediately instead it stores the dataset and at the time of
classification, it performs an action on the dataset.

o KNN algorithm at the training phase just stores the dataset and when it gets

new data, then it classifies that data into a category that is much similar to
the new data

Suppose there are two categories, i.e., Category A and Category B, and we
have a new data point x1, so this data point will lie in which of these
categories. To solve this type of problem, we need a K-NN algorithm. With
the help of K-NN, we can easily identify the category or class of a particular
dataset. Consider the below diagram:
How does K-NN work?
The K-NN working can be explained on the basis of the below algorithm:

o Step-1: Select the number K of the neighbors

o Step-2: Calculate the Euclidean distance of K number of neighbors

o Step-3: Take the K nearest neighbors as per the calculated Euclidean

distance.

o Step-4: Among these k neighbors, count the number of the data points in

each category.

o Step-5: Assign the new data points to that category for which the number of

the neighbor is maximum.

o Step-6: Our model is ready.

Suppose we have a new data point and we need to put it in the required
category. Consider the below image:
o Firstly, we will choose the number of neighbors, so we will choose the k=5.

o Next, we will calculate the Euclidean distance between the data points. The

Euclidean distance is the distance between two points, which we have

already studied in geometry. It can be calculated as

o By calculating the Euclidean distance we got the nearest neighbors, as three

nearest neighbors in category A and two nearest neighbors in category B.

Consider the below image:
o As we can see the 3 nearest neighbors are from category A, hence this new

data point must belong to category A.

How to select the value of K in the K-NN Algorithm?

Below are some points to remember while selecting the value of K in the K-
NN algorithm:

o There is no particular way to determine the best value for "K", so we need to

try some values to find the best out of them. The most preferred value for K
is 5.

o A very low value for K such as K=1 or K=2, can be noisy and lead to the

effects of outliers in the model.

o Large values for K are good, but it may find some difficulties.

Advantages of KNN Algorithm:

o It is simple to implement.

o It is robust to the noisy training data

o It can be more effective if the training data is large.

Disadvantages of KNN Algorithm:

o Always needs to determine the value of K which may be complex some time.

o The computation cost is high because of calculating the distance between the

data points for all the training samples.

A Mobile Cloud Computing System For Emergency Management
No ratings yet
A Mobile Cloud Computing System For Emergency Management
23 pages
6 - KNN Classifier
No ratings yet
6 - KNN Classifier
10 pages
Introduction To Cassandra
No ratings yet
Introduction To Cassandra
15 pages
00 Buku G - Creating Smart Enterprises - Leveraging Cloud, Big Data, Web, Social Media, Mobile and IoT Technologies
No ratings yet
00 Buku G - Creating Smart Enterprises - Leveraging Cloud, Big Data, Web, Social Media, Mobile and IoT Technologies
409 pages
KNN
No ratings yet
KNN
9 pages
Knn
No ratings yet
Knn
5 pages
Knn
No ratings yet
Knn
3 pages
ML Unit-2
No ratings yet
ML Unit-2
24 pages
Unit 3 KNN
No ratings yet
Unit 3 KNN
16 pages
K-Nearest Neighbor Algorithm
100% (1)
K-Nearest Neighbor Algorithm
6 pages
K Nearest Neighbor (KNN)
No ratings yet
K Nearest Neighbor (KNN)
9 pages
K-Nearest Neighbor Algorithm
No ratings yet
K-Nearest Neighbor Algorithm
6 pages
K-Nearest Neighbour (KNN) Algorithm_f3ec27282ed7dde87d5cf56f95272d1a
No ratings yet
K-Nearest Neighbour (KNN) Algorithm_f3ec27282ed7dde87d5cf56f95272d1a
5 pages
AI_5
No ratings yet
AI_5
11 pages
Seminar Report File On KNN Models: University Institute of Engineering and Technology, Kurukshetra University
No ratings yet
Seminar Report File On KNN Models: University Institute of Engineering and Technology, Kurukshetra University
24 pages
K-Nearest Neighbor Classification-Algorithm and Characteristics
No ratings yet
K-Nearest Neighbor Classification-Algorithm and Characteristics
6 pages
K-Nearest Neighbor (KNN)
No ratings yet
K-Nearest Neighbor (KNN)
27 pages
Machine Learning Unit-3.1
No ratings yet
Machine Learning Unit-3.1
20 pages
KNN
No ratings yet
KNN
20 pages
K Nearest Neighbors
No ratings yet
K Nearest Neighbors
22 pages
Day43 KNN Intro
No ratings yet
Day43 KNN Intro
4 pages
ML CH 3
No ratings yet
ML CH 3
88 pages
KNN Algorithm
No ratings yet
KNN Algorithm
15 pages
K-Nearest Neighbor (KNN) : Non-Parametric Algorithm
No ratings yet
K-Nearest Neighbor (KNN) : Non-Parametric Algorithm
7 pages
AI28
No ratings yet
AI28
5 pages
K-Nearest Neighbor (KNN) Algorithm For Machine Learning - Javatpoint
No ratings yet
K-Nearest Neighbor (KNN) Algorithm For Machine Learning - Javatpoint
18 pages
CSL0777 L22
No ratings yet
CSL0777 L22
35 pages
Unit V: Distance and Rule Based Models
No ratings yet
Unit V: Distance and Rule Based Models
56 pages
KNN Algo
No ratings yet
KNN Algo
7 pages
ML Mid2 Ans
No ratings yet
ML Mid2 Ans
24 pages
K-Nearest Neighbor (KNN) Algorithm For Machine Learning
No ratings yet
K-Nearest Neighbor (KNN) Algorithm For Machine Learning
17 pages
UNIT 3 ML Distance Based Learning
No ratings yet
UNIT 3 ML Distance Based Learning
19 pages
14 K - Nearest Neighbours
No ratings yet
14 K - Nearest Neighbours
8 pages
Untitled 9
No ratings yet
Untitled 9
17 pages
'Machine Learning (Nagarjun)
No ratings yet
'Machine Learning (Nagarjun)
10 pages
Machine Learning-Lecture 03
No ratings yet
Machine Learning-Lecture 03
19 pages
Distance-Based Methods - KNN
No ratings yet
Distance-Based Methods - KNN
8 pages
k-Nearest Neighbors (k-NN) Algorithm
No ratings yet
k-Nearest Neighbors (k-NN) Algorithm
10 pages
lec46
No ratings yet
lec46
12 pages
ML Lecture#10
No ratings yet
ML Lecture#10
17 pages
ML-UNIT-2
No ratings yet
ML-UNIT-2
46 pages
ML-Unit 5
No ratings yet
ML-Unit 5
40 pages
UNIT-2 K-Nn-March 2024
No ratings yet
UNIT-2 K-Nn-March 2024
23 pages
FPA unit 2
No ratings yet
FPA unit 2
20 pages
K - Nearest Neighbor
No ratings yet
K - Nearest Neighbor
2 pages
K-Nearest Neighbors: KNN Algorithm Pseudocode
No ratings yet
K-Nearest Neighbors: KNN Algorithm Pseudocode
2 pages
KNN
No ratings yet
KNN
53 pages
KNN - Feb 19
No ratings yet
KNN - Feb 19
42 pages
CPE412 Pattern Recognition (Week 6)
No ratings yet
CPE412 Pattern Recognition (Week 6)
27 pages
Adobe Scan 16 May 2023 (3)
No ratings yet
Adobe Scan 16 May 2023 (3)
9 pages
K-Nearest Neighbor
No ratings yet
K-Nearest Neighbor
22 pages
Sample KNN
No ratings yet
Sample KNN
7 pages
K Nearestneighborknnalgorithm 241117075907 d767c46d
No ratings yet
K Nearestneighborknnalgorithm 241117075907 d767c46d
13 pages
KNN
No ratings yet
KNN
16 pages
Clustering - KNN
No ratings yet
Clustering - KNN
10 pages
lec36
No ratings yet
lec36
13 pages
Bài-nhóm-tìm-hiểu-về-KNN
No ratings yet
Bài-nhóm-tìm-hiểu-về-KNN
5 pages
ML Unit -2
No ratings yet
ML Unit -2
85 pages
Presentation UNIT-2(Old)
No ratings yet
Presentation UNIT-2(Old)
58 pages
14-15 ASAP Advanced Statistics Clasification Techniques KNN
No ratings yet
14-15 ASAP Advanced Statistics Clasification Techniques KNN
49 pages
Week 10
No ratings yet
Week 10
41 pages
Lecture 14 and 15
No ratings yet
Lecture 14 and 15
42 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
SQL Notes
No ratings yet
SQL Notes
69 pages
PST Assignment 1
No ratings yet
PST Assignment 1
10 pages
Optimizing Replication of Data For Distributed Cloud Computing Environments Techniques Challenges and Research Gap
No ratings yet
Optimizing Replication of Data For Distributed Cloud Computing Environments Techniques Challenges and Research Gap
7 pages
DBMS Course File
No ratings yet
DBMS Course File
193 pages
Bict DBMS
No ratings yet
Bict DBMS
6 pages
DBMS
No ratings yet
DBMS
1 page
Rajalakshmi Engineering College
100% (2)
Rajalakshmi Engineering College
3 pages
4.0 - Lession 2 - DW
No ratings yet
4.0 - Lession 2 - DW
55 pages
CSC - 331 - Data Management I - Lecture 1-1
No ratings yet
CSC - 331 - Data Management I - Lecture 1-1
25 pages
SQL Interview Questions - GeekInterview
No ratings yet
SQL Interview Questions - GeekInterview
17 pages
Nihal Ashik - PythonSQL Program of The BOISSS PDF
No ratings yet
Nihal Ashik - PythonSQL Program of The BOISSS PDF
13 pages
PHP Code Example For View Edit Delete Search Update Database Table
93% (14)
PHP Code Example For View Edit Delete Search Update Database Table
12 pages
Abbreviated Component Maintenance Manual With Illustrated Parts List
No ratings yet
Abbreviated Component Maintenance Manual With Illustrated Parts List
19 pages
Visual Search at Alibaba: Yanhao Zhang, Pan Pan, Yun Zheng, Kang Zhao, Yingya Zhang, Xiaofeng Ren, Rong Jin
No ratings yet
Visual Search at Alibaba: Yanhao Zhang, Pan Pan, Yun Zheng, Kang Zhao, Yingya Zhang, Xiaofeng Ren, Rong Jin
9 pages
Enterprise Guide To Customer Data Platforms - Webinar Deck (Final) - 2
No ratings yet
Enterprise Guide To Customer Data Platforms - Webinar Deck (Final) - 2
36 pages
PGIS Practical
No ratings yet
PGIS Practical
70 pages
Dashboard 8 Teacher ManualsupportMaterialTMD-8
50% (2)
Dashboard 8 Teacher ManualsupportMaterialTMD-8
84 pages
Claim Management System
No ratings yet
Claim Management System
3 pages
SMART TASK MANAGER WITH AI POWERED PRIORITIZATION
No ratings yet
SMART TASK MANAGER WITH AI POWERED PRIORITIZATION
40 pages
Git Pocket Guide A Working Introduction 1st Edition Richard E. Silverman - The 2025 ebook edition is available with updated content
No ratings yet
Git Pocket Guide A Working Introduction 1st Edition Richard E. Silverman - The 2025 ebook edition is available with updated content
57 pages
Coverity Security Report Pttep Bankguarantee - Snid 39791
No ratings yet
Coverity Security Report Pttep Bankguarantee - Snid 39791
16 pages
Atomic Commit and Concurrency Control: COS 418: Distributed Systems Wyatt Lloyd
No ratings yet
Atomic Commit and Concurrency Control: COS 418: Distributed Systems Wyatt Lloyd
40 pages
ETL Test Scenarios
No ratings yet
ETL Test Scenarios
4 pages
DBMS Module 1
No ratings yet
DBMS Module 1
7 pages
Prakash, Chandra - Google Cloud Professional Data Engineer Practice Tests 2019 - GCP Data Engineer Dumps 2019. 100 - Unconditional Pass Guarantee Ex (2019, 万千书友聚集地) - Libgen.li
No ratings yet
Prakash, Chandra - Google Cloud Professional Data Engineer Practice Tests 2019 - GCP Data Engineer Dumps 2019. 100 - Unconditional Pass Guarantee Ex (2019, 万千书友聚集地) - Libgen.li
141 pages
Unit 2 - Data Preprocessing
No ratings yet
Unit 2 - Data Preprocessing
23 pages
P2 - ER Diagram
No ratings yet
P2 - ER Diagram
2 pages

KNN Algorithm

Uploaded by

KNN Algorithm

Uploaded by

o K-Nearest Neighbour is one of the simplest Machine Learning algorithms

based on Supervised Learning technique.

mostly it is used for the Classification problems.

o K-NN is a non-parametric algorithm, which means it does not make any

assumption on underlying data.

o Step-1: Select the number K of the neighbors

o Step-2: Calculate the Euclidean distance of K number of neighbors

o Step-3: Take the K nearest neighbors as per the calculated Euclidean

o Step-4: Among these k neighbors, count the number of the data points in

the neighbor is maximum.

o Step-6: Our model is ready.

o Next, we will calculate the Euclidean distance between the data points. The

Euclidean distance is the distance between two points, which we have

o By calculating the Euclidean distance we got the nearest neighbors, as three

nearest neighbors in category A and two nearest neighbors in category B.

data point must belong to category A.

How to select the value of K in the K-NN Algorithm?

effects of outliers in the model.

Advantages of KNN Algorithm:

o It is robust to the noisy training data

o It can be more effective if the training data is large.

Disadvantages of KNN Algorithm:

data points for all the training samples.

You might also like