KNN Algorithm
DESCRIPTION
A nearest-neighbor classification object, where both distance metric ("nearest") and number of neighbors can be altered. The object classifies new observations using the predict method. The object contains the data used for training, so it can compute resubstitution predictions.
CLASSIFICATION
mdl = ClassificationKNN.fit(X,Y) creates a k-nearest neighbor classification model. For details, see ClassificationKNN.fit.
mdl = ClassificationKNN.fit(X,Y,Name,Value) creates a classifier with additional options specified by one or more Name,Value pair arguments. For details, see ClassificationKNN.fit.
Input Arguments

X
Matrix of predictor values. Each column of X represents one variable, and each row represents one observation.

Y
Grouping variable of response values with the same number of elements (rows) as X. Each entry in Y is the response to the data in the corresponding row of X.
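For example, a minimal construction-and-prediction sketch using the fisheriris sample data set that ships with Statistics Toolbox (the variables meas and species come from that file; the query point is made up for illustration):

load fisheriris
X = meas;                                % 150-by-4 numeric predictor matrix
Y = species;                             % 150-by-1 cell array of class labels
mdl = ClassificationKNN.fit(X,Y,'NumNeighbors',5);
label = predict(mdl,[5.9 3.0 5.1 1.8])   % classify one new observation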
Properties

BreakTies
String specifying the method predict uses to break ties if multiple classes have the same smallest cost. By default, ties occur when multiple classes have the same number of nearest points among the K nearest neighbors.

'nearest'   Use the class with the nearest neighbor among tied groups.
'random'    Use a random tiebreaker among tied groups.
'smallest'  Use the smallest index among tied groups.

'BreakTies' applies when 'IncludeTies' is false. Change BreakTies using dot addressing: mdl.BreakTies = newBreakTies
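A short sketch, continuing from the fisheriris model above (an even NumNeighbors makes tied votes more likely; the query point is made up):

mdl.NumNeighbors = 4;        % even K, so tied votes can occur
mdl.BreakTies = 'random';    % resolve ties at random instead of by distance
label = predict(mdl,[6.3 2.8 4.9 1.7])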
CategoricalPredictors
Specification of which predictors are categorical.

'all'  All predictors are categorical.
[]     No predictors are categorical.

ClassNames
List of elements in the training data Y with duplicates removed. ClassNames can be a numeric vector, vector of categorical variables (nominal or ordinal), logical vector, character array, or cell array of strings. ClassNames has the same data type as the data in the argument Y. Change ClassNames using dot addressing: mdl.ClassNames = newClassNames
Cost
Square matrix, where Cost(i,j) is the cost of classifying a point into class j if its true class is i. Cost is K-by-K, where K is the number of classes. Change a Cost matrix using dot addressing: mdl.Cost = costMatrix
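For example, with the three fisheriris classes, a sketch of a cost matrix that doubles the penalty for misclassifying the second class listed in mdl.ClassNames (the default cost is 1 off the diagonal and 0 on it):

costMatrix = ones(3) - eye(3);           % default: unit cost for every misclassification
costMatrix(2,:) = 2*costMatrix(2,:);     % true class 2 costs twice as much to misclassify
mdl.Cost = costMatrix;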
Distance
String or function handle specifying the distance metric. The allowable strings depend on the NSMethod parameter, which you set in ClassificationKNN.fit, and which exists as a field in ModelParams.

NSMethod     Distance Metric Names
exhaustive   Any distance metric of ExhaustiveSearcher
kdtree       'cityblock', 'chebychev', 'euclidean', or 'minkowski'

For definitions, see Distance Metrics. The distance metrics of ExhaustiveSearcher:

Value           Description
'cityblock'     City block distance.
'chebychev'     Chebychev distance (maximum coordinate difference).
'correlation'   One minus the sample linear correlation between observations (treated as sequences of values).
'cosine'        One minus the cosine of the included angle between observations (treated as vectors).
'euclidean'     Euclidean distance.
'hamming'       Hamming distance, the percentage of coordinates that differ.
'jaccard'       One minus the Jaccard coefficient, the percentage of nonzero coordinates that differ.
'mahalanobis'   Mahalanobis distance, computed using a positive definite covariance matrix C. The default value of C is the sample covariance matrix of X, as computed by nancov(X). To specify a different value for C, use the 'Cov' name-value pair.
'minkowski'     Minkowski distance. The default exponent is 2. To specify a different exponent, use the 'P' name-value pair.
'seuclidean'    Standardized Euclidean distance. Each coordinate difference between X and a query point is scaled, meaning divided by a scale value S. The default value of S is the standard deviation computed from X, S = nanstd(X). To specify another value for S, use the 'Scale' name-value pair.
'spearman'      One minus the sample Spearman's rank correlation between observations (treated as sequences of values).
@distfun        Distance function handle. distfun has the form

function D2 = DISTFUN(ZI,ZJ)
% calculation of distance
...

where
ZI is a 1-by-N vector containing one row of X or Y.
ZJ is an M2-by-N matrix containing multiple rows of X or Y.
D2 is an M2-by-1 vector of distances, and D2(k) is the distance between observations ZI and ZJ(k,:).
If NSMethod is kdtree, you can use dot addressing to change Distance only among the types 'cityblock', 'chebychev', 'euclidean', or 'minkowski'.
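As a sketch of the DISTFUN form above, a weighted Euclidean distance written as an anonymous function; the weight vector w is hypothetical and matches the four fisheriris predictors:

w = [1 1 0.5 0.5];                                      % hypothetical per-column weights
weuclid = @(ZI,ZJ) sqrt(bsxfun(@minus,ZJ,ZI).^2 * w');  % returns the M2-by-1 vector D2
mdl2 = ClassificationKNN.fit(X,Y,'Distance',weuclid);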
DistanceWeight
String or function handle specifying the distance weighting function.

DistanceWeight     Meaning
'equal'            No weighting
'inverse'          Weight is 1/distance
'squaredinverse'   Weight is 1/distance^2
@fcn               fcn is a function that accepts a matrix of nonnegative distances, and returns a matrix the same size containing nonnegative distance weights. For example, 'squaredinverse' is equivalent to @(d)d.^(-2).
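A sketch assuming DistanceWeight can be changed by dot addressing like the other properties here; the Gaussian-style kernel is an arbitrary choice for illustration:

mdl.DistanceWeight = @(d) exp(-d.^2);   % nonnegative weights, same size as d
label = predict(mdl,[6.0 2.9 4.5 1.5])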
DistParameter
Additional parameter for the distance metric.

Distance Metric   Parameter
'mahalanobis'     Positive definite covariance matrix C.
'minkowski'       Minkowski distance exponent, a positive scalar.
'seuclidean'      Vector of positive scale values with length equal to the number of columns of X.

For values of the distance metric other than those in the table, DistParameter must be []. Change DistParameter using dot addressing: mdl.DistParameter = newDistParameter
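For example, fitting with a Minkowski metric and a non-default exponent via the 'P' name-value pair mentioned above, then reading the exponent back:

mdl3 = ClassificationKNN.fit(X,Y,'Distance','minkowski','P',3);
mdl3.DistParameter    % returns the Minkowski exponent, 3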
IncludeTies
Logical value indicating whether predict includes all the neighbors whose distance values are equal to the Kth smallest distance. If IncludeTies is true, predict includes all these neighbors. Otherwise, predict uses exactly K neighbors (see 'BreakTies'). Change IncludeTies using dot addressing: mdl.IncludeTies = newIncludeTies
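A short sketch continuing the fisheriris model (the query point is made up):

mdl.IncludeTies = true;                  % count every neighbor tied at the Kth smallest distance
label = predict(mdl,[5.8 2.7 4.1 1.0])   % BreakTies applies only while IncludeTies is false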
ModelParams
Parameters used in training mdl.

NObservations
Number of observations used in training mdl. This can be less than the number of rows in the training data, because data rows containing NaN values are not part of the fit.
NumNeighbors
Positive integer specifying the number of nearest neighbors in X to find for classifying each point when predicting. Change NumNeighbors using dot addressing: mdl.NumNeighbors = newNumNeighbors
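For example, comparing the resubstitution error for two neighborhood sizes (resubLoss is listed under Methods below):

mdl.NumNeighbors = 3;
err3 = resubLoss(mdl);    % resubstitution classification error with K = 3
mdl.NumNeighbors = 9;
err9 = resubLoss(mdl);    % error with K = 9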
PredictorNames
Cell array of names for the predictor variables, in the order in which they appear in the training data X. Change PredictorNames using dot addressing: mdl.PredictorNames = newPredictorNames
Prior
Prior probabilities for each class. Prior is a numeric vector whose entries relate to the corresponding ClassNames property. Add or change a Prior vector using dot addressing: mdl.Prior = priorVector
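A sketch setting equal prior probabilities for the three fisheriris classes, in the order given by mdl.ClassNames:

mdl.Prior = [1/3 1/3 1/3];   % nonnegative entries, one per class in ClassNames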
ResponseName
String describing the response variable Y. Change ResponseName using dot addressing: mdl.ResponseName = newResponseName
W
Numeric vector of nonnegative weights with the same number of rows as Y. Each entry in W specifies the relative importance of the corresponding observation in Y. Change W using dot addressing: mdl.W = newW
X
Numeric matrix of predictor values. Each column of X represents one predictor (variable), and each row represents one observation.
Y
Numeric vector of response values with the same number of rows as X. Each entry in Y is the response to the data in the corresponding row of X.
Methods

crossval       Cross-validated k-nearest neighbor classifier
edge           Edge of k-nearest neighbor classifier
fit            Fit k-nearest neighbor classifier
loss           Loss of k-nearest neighbor classifier
margin         Margin of k-nearest neighbor classifier
predict        Predict k-nearest neighbor classification
resubEdge      Edge of k-nearest neighbor classifier by resubstitution
resubLoss      Loss of k-nearest neighbor classifier by resubstitution
resubMargin    Margin of k-nearest neighbor classifier by resubstitution
resubPredict   Predict resubstitution response of k-nearest neighbor classifier
template       k-nearest neighbor classifier template for ensemble
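For example, a cross-validation sketch using crossval from the table above; kfoldLoss is a method of the cross-validated object that crossval returns, not of ClassificationKNN itself:

cvmdl = crossval(mdl);       % 10-fold cross-validated model by default
cvErr = kfoldLoss(cvmdl);    % cross-validated classification error
resubErr = resubLoss(mdl);   % resubstitution error, usually optimistic by comparison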
Definitions

Prediction
ClassificationKNN predicts the classification of a point Xnew using a procedure equivalent to this:
1. Find the NumNeighbors points in the training set X that are nearest to Xnew.
2. Find the NumNeighbors response values Y to those nearest points.
3. Assign the classification label Ynew that has the smallest expected misclassification cost among the values in Y.
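A rough sketch of this procedure for the default equal misclassification costs and observation weights, using knnsearch and a simple majority vote (it ignores ties, so it only matches predict in the straightforward cases):

xnew = [5.9 3.0 5.1 1.8];                       % made-up query point
idx = knnsearch(X,xnew,'K',mdl.NumNeighbors);   % step 1: nearest training rows
neighborLabels = Y(idx);                        % step 2: their response values
[g,gn] = grp2idx(neighborLabels);               % map labels to group indices
ynew = gn{mode(g)}                              % step 3: most common class wins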