Module 3


Agenda

• Nearest Neighbor techniques
• Cost functions and optimization techniques
• Introduction to Gradient Descent and its application to Linear Regression
• Ensemble Learning algorithms – Bagging (Random Forest), Boosting (AdaBoost)
Nearest Neighbor Classifiers
• Basic idea:
• If it walks like a duck and quacks like a duck, then it’s probably a duck.
[Figure: compute the distance from the test record to the training records, then choose the k “nearest” records.]
Nearest-Neighbor Classifiers
• Requires three things:
  – The set of stored records
  – A distance metric to compute the distance between records
  – The value of k, the number of nearest neighbors to retrieve

• To classify an unknown record:
  – Compute its distance to the training records
  – Identify the k nearest neighbors
  – Use the class labels of the nearest neighbors to determine the class label of the unknown record (e.g., by taking a majority vote)
Definition of Nearest Neighbor

[Figure: (a) 1-nearest neighbor, (b) 2-nearest neighbor, (c) 3-nearest neighbor of a test point x]

• The k-nearest neighbors of a record x are the data points that have the k smallest distances to x.
Nearest Neighbor Classification
• Compute the distance between two points:
  • Euclidean distance:

      d(p, q) = √( Σ_i (p_i − q_i)² )
• Determine the class from the nearest-neighbor list:
  • Take the majority vote of the class labels among the k nearest neighbors
  • Optionally weigh each vote according to distance, e.g. with weight factor w = 1/d²
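
A minimal sketch of these two steps in Python (the helpers euclidean and weighted_vote are illustrative names, not from the slides): it computes the Euclidean distance and combines the k nearest neighbors with a distance-weighted vote, w = 1/d².

```python
import math
from collections import defaultdict

def euclidean(p, q):
    # d(p, q) = sqrt( sum_i (p_i - q_i)^2 )
    return math.sqrt(sum((pi - qi) ** 2 for pi, qi in zip(p, q)))

def weighted_vote(neighbors):
    # neighbors: list of (distance, class_label) pairs for the k nearest records;
    # each vote is weighted by w = 1/d^2, so closer neighbors count more
    scores = defaultdict(float)
    for d, label in neighbors:
        scores[label] += 1.0 / (d ** 2 + 1e-12)   # small epsilon guards against d = 0
    return max(scores, key=scores.get)

print(euclidean((1.0, 2.0), (4.0, 6.0)))                               # -> 5.0
print(weighted_vote([(0.5, "duck"), (1.0, "duck"), (0.8, "goose")]))   # -> "duck"
```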
Nearest Neighbor Classification…
• Choosing the value of k:
  • If k is too small, the classifier is susceptible to overfitting due to noise points in the training data.
  • If k is too large, the neighborhood may include points from other classes.
Nearest Neighbor Classification…
• Scaling issues
  • Attributes may have to be scaled to prevent the distance measure from being dominated by one of the attributes.
  • Example:
    • height of a person may vary from 1.5 m to 1.8 m
    • weight of a person may vary from 90 lb to 300 lb
    • income of a person may vary from $10K to $1M
  • Solution: normalize the attribute vectors, e.g. to unit length (see the sketch below).
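
A short sketch of the scaling step on made-up records with the three attributes above. Min-max scaling of each attribute is shown as one common option alongside the slide’s unit-length normalization; the sample values are purely illustrative.

```python
import numpy as np

# made-up records: [height (m), weight (lb), income ($)]
X = np.array([
    [1.5,  90.0,    10_000.0],
    [1.8, 300.0, 1_000_000.0],
    [1.7, 160.0,    55_000.0],
])

# option 1: min-max scale each attribute to [0, 1] so income no longer dominates the distance
X_minmax = (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0))

# option 2 (as on the slide): normalize each record vector to unit length
X_unit = X / np.linalg.norm(X, axis=1, keepdims=True)

print(X_minmax)
print(X_unit)
```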
Nearest Neighbor Classification…
1. Let k be the number of nearest neighbors and D be the set of training examples.
2. for each test example z = (x’, y’) do
   2.1 Compute d(x’, x), the distance between z and every example (x, y) ∈ D.
   2.2 Select D_z ⊆ D, the set of the k training examples closest to z.
   2.3 y’ = argmax_v Σ_{(x_i, y_i) ∈ D_z} I(v = y_i)   (majority vote)
   2.4 end for


Nearest Neighbor Classification…
• k-NN classifiers are lazy learners
  • They do not build models explicitly
  • Unlike eager learners such as decision tree induction and rule-based systems
  • Classifying unknown records is therefore relatively expensive
Very Important Topic
• Gradient Descent in Linear Regression:
  https://www.analyticsvidhya.com/blog/2021/04/gradient-descent-in-linear-regression/
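
As a brief illustration of the linked topic, here is a hedged sketch of batch gradient descent for simple linear regression, minimizing the mean squared error cost on made-up synthetic data; the learning rate and iteration count are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 10, 100)
y = 3.0 * x + 4.0 + rng.normal(0, 1, 100)   # synthetic data with a known slope and intercept

w, b, lr = 0.0, 0.0, 0.01                   # start from zero; illustrative learning rate
for _ in range(2000):
    y_hat = w * x + b                       # current predictions
    error = y_hat - y
    dw = 2 * np.mean(error * x)             # dJ/dw for J = mean squared error
    db = 2 * np.mean(error)                 # dJ/db
    w -= lr * dw
    b -= lr * db

print(round(w, 2), round(b, 2))             # should land near the true values 3 and 4
```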
Ensemble Methods/Learning
• Construct a set of classifiers from the training data
• Predict the class label of previously unseen records by aggregating the predictions made by multiple classifiers
• Improves the classification accuracy
• The predicted outputs of the base classifiers are combined by majority voting
• Build different experts and let them vote.
General Idea
• Step 1: Create multiple data sets D1, D2, …, Dt-1, Dt from the original training data D.
• Step 2: Build multiple classifiers C1, C2, …, Ct-1, Ct, one from each data set.
• Step 3: Combine the classifiers into a single ensemble classifier C*.

(General procedure for ensemble methods)
Bagging
• Each bagging round draws a bootstrap sample (sampling with replacement, same size as the original data):

Original Data 1 2 3 4 5 6 7 8 9 10
Bagging (Round 1) 7 8 10 8 2 5 10 10 5 9
Bagging (Round 2) 1 4 9 1 2 3 2 7 3 2
Bagging (Round 3) 1 8 5 10 5 5 9 6 3 7
Example of Bagging
• Refer to the notes for a numerical example.
• Data set used to construct an ensemble of bagging classifiers (see the sketch below):

x   0.1  0.2  0.3  0.4  0.5  0.6  0.7  0.8  0.9  1.0
y    1    1    1   -1   -1   -1   -1    1    1    1
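
A rough sketch of bagging on this toy data set, assuming one-level decision stumps as base classifiers; the stump learner and the number of rounds are illustrative assumptions, not specified by the slides.

```python
import numpy as np

x = np.array([0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0])
y = np.array([1, 1, 1, -1, -1, -1, -1, 1, 1, 1])

def fit_stump(xs, ys):
    # choose the split threshold and orientation with the fewest errors on (xs, ys)
    best = None
    for t in np.unique(xs):
        for left in (1, -1):
            pred = np.where(xs <= t, left, -left)
            err = np.sum(pred != ys)
            if best is None or err < best[0]:
                best = (err, t, left)
    return best[1], best[2]   # threshold, label for the "x <= t" side

rng = np.random.default_rng(42)
stumps = []
for _ in range(10):                                  # 10 bagging rounds
    idx = rng.integers(0, len(x), size=len(x))       # bootstrap sample, drawn with replacement
    stumps.append(fit_stump(x[idx], y[idx]))

# combine the base classifiers by majority vote (sign of the summed predictions)
votes = sum(np.where(x <= t, left, -left) for t, left in stumps)
print(np.sign(votes))
```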
Boosting
• An iterative procedure that adaptively changes the distribution of the training data by focusing more on previously misclassified records
• Initially, all N records are assigned equal weights, 1/N
• Unlike bagging, the weights may change at the end of each boosting round
• From each boosting sample, a classifier is induced (iteratively) and is used to classify all training examples
• Misclassified examples are assigned higher weights for the next round
Boosting

• Records that are wrongly classified will have their weights increased
• Records that are classified correctly will have their weights decreased

Original Data       1  2  3  4  5  6  7  8  9  10
Boosting (Round 1)  7  3  2  8  7  9  4  10 6  3
Boosting (Round 2)  5  4  9  4  2  5  1  7  4  2
Boosting (Round 3)  4  4  8  10 4  5  4  6  3  4

• Example 4 is hard to classify
• Its weight is increased, so it is more likely to be chosen again in subsequent rounds
• The final ensemble is an aggregate of the base classifiers obtained from each boosting round
How Boosting Works?

Basic Idea
• Suppose there are just 5 training examples {1, 2, 3, 4, 5}
• Initially each example has probability 0.2 (= 1/5) of being sampled
• If the boosting sample for the first round is {2, 4, 4, 3, 2}, a base classifier is built from it
• Suppose 2, 3, 5 are correctly predicted by this classifier and 1, 4 are wrongly predicted:
  • the weights of 1 and 4 are increased
  • the weights of 2, 3 and 5 are decreased
• In the second round of boosting, again 5 samples are drawn, but now 1 and 4 are more likely to be sampled (see the sketch below)
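
A tiny sketch of this re-sampling idea; the up/down weight factors here are made up for illustration (AdaBoost’s actual exponential update appears later).

```python
import numpy as np

rng = np.random.default_rng(0)
examples = np.array([1, 2, 3, 4, 5])
weights = np.full(5, 0.2)                              # initial probability 1/5 each

round1 = rng.choice(examples, size=5, p=weights)       # first boosting sample, e.g. {2, 4, 4, 3, 2}

misclassified = np.isin(examples, [1, 4])              # suppose 1 and 4 were predicted wrongly
weights = np.where(misclassified, weights * 2.0, weights * 0.5)   # illustrative up/down factors
weights /= weights.sum()                               # renormalize to a sampling distribution

round2 = rng.choice(examples, size=5, p=weights)       # 1 and 4 are now more likely to appear
print(round1, weights.round(3), round2)
```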
Boosting
• Boosting is an iterative procedure.
• The distribution of training examples is adaptively changed so that the base classifier in the next iteration focuses more on the examples that were wrongly predicted in the previous iteration.
• Boosting assigns a weight to each example.
• Weights are adaptively changed at the end of each boosting round.
• The weights assigned to the training examples are used in the following ways:
Boosting
1. To draw a set of bootstrap samples from the original data.
2. To let the base classifier learn a model that is biased towards higher-weight examples.

Steps:-
1. Initially the weights of all examples are the same, 1/N.
2. A sample is drawn as per the sampling distribution of the training examples to get a new training set.
3. A classifier is induced from this training set.
Boosting
4. All examples of the original data are classified using this classifier.
5. Wrongly classified examples get an increase in weight; correctly classified examples get a decrease in weight. So, wrongly classified examples receive more focus in subsequent iterations.
6. Repeat steps 2 to 5 k times (k = number of base classifiers).
Boosting
7. As the boosting rounds proceed, wrongly classified examples become more prevalent in the samples.
8. The final ensemble is obtained by aggregating the base classifiers from each boosting round.

Several implementations of the boosting algorithm have been developed. They all differ in terms of:
1) How the weights of the examples are updated.
2) How the predictions of the base classifiers are combined.
Example: AdaBoost

• Error rate of base classifier C_i:

    ε_i = (1/N) Σ_{j=1..N} w_j · I( C_i(x_j) ≠ y_j )

• Importance (weight) of classifier C_i:

    α_i = (1/2) · ln( (1 − ε_i) / ε_i )
Example: AdaBoost


• Weight update after boosting round j:

    w_i^(j+1) = ( w_i^(j) / Z_j ) × exp(−α_j)   if C_j(x_i) = y_i
    w_i^(j+1) = ( w_i^(j) / Z_j ) × exp(+α_j)   if C_j(x_i) ≠ y_i

  where Z_j is the normalization factor (so that the weights sum to 1).
AdaBoost

• Final ensemble classifier:

    C*(x) = argmax_y Σ_{j=1..T} α_j · I( C_j(x) = y )
