Notes On ANOVA For Comparing Multiple Algorithms
1. Introduction
When comparing the performance of multiple algorithms, we often train and test several
algorithms on multiple datasets to evaluate their error rates. Given L algorithms and K training
sets, we induce K classifiers for each algorithm and test them on K validation sets. This results in
L groups of K error rates each. The goal is to determine if there are statistically significant
differences in error rates among these algorithms.
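In practice, the L groups of K error rates are collected by running K-fold cross-validation for each candidate algorithm. The sketch below is a minimal illustration assuming scikit-learn; the dataset and the three classifiers are arbitrary stand-ins for "L algorithms," not something prescribed by these notes.

```python
# Sketch: obtain L groups of K validation error rates via K-fold CV.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score, KFold
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier

X, y = load_breast_cancer(return_X_y=True)              # illustrative dataset
algorithms = {                                           # L = 3 illustrative algorithms
    "logreg": LogisticRegression(max_iter=5000),
    "tree": DecisionTreeClassifier(random_state=0),
    "knn": KNeighborsClassifier(),
}
cv = KFold(n_splits=5, shuffle=True, random_state=0)     # K = 5 folds

# error_rates[name] holds the K validation error rates of one algorithm
error_rates = {
    name: 1.0 - cross_val_score(clf, X, y, cv=cv)        # error = 1 - accuracy
    for name, clf in algorithms.items()
}
```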
2. ANOVA Framework
Objective:
Test whether there are significant differences in mean error rates across L algorithms.
Hypotheses:
Null Hypothesis (H0): All algorithms have the same mean error rate, i.e., μ1 = μ2 = ⋯ = μL.
Alternative Hypothesis (H1): At least one algorithm has a different mean error rate.
Data Assumptions:
Error rates Xij are normally distributed with mean μj and common variance σ².
Each error rate is approximately normal because it is the average of many Bernoulli (0/1) validation outcomes, i.e., a scaled binomial (a quick normality check is sketched below).
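The normality assumption can be spot-checked per algorithm, e.g., with a Shapiro-Wilk test. A small sketch, assuming SciPy and made-up error rates (with only K = 5 folds the check has limited power):

```python
# Sketch: spot-check normality of one algorithm's K error rates.
from scipy import stats

errors = [0.10, 0.12, 0.11, 0.13, 0.12]             # K = 5 validation error rates (illustrative)
stat, p = stats.shapiro(errors)
print(f"Shapiro-Wilk W = {stat:.3f}, p = {p:.3f}")  # large p: no evidence against normality
```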
3. ANOVA Procedure
a. Estimators of Variance:
1. Between-Group Variance (Estimator σ̂²b):
o Group Mean: mj = (1/K) ∑_{i=1}^{K} Xij
o Overall Mean: m = (1/L) ∑_{j=1}^{L} mj
o Between-Group Sum of Squares (SSB): SSB = K ∑_{j=1}^{L} (mj − m)²
o Estimator: σ̂²b = SSB / (L − 1)
2. Within-Group Variance (Estimator σ̂²w):
o Group Variance: S²j = (1/(K − 1)) ∑_{i=1}^{K} (Xij − mj)²
o Within-Group Sum of Squares (SSW): SSW = ∑_{j=1}^{L} ∑_{i=1}^{K} (Xij − mj)²
o Estimator: σ̂²w = SSW / (L·(K − 1)) (see the NumPy sketch below)
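A NumPy sketch of these two estimators, assuming the error rates are arranged in an L × K array (rows = algorithms, columns = folds):

```python
import numpy as np

def anova_estimators(X):
    """X: L x K array of error rates. Returns (between, within) variance estimates."""
    L, K = X.shape
    m_j = X.mean(axis=1)                        # group mean of each algorithm
    m = m_j.mean()                              # overall mean
    ssb = K * np.sum((m_j - m) ** 2)            # between-group sum of squares
    ssw = np.sum((X - m_j[:, None]) ** 2)       # within-group sum of squares
    sigma2_b = ssb / (L - 1)                    # between-group variance estimator
    sigma2_w = ssw / (L * (K - 1))              # within-group variance estimator
    return sigma2_b, sigma2_w
```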
b. F-Ratio Calculation:
F-Ratio: F0 = σ̂²b / σ̂²w
o The F-Ratio compares the variance between groups to the variance within groups.
c. Decision Rule:
If F0 is greater than the critical value F_{α, L−1, L(K−1)} from the F-distribution table, reject the null hypothesis (sketched below).
If F0 is not significant, fail to reject the null hypothesis.
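A sketch of the complete test, assuming SciPy is available for the critical value; the function name and array layout are illustrative:

```python
import numpy as np
from scipy import stats

def anova_f_test(X, alpha=0.05):
    """X: L x K array of error rates. Returns (F0, critical value, reject H0?)."""
    L, K = X.shape
    m_j = X.mean(axis=1)
    m = m_j.mean()
    sigma2_b = K * np.sum((m_j - m) ** 2) / (L - 1)             # between-group variance
    sigma2_w = np.sum((X - m_j[:, None]) ** 2) / (L * (K - 1))  # within-group variance
    f0 = sigma2_b / sigma2_w
    f_crit = stats.f.ppf(1 - alpha, L - 1, L * (K - 1))         # F_{alpha, L-1, L(K-1)}
    return f0, f_crit, f0 > f_crit                              # True -> reject H0
```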
4. ANOVA Table
Total Sum of Squares (SST): SST = ∑_{j=1}^{L} ∑_{i=1}^{K} (Xij − m)², and SST = SSB + SSW.
The results are conventionally summarized as:

Source         | Sum of Squares | df       | Mean Square            | F
Between groups | SSB            | L − 1    | σ̂²b = SSB / (L − 1)    | F0 = σ̂²b / σ̂²w
Within groups  | SSW            | L(K − 1) | σ̂²w = SSW / (L(K − 1)) |
Total          | SST            | LK − 1   |                        |

5. Post Hoc Tests
Purpose:
To identify which specific groups differ after finding a significant difference with ANOVA.
Common Tests:
Least Significant Difference (LSD) Test: t = (mi − mj) / √(2σ̂²w / K) (see the sketch below)
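A sketch of LSD-style pairwise comparisons, assuming SciPy for the t critical value and the same L × K array layout as above:

```python
import numpy as np
from itertools import combinations
from scipy import stats

def lsd_pairwise(X, alpha=0.05):
    """X: L x K array of error rates. Returns [(pair, t statistic, significant?), ...]."""
    L, K = X.shape
    m_j = X.mean(axis=1)
    sigma2_w = np.sum((X - m_j[:, None]) ** 2) / (L * (K - 1))
    se = np.sqrt(2.0 * sigma2_w / K)                   # standard error of a mean difference
    t_crit = stats.t.ppf(1 - alpha / 2, L * (K - 1))   # two-sided critical value
    return [((i, j), (m_j[i] - m_j[j]) / se, abs(m_j[i] - m_j[j]) / se > t_crit)
            for i, j in combinations(range(L), 2)]
```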
ANOVA is a powerful tool for comparing the performance of multiple algorithms. It assesses whether the observed differences in error rates are statistically significant by analyzing the variance within and between groups. A significant result indicates that at least one algorithm's mean error rate differs from the others, warranting further investigation through post hoc tests.
Let's go through a detailed example of ANOVA with calculations and post hoc testing. Suppose
we have three algorithms, and we want to compare their error rates using a 5-fold cross-
validation. Here’s the step-by-step process:
Example Dataset
Let's assume we have the following error rates (in percent) for three algorithms (A, B, and C) across 5 folds:

Fold | Algorithm A | Algorithm B | Algorithm C
1    | 10          | 15          | 20
2    | 12          | 14          | 19
3    | 11          | 16          | 21
4    | 13          | 17          | 22
5    | 12          | 15          | 20
Group Means:
Algorithm A: mA = (10 + 12 + 11 + 13 + 12) / 5 = 58 / 5 = 11.6
Algorithm B: mB = (15 + 14 + 16 + 17 + 15) / 5 = 77 / 5 = 15.4
Algorithm C: mC = (20 + 19 + 21 + 22 + 20) / 5 = 102 / 5 = 20.4
Overall Mean: m = (58 + 77 + 102) / 15 = 237 / 15 = 15.8
Group Variances:
Algorithm A: S²A = [(10 − 11.6)² + (12 − 11.6)² + (11 − 11.6)² + (13 − 11.6)² + (12 − 11.6)²] / (5 − 1) = 5.2 / 4 = 1.3
Algorithm B: S²B = [(15 − 15.4)² + (14 − 15.4)² + (16 − 15.4)² + (17 − 15.4)² + (15 − 15.4)²] / (5 − 1) = 5.2 / 4 = 1.3
Algorithm C: S²C = [(20 − 20.4)² + (19 − 20.4)² + (21 − 20.4)² + (22 − 20.4)² + (20 − 20.4)²] / (5 − 1) = 5.2 / 4 = 1.3
Sums of Squares:
SSB = K ∑_{j=1}^{L} (mj − m)² = 5 · [(11.6 − 15.8)² + (15.4 − 15.8)² + (20.4 − 15.8)²] = 5 · 38.96 = 194.8, so MSB = σ̂²b = 194.8 / (3 − 1) = 97.4
SSW = 5.2 + 5.2 + 5.2 = 15.6, so MSW = σ̂²w = 15.6 / (3 · (5 − 1)) = 1.3
F-Ratio:
F0 = MSB / MSW = 97.4 / 1.3 ≈ 74.9
Since F0 ≈ 74.9 far exceeds the critical value F_{0.05, 2, 12} ≈ 3.89, we reject H0: the algorithms do not all have the same mean error rate (a quick cross-check in code follows).
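As a quick cross-check of the arithmetic, the same F statistic can be obtained with SciPy's one-way ANOVA routine (a sketch; the inputs are the example error rates above):

```python
from scipy import stats

a = [10, 12, 11, 13, 12]   # Algorithm A error rates (%)
b = [15, 14, 16, 17, 15]   # Algorithm B error rates (%)
c = [20, 19, 21, 22, 20]   # Algorithm C error rates (%)

f0, p = stats.f_oneway(a, b, c)
print(f"F0 = {f0:.1f}, p = {p:.2e}")   # F0 ≈ 74.9, p far below 0.05
```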
Post Hoc Comparisons (LSD Test):
Standard Error: √(2σ̂²w / K) = √(2 · 1.3 / 5) = √0.52 ≈ 0.721, with L(K − 1) = 12 degrees of freedom and two-sided critical value t_{0.025, 12} ≈ 2.18.
o Algorithm A vs. B: t = (15.4 − 11.6) / 0.721 = 3.8 / 0.721 ≈ 5.27
o Algorithm A vs. C: t = (20.4 − 11.6) / 0.721 = 8.8 / 0.721 ≈ 12.20
o Algorithm B vs. C: t = (20.4 − 15.4) / 0.721 = 5.0 / 0.721 ≈ 6.93
Summary: All pairwise comparisons are significant, indicating that all algorithms have
significantly different error rates.
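The pairwise statistics can be reproduced the same way (a sketch using the group means and within-group variance computed above):

```python
import numpy as np
from scipy import stats

means = {"A": 11.6, "B": 15.4, "C": 20.4}
sigma2_w, K, df = 1.3, 5, 12                 # MSW, folds, L(K-1) degrees of freedom
se = np.sqrt(2 * sigma2_w / K)               # ≈ 0.72
t_crit = stats.t.ppf(0.975, df)              # ≈ 2.18 (two-sided, alpha = 0.05)

for x, y in [("A", "B"), ("A", "C"), ("B", "C")]:
    t = abs(means[x] - means[y]) / se
    print(f"{x} vs {y}: t = {t:.2f}, significant = {t > t_crit}")
```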
By following these steps, we have used ANOVA to determine that there are significant
differences in error rates among the algorithms and used post hoc tests to pinpoint where those
differences lie.