An empirical comparison of supervised learning algorithms

R Caruana, A Niculescu-Mizil - … of the 23rd international conference on …, 2006 - dl.acm.org
Proceedings of the 23rd international conference on Machine learning, 2006dl.acm.org
A number of supervised learning methods have been introduced in the last decade.
Unfortunately, the last comprehensive empirical evaluation of supervised learning was the
Statlog Project in the early 90's. We present a large-scale empirical comparison between ten
supervised learning methods: SVMs, neural nets, logistic regression, naive bayes, memory-
based learning, random forests, decision trees, bagged trees, boosted trees, and boosted
stumps. We also examine the effect that calibrating the models via Platt Scaling and Isotonic …
A number of supervised learning methods have been introduced in the last decade. Unfortunately, the last comprehensive empirical evaluation of supervised learning was the Statlog Project in the early 90's. We present a large-scale empirical comparison between ten supervised learning methods: SVMs, neural nets, logistic regression, naive bayes, memory-based learning, random forests, decision trees, bagged trees, boosted trees, and boosted stumps. We also examine the effect that calibrating the models via Platt Scaling and Isotonic Regression has on their performance. An important aspect of our study is the use of a variety of performance criteria to evaluate the learning methods.
ACM Digital Library