
Ensembles of Least Squares Classifiers with Randomized Kernels

  • Chapter
Data Mining: Foundations and Practice

Part of the book series: Studies in Computational Intelligence (SCI, volume 118)

Summary

For the recent NIPS-2003 feature selection challenge we studied ensembles of regularized least squares classifiers (RLSC). We showed that stochastic ensembles of simple least squares kernel classifiers reach the same level of accuracy as the best single RLSC, and our results ranked among the best in the challenge. We also showed that the performance of a single RLSC is much more sensitive to the choice of kernel width than that of an ensemble. Continuing this work, we demonstrate that stochastic ensembles of least squares classifiers with randomized kernel widths and out-of-bag (OOB) post-processing often outperform the best single RLSC while requiring practically no parameter tuning. We used the same set of very high-dimensional classification problems presented at the NIPS challenge. Fast exploratory Random Forests were applied first for variable filtering.
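
To make the construction concrete, below is a minimal, self-contained sketch (not the authors' code) of a bagged ensemble of regularized least squares kernel classifiers with randomized RBF kernel widths and a simple out-of-bag evaluation step standing in for the OOB post-processing. The synthetic data, the kernel-width range, the ensemble size, and the regularization constant lam are illustrative assumptions, not values from the chapter.

# Sketch: stochastic ensemble of RLSC members with randomized RBF widths (assumed settings).
import numpy as np

def rbf_kernel(A, B, sigma):
    # Gaussian kernel matrix K[i, j] = exp(-||a_i - b_j||^2 / (2 sigma^2)).
    d2 = (np.sum(A**2, axis=1)[:, None]
          + np.sum(B**2, axis=1)[None, :]
          - 2.0 * A @ B.T)
    return np.exp(-d2 / (2.0 * sigma**2))

def fit_rlsc(X, y, sigma, lam):
    # Regularized least squares in the kernel expansion: solve (K + lam*n*I) c = y.
    n = X.shape[0]
    K = rbf_kernel(X, X, sigma)
    return np.linalg.solve(K + lam * n * np.eye(n), y)

def predict_rlsc(X_train, c, sigma, X_new):
    # Real-valued member output f(x) = sum_i c_i k(x, x_i); sign gives the class.
    return rbf_kernel(X_new, X_train, sigma) @ c

rng = np.random.default_rng(0)

# Toy two-class data; labels coded +1 / -1 as usual for least squares classification.
n, d = 200, 10
X = rng.normal(size=(n, d))
y = np.sign(X[:, 0] + 0.5 * X[:, 1] + 0.3 * rng.normal(size=n))

n_members, lam = 50, 1e-3        # assumed ensemble size and regularization constant
oob_votes, oob_counts = np.zeros(n), np.zeros(n)
members = []

for _ in range(n_members):
    boot = rng.integers(0, n, size=n)                       # bootstrap sample
    oob = np.setdiff1d(np.arange(n), boot)                  # out-of-bag points
    sigma = np.exp(rng.uniform(np.log(0.5), np.log(5.0)))   # randomized kernel width
    c = fit_rlsc(X[boot], y[boot], sigma, lam)
    members.append((boot, c, sigma))
    oob_votes[oob] += predict_rlsc(X[boot], c, sigma, X[oob])
    oob_counts[oob] += 1

# OOB step (a simple stand-in for the chapter's OOB post-processing):
# score the ensemble on points each member did not see during training.
seen = oob_counts > 0
print("OOB accuracy:", np.mean(np.sign(oob_votes[seen]) == y[seen]))

def predict_ensemble(X_new):
    # Average the members' real-valued outputs, then take the sign.
    score = np.zeros(X_new.shape[0])
    for boot, c, sigma in members:
        score += predict_rlsc(X[boot], c, sigma, X_new)
    return np.sign(score / len(members))

print(predict_ensemble(X[:5]))   # example usage on a few training points

Averaging the members' real-valued outputs before taking the sign is how bagging smooths the individual classifiers, and drawing a fresh kernel width for each member is what removes the need to tune that width by hand.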

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Torkkola, K., Tuv, E. (2008). Ensembles of Least Squares Classifiers with Randomized Kernels. In: Lin, T.Y., Xie, Y., Wasilewska, A., Liau, CJ. (eds) Data Mining: Foundations and Practice. Studies in Computational Intelligence, vol 118. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78488-3_22

  • DOI: https://doi.org/10.1007/978-3-540-78488-3_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78487-6

  • Online ISBN: 978-3-540-78488-3

  • eBook Packages: Engineering (R0)
