Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

Scikit-learn: Machine Learning in Python

Published: 01 November 2011 Publication History

Abstract

Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. Emphasis is put on ease of use, performance, documentation, and API consistency. It has minimal dependencies and is distributed under the simplified BSD license, encouraging its use in both academic and commercial settings. Source code, binaries, and documentation can be downloaded from http://scikit-learn.sourceforge.net.

References

[1]
D. Albanese, G. Merler, S.and Jurman, and R. Visintainer. MLPy: high-performance python package for predictive modeling. In NIPS, MLOSS Workshop, 2008.
[2]
C.C. Chang and C.J. Lin. LIBSVM: a library for support vector machines. http://www.csie. ntu.edu.tw/cjlin/libsvm, 2001.
[3]
P.F. Dubois, editor. Python: Batteries Included, volume 9 of Computing in Science & Engineering. IEEE/AIP, May 2007.
[4]
R.E. Fan, K.W. Chang, C.J. Hsieh, X.R. Wang, and C.J. Lin. LIBLINEAR: a library for large linear classification. The Journal of Machine Learning Research, 9:1871-1874, 2008.
[5]
J. Friedman, T. Hastie, and R. Tibshirani. Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33(1):1, 2010.
[6]
I. Guyon, S. R. Gunn, A. Ben-Hur, and G. Dror. Result analysis of the NIPS 2003 feature selection challenge, 2004.
[7]
M. Hanke, Y.O. Halchenko, P.B. Sederberg, S.J. Hanson, J.V. Haxby, and S. Pollmann. PyMVPA: A Python toolbox for multivariate pattern analysis of fMRI data. Neuroinformatics, 7(1):37-53, 2009.
[8]
T. Hastie and B. Efron. Least Angle Regression, Lasso and Forward Stagewise. http://cran. r-project.org/web/packages/lars/lars.pdf, 2004.
[9]
V. Michel, A. Gramfort, G. Varoquaux, E. Eger, C. Keribin, and B. Thirion. A supervised clustering approach for fMRI-based inference of brain states. Patt Rec, page epub ahead of print, April 2011.
[10]
K.J. Milmann and M. Avaizis, editors. Scientific Python, volume 11 of Computing in Science & Engineering. IEEE/AIP, March 2011.
[11]
S.M. Omohundro. Five balltree construction algorithms. ICSI Technical Report TR-89-063, 1989.
[12]
V. Rokhlin, A. Szlam, and M. Tygert. A randomized algorithm for principal component analysis. SIAM Journal on Matrix Analysis and Applications, 31(3):1100-1124, 2009.
[13]
T. Schaul, J. Bayer, D. Wierstra, Y. Sun, M. Felder, F. Sehnke, T. Rückstieß, and J. Schmidhuber. PyBrain. The Journal of Machine Learning Research, 11:743-746, 2010.
[14]
S. Sonnenburg, G. Rätsch, S. Henschel, C.Widmer, J. Behr, A. Zien, F. de Bona, A. Binder, C. Gehl, and V. Franc. The SHOGUN machine learning toolbox. Journal of Machine Learning Research, 11:1799-1802, 2010.
[15]
S. Van der Walt, S.C Colbert, and G. Varoquaux. The NumPy array: A structure for efficient numerical computation. Computing in Science and Engineering, 11, 2011.
[16]
T. Zito, N. Wilbert, L. Wiskott, and P. Berkes. Modular toolkit for data processing (MDP): A Python data processing framework. Frontiers in Neuroinformatics, 2, 2008.

Cited By

View all
  • (2024)Continuous Monte Carlo Graph SearchProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3662960(1047-1056)Online publication date: 6-May-2024
  • (2024)Forecasting and Mitigating Disruptions in Public Bus Transit ServicesProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3662933(798-806)Online publication date: 6-May-2024
  • (2024)Approximating the Core via Iterative Coalition SamplingProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3662919(669-678)Online publication date: 6-May-2024
  • Show More Cited By

Comments

Information & Contributors

Information

Published In

Publisher

JMLR.org

Publication History

Published: 01 November 2011
Published in JMLR Volume 12

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3,255
  • Downloads (Last 6 weeks)469
Reflects downloads up to 17 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Continuous Monte Carlo Graph SearchProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3662960(1047-1056)Online publication date: 6-May-2024
  • (2024)Forecasting and Mitigating Disruptions in Public Bus Transit ServicesProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3662933(798-806)Online publication date: 6-May-2024
  • (2024)Approximating the Core via Iterative Coalition SamplingProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3662919(669-678)Online publication date: 6-May-2024
  • (2024)Improvement of a Machine Learning Model Using a Sentiment Analysis Algorithm to Detect Fake NewsJournal of Cases on Information Technology10.4018/JCIT.34481226:1(1-26)Online publication date: 21-Jun-2024
  • (2024)A stacked architecture-based fuzzy classifier with data position transformation using fuzzy cognitive mapsJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-23608746:1(2037-2052)Online publication date: 1-Jan-2024
  • (2024)A multiview clustering framework for detecting deceptive reviewsJournal of Computer Security10.3233/JCS-22000132:1(31-52)Online publication date: 2-Feb-2024
  • (2024)Identifying the degree of cornstarch adulteration in turmeric powder using optimized convolutional neural networkIntelligent Decision Technologies10.3233/IDT-24065618:3(1955-1964)Online publication date: 16-Sep-2024
  • (2024)What factors distinguish overlapping Data job postings? Towards ML-based models for job category’s factors predictionIntelligent Decision Technologies10.3233/IDT-24050918:3(2161-2176)Online publication date: 16-Sep-2024
  • (2024)Classifying promoters by interpreting the hidden information of DNA sequences for disease prediction in clinical laboratories using Gaussian decision boundary estimationIntelligent Decision Technologies10.3233/IDT-23028318:1(613-631)Online publication date: 1-Jan-2024
  • (2024)How graph features from message passing affect graph classification and regression?Intelligent Data Analysis10.3233/IDA-22719028:1(57-75)Online publication date: 1-Jan-2024
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media