Abstract
We propose a new one-class classification method, called One Class Random Forest, that is able to learn from one class of samples only. This method, based on a random forest algorithm and an original outlier generation procedure, makes use of the ensemble learning mechanisms offered by random forest algorithms to reduce both the number of artificial outliers to generate and the size of the feature space in which they are generated. We show that One Class Random Forests perform well on various UCI public datasets in comparison to few other state-of-the-art one class classification methods (gaussian density models, Parzen estimators, gaussian mixture models and one-class SVMs).
Chapter PDF
Similar content being viewed by others
References
Hempstalk, K., Frank, E., Witten, I.: One-Class Classification by Combining Density and Class Probability Estimation. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part I. LNCS (LNAI), vol. 5211, pp. 505–519. Springer, Heidelberg (2008)
Scholkopf, B., Platt, J., Shawe-Taylor, J., Smola, A., Williamson, R.: Estimating the support of a high-dimensional distribution. Neural Computation 13(7), 1443–1471 (2001)
Tarassenko, L., Clifton, D., Bannister, P., King, S., King, D.: Novelty detection. Encyclopedia of Structural Health Monitoring (2009)
Khan, S., Madden, M.: A survey of recent trends in one class classification. Artificial Intelligence and Cognitive Science, 188–197 (2010)
Tax, D., Duin, R.: Combining One-Class Classifiers. In: Kittler, J., Roli, F. (eds.) MCS 2001. LNCS, vol. 2096, pp. 299–308. Springer, Heidelberg (2001)
Dietterich, T.: Ensemble Methods in Machine Learning. In: Kittler, J., Roli, F. (eds.) MCS 2000. LNCS, vol. 1857, pp. 1–15. Springer, Heidelberg (2000)
Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
Robnik-Sikonja, M.: Improving Random Forests. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 359–370. Springer, Heidelberg (2004)
Bernard, S., Heutte, L., Adam, S.: Forest-rk: A new random forest induction method. In: Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence, pp. 430–437 (2008)
Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Machine Learning 63(1), 3–42 (2006)
Ho, T.: The random subspace method for constructing decision forests. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(8), 832–844 (1998)
Tax, D., Ypma, A., Duin, R.: Support vector data description applied to machine vibration analysis. In: Proc. 5th Annual Conference of the Advanced School for Computing and Imaging, Heijen, NL, Citeseer (1999)
Blake, C., Merz, C.: Uci repository of machine learning databases. Department of Information and Computer Science, vol. 55. University of California, Irvine (1998), http://www.ics.uci.edu/~mlearn/mlrepository.html
Baldi, P., Brunak, S., Chauvin, Y., Andersen, C., Nielsen, H.: Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics 16(5), 412–424 (2000)
Duin, R.: PRTools version 3.0: A matlab toolbox for pattern recognition. In: Proc. of SPIE, Citeseer (2000)
Bernard, S., Heutte, L., Adam, S.: Influence of Hyperparameters on Random Forest Accuracy. In: Benediktsson, J.A., Kittler, J., Roli, F. (eds.) MCS 2009. LNCS, vol. 5519, pp. 171–180. Springer, Heidelberg (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Désir, C., Bernard, S., Petitjean, C., Heutte, L. (2012). A New Random Forest Method for One-Class Classification. In: Gimel’farb, G., et al. Structural, Syntactic, and Statistical Pattern Recognition. SSPR /SPR 2012. Lecture Notes in Computer Science, vol 7626. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34166-3_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-34166-3_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34165-6
Online ISBN: 978-3-642-34166-3
eBook Packages: Computer ScienceComputer Science (R0)