Abstract
A genetic algorithm is employed in order to select the appropriate number of components for mixture model classifiers. In this classifier, each class-conditional probability density function can be approximated well using the mixture model of Gaussian distributions. Therefore, the classification performance of this classifier depends on the number of components by nature. In this method, the appropriate number of components is selected on the basis of class separability, while a conventional method is based on likelihood. The combination of mixture models is evaluated by a classification oriented MDL (minimum description length) criterion, and its optimization is carried out using a genetic algorithm. The effectiveness of this method is shown through the experimental results on some artificial and real datasets.
Chapter PDF
Similar content being viewed by others
Keywords
References
Rissanen, J.: A Universal Prior for Integers and Estimation by Minimum Description Length. Annals of Statistics. 11 (1983) 416–431
Tenmoto, H., Kudo, M., Shimbo, M.: MDL-Based Selection of the Number of Components in Mixture Models for Pattern Recognition. Lecture Notes in Computer Science 1451, Advances in Pattern Recognition. Springer (1998) 831–836
Tenmoto, H., Kudo, M., Shimbo, M.: Determination of the Number of Components Based on Class Separability in Mixture-Based Classifiers. Proceedings of the Third International Conference on Knowledge-Based Intelligent Information Engineering Systems. (1999) 439–442
Holland, J.H.: Adaptation in Natural and Artificial Systems. University of Michigan Press (1975)
Dempster, A. P., Laird, N. M., Rubin, D. B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B. 39 (1977) 1–38
Bezdek, J.C.: Cluster Validity with Fuzzy Sets. Journal of Cybernetics. 33(1974) 58–73
Kudo, M., Shimbo, M.: Selection of Classifiers Based on the MDL Principle Using the VC Dimension. Proceedings of the 11th International Conference on Pattern Recognition. (1996) 886–890.
Park, Y., Sklansky, J.: Automated Design of Multiple-Class Piecewise Linear Classifiers. Journal of Classification. 6 (1989) 195–222
Murphy, P. M., Aha, D.W.: UCI Repository of Machine Learning Databases [Machine-Readable Data Repository]. University of California Irvine, Department of Information and Computation Science. (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tenmoto, H., Kudo, M., Shimbo, M. (2000). Selection of the Number of Components Using a Genetic Algorithm for Mixture Model Classifiers. In: Ferri, F.J., Iñesta, J.M., Amin, A., Pudil, P. (eds) Advances in Pattern Recognition. SSPR /SPR 2000. Lecture Notes in Computer Science, vol 1876. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44522-6_53
Download citation
DOI: https://doi.org/10.1007/3-540-44522-6_53
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67946-2
Online ISBN: 978-3-540-44522-7
eBook Packages: Springer Book Archive