Abstract
Although there has been considerable study in the area of trading- off accuracy and comprehensibility of decision tree models, the bulk of the methods dwell on sacrificing comprehensibility for the sake of accuracy, or fine-tuning the balance between comprehensibility and accuracy. Invariably, the level of trade-off is decided {ıtshape a priori}. It is possible for such decisions to be made {ıtshape a posteriori} which means the induction process does not discriminate against any of the objectives. In this paper, we present such a method that uses multi-objective Genetic Programming to optimize decision tree models. We have used this method to build decision tree models from Diabetes data in a bid to investigate its capability to trade-off comprehensibility and performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Murthy, S.K.: Automatic construction of decision trees from data: a multi disciplinary survey, pp. 1–49. Kluwer academic publishers, Boston (1998)
Ittner, A., Schlosser, M.: Discovery of relevant new features by generatingn on-linear decision trees. In: Proc. of 2nd International Conference on Knowledge Discovery and Data Mining, vol. 294, pp. 108–113. AAAI Press, Menlo Park (1996)
Ittner, A.: Non-linear decision trees NDT. In: International conference on machine learning (1996)
Nikolaev, N., Slavov, V.: Concepts of inductive genetic programming. In: Banzhaf, W., Poli, R., Schoenauer, M., Fogarty, T.C. (eds.) EuroGP 1998. LNCS, vol. 1391, pp. 49–59. Springer, Heidelberg (1998)
Bot, M.C.J., Langdon, W.B.: Application of genetic programming to induction of linear classification trees. In: EuroGP 2000:European conference, Edindburgh, Scotland, UK, Edindburgh (April 2000)
Breslow, L.A., Aha, D.W.: Simplifying decision trees: a survey. Navy Center for Applied Research in Knowledge Engineering Review Technical Report (1998)
Emmanouillidis, C.: Evolutionary multi-objective feature selection and ROC analysis with applocation to industrial machinery fault diagnosis. Evolutionary methods for design, optimization and control, Barcelona (2002)
Hunter, A.: Expression Inference - genetic symbolic classification integrated with non-linear coefficient optimization. In: Calmet, J., Benhamou, B., Caprotti, O., Hénocque, L., Sorge, V. (eds.) AISC 2002 and Calculemus 2002. LNCS (LNAI), vol. 2385, pp. 117–127. Springer, Heidelberg (2002)
Hanley, J., McNeill, B.: The meaninga nd use of the area under a receiver operator characteristic curve. Diagn. Radiology 143, 29–36 (1982)
Ittner, A., Zeidler, J., Rossius, R., Dilger, W., Schlosser, M.: Feature space partitioning by non-Linear and fuzzy decision trees. Chemnitz University of Technology, Department of Computer Science, Chemnitz (1997)
Hyafil, L., Rivest, R.L.: Constructingop timal binary decision trees is NPcomplete. Information Processing Letters 5(1), 15–17 (1976)
Loh, W., Vanichsetakul, N.: Tree structured classification via generalized discriminant analysis. J.of the American Statistical Association 83(403), 715–728 (1988)
Murthy, S., Kasif, S., Salzberga, S., Beigel, R.: A system for induction of oblique decision trees. J. of Artificial Intelligence Research 2, 1–33 (1994)
Hampson, S.E., Volper, D.J.: Linear function neurons:Structure and training. Biological Cybernetics 53(4), 203–217 (1986)
Michie, D.: Inducingkno wledge from data; First Principles. In: Unpublished manuscript for a talk given at the Seventh International Conference on Machine Learning. Austin,Texas
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and regression trees. In: Belmont, CA: Wadsworth International Group
Pena-Reyes, C., Sipper, M.: Fuzzy CoCo:Balancing accuracy and interpretability of fuzzy models by means of coevolution. In: Logic Systems Laboratory, Swiss Federal Institute of Technology in Lausanne, CH-1015 Lausanne, Switzerland
Lavrac, N.: Selected techniques for data mining in medicine. Artificial Intelligence in Medicine 16, 3–23 (1999)
Hunter, A.: Using multiobjective genetic programming to infer logistic polynomial regression models. In: 15th European Conference on Artificial Intelligence, Lyon, France (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mugambi, E.M., Hunter, A. (2003). Multi-objective Genetic Programming Optimization of Decision Trees for Classifying Medical Data. In: Palade, V., Howlett, R.J., Jain, L. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2003. Lecture Notes in Computer Science(), vol 2773. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45224-9_42
Download citation
DOI: https://doi.org/10.1007/978-3-540-45224-9_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40803-1
Online ISBN: 978-3-540-45224-9
eBook Packages: Springer Book Archive