Abstract
This paper discusses applications of rough set theory to identifying the most relevant attributes in a medical data set and to inducing decision rules from it. The real-life data set concerns children with diabetes mellitus. Three methods for identifying the most relevant attributes are considered. The first is based on the notion of a reduct and its stability. The second measures the significance of an individual attribute by the relative decrease of the positive region after that attribute is removed. The third is inspired by the wrapper approach, in which classification accuracy is used to rank attributes. The rough set approach additionally yields a set of decision rules. The application of nearest neighbor algorithms to the data reduced by rough set methods is also investigated. The presented methods are general and can be applied to other kinds of data sets.
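The first two methods can be illustrated with a minimal sketch. It is not the paper's implementation: the toy decision table, the attribute names (`age_group`, `insulin_dose`, `hba1c`, `complication`) and the function names are hypothetical, chosen only for illustration, and the reduct search is a brute-force enumeration that is practical only for a handful of attributes. Reduct stability, which the first method also relies on, is not covered here.

```python
from itertools import combinations

def positive_region(rows, attrs, decision):
    """Indices of rows whose indiscernibility class w.r.t. attrs has a single decision value."""
    classes = {}
    for i, row in enumerate(rows):
        key = tuple(row[a] for a in attrs)
        classes.setdefault(key, []).append(i)
    pos = set()
    for members in classes.values():
        if len({rows[i][decision] for i in members}) == 1:
            pos.update(members)
    return pos

def significance(rows, attrs, decision, a):
    """Relative decrease of the positive region size after removing attribute a."""
    full = len(positive_region(rows, attrs, decision))
    if full == 0:
        return 0.0
    rest = [b for b in attrs if b != a]
    reduced = len(positive_region(rows, rest, decision))
    return (full - reduced) / full

def reducts(rows, attrs, decision):
    """Minimal attribute subsets that preserve the positive region (brute force)."""
    target = positive_region(rows, attrs, decision)
    found = []
    for k in range(1, len(attrs) + 1):
        for subset in combinations(attrs, k):
            # Skip supersets of an already found reduct: they are not minimal.
            if any(set(r) <= set(subset) for r in found):
                continue
            if positive_region(rows, subset, decision) == target:
                found.append(subset)
    return found

# Hypothetical toy decision table (not data from the paper).
ATTRS = ["age_group", "insulin_dose", "hba1c"]
DECISION = "complication"
ROWS = [
    {"age_group": "young", "insulin_dose": "low",  "hba1c": "normal",   "complication": "no"},
    {"age_group": "young", "insulin_dose": "high", "hba1c": "normal",   "complication": "no"},
    {"age_group": "old",   "insulin_dose": "low",  "hba1c": "normal",   "complication": "no"},
    {"age_group": "old",   "insulin_dose": "low",  "hba1c": "elevated", "complication": "no"},
    {"age_group": "young", "insulin_dose": "high", "hba1c": "elevated", "complication": "yes"},
    {"age_group": "old",   "insulin_dose": "high", "hba1c": "elevated", "complication": "yes"},
]

if __name__ == "__main__":
    for a in ATTRS:
        print(a, round(significance(ROWS, ATTRS, DECISION, a), 3))
    print(reducts(ROWS, ATTRS, DECISION))
```

In this toy table, removing `age_group` leaves the positive region unchanged, so its significance is zero and it appears in no reduct, whereas removing either of the other two attributes makes two of the six objects inconsistent.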
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Stepaniuk, J. (1999). Rough set data mining of diabetes data. In: Raś, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1999. Lecture Notes in Computer Science, vol 1609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0095133
DOI: https://doi.org/10.1007/BFb0095133
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65965-5
Online ISBN: 978-3-540-48828-6
eBook Packages: Springer Book Archive