Rough set data mining of diabetes data

Stepaniuk, Jaroslaw

doi:10.1007/BFb0095133

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1609)

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

Abstract

This paper discusses applications of rough set theory to identifying the most relevant attributes in a medical data set and to inducing decision rules from it. The real-life data set concerns children with diabetes mellitus. Three methods for identifying the most relevant attributes are considered. The first is based on the notion of a reduct and its stability. The second is based on the significance of an individual attribute, measured by the relative decrease of the positive region after that attribute is removed. The third is inspired by the wrapper approach, in which classification accuracy is used to rank attributes. The rough set approach additionally yields a set of decision rules. The application of nearest neighbor algorithms to the data reduced by rough set methods is also investigated. The presented methods are general and can be applied to other kinds of data sets.
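The second method in the abstract can be illustrated with a minimal sketch. It assumes the standard Pawlak definition of the positive region (objects whose indiscernibility class is consistent on the decision) and reads "relative decrease" as (|POS_B(d)| - |POS_{B-{a}}(d)|) / |POS_B(d)|; the paper's exact formula is not shown in this preview, so this reading is an assumption. The toy table and the attribute names (`age`, `duration`, `hba1c`, `complication`) are hypothetical and are not taken from the paper's data.

```python
from collections import defaultdict

def positive_region(rows, attrs, decision):
    """Indices of objects whose indiscernibility class w.r.t. attrs
    contains a single decision value (standard Pawlak definition)."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        # Objects with identical values on attrs are indiscernible.
        blocks[tuple(row[a] for a in attrs)].append(i)
    pos = set()
    for members in blocks.values():
        if len({rows[i][decision] for i in members}) == 1:
            pos.update(members)
    return pos

def significance(rows, attrs, decision, attr):
    """Relative decrease of the positive region when attr is dropped
    (assumed formula, see the lead-in)."""
    full = len(positive_region(rows, attrs, decision))
    if full == 0:
        return 0.0
    rest = [a for a in attrs if a != attr]
    reduced = len(positive_region(rows, rest, decision))
    return (full - reduced) / full

# Hypothetical toy decision table, NOT the paper's diabetes data.
ATTRS = ["age", "duration", "hba1c"]
DECISION = "complication"
RAW = [
    ("young", "short", "low",  "no"),
    ("young", "short", "high", "yes"),
    ("old",   "long",  "high", "yes"),
    ("old",   "short", "low",  "no"),
    ("young", "long",  "low",  "no"),
    ("old",   "long",  "high", "yes"),
    ("young", "long",  "high", "no"),
]
ROWS = [dict(zip(ATTRS + [DECISION], r)) for r in RAW]

# Rank attributes from most to least significant.
ranking = sorted(
    ((significance(ROWS, ATTRS, DECISION, a), a) for a in ATTRS),
    reverse=True,
)
for score, name in ranking:
    print(f"{name}: {score:.3f}")
```

On this toy table every object lies in the positive region when all three attributes are used. Dropping `age` makes three objects inconsistent (significance 3/7), while dropping `duration` or `hba1c` makes two inconsistent (2/7 each), so `age` ranks first. A real study would compute the same quantity on discretized clinical attributes.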

Author information

Authors and Affiliations

Institute of Computer Science, Bialystok University of Technology, Wiejska 45A, 15-351, Bialystok, Poland
Jaroslaw Stepaniuk

Editor information

Zbigniew W. Raś, Andrzej Skowron

About this paper

Cite this paper

Stepaniuk, J. (1999). Rough set data mining of diabetes data. In: Raś, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1999. Lecture Notes in Computer Science, vol 1609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0095133

DOI: https://doi.org/10.1007/BFb0095133
Published: 20 October 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65965-5
Online ISBN: 978-3-540-48828-6
eBook Packages: Springer Book Archive
