Abstract
In decision trees, the lower branches are known to be less reliable than the upper branches because of the data fragmentation problem. As a result, decision trees may perform unnecessary attribute tests, since the tests they require are not the best for some subsets of the data objects. To compensate for this weakness, we suggest supplementing decision trees with reliable short rules, where the short rules come from a limited application of association rule mining algorithms. Experiments show that the method not only produces more reliable decisions but also saves test costs by using the short rules.
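The idea in the abstract can be sketched as a rule-first classifier: a confident short rule, if one matches, pre-empts the tree and avoids its remaining attribute tests; otherwise the tree is consulted as usual. The rule format, confidence threshold, attribute names, and the stand-in tree below are illustrative assumptions, not the paper's exact algorithm.

```python
# Short rules: (antecedent attribute-value pairs, predicted class, confidence).
# These would come from a limited association-rule search; values here are made up.
SHORT_RULES = [
    ({"outlook": "overcast"}, "play", 0.95),
    ({"outlook": "sunny", "humidity": "high"}, "no_play", 0.92),
]
MIN_CONFIDENCE = 0.90  # only rules at least this reliable may pre-empt the tree


def tree_predict(example):
    # Stand-in for a trained decision tree; each branch is a further attribute test.
    if example.get("outlook") == "rain":
        return "no_play" if example.get("wind") == "strong" else "play"
    return "play" if example.get("humidity") == "normal" else "no_play"


def predict(example):
    """Apply the most confident matching short rule; otherwise fall back to
    the decision tree, which may require additional attribute tests."""
    best = None
    for antecedent, label, conf in SHORT_RULES:
        if conf >= MIN_CONFIDENCE and all(
            example.get(attr) == val for attr, val in antecedent.items()
        ):
            if best is None or conf > best[1]:
                best = (label, conf)
    return best[0] if best else tree_predict(example)


print(predict({"outlook": "overcast", "humidity": "high"}))  # short rule fires
print(predict({"outlook": "rain", "wind": "strong"}))        # tree fallback
```

When a short rule fires, only the attributes in its antecedent need to be tested, which is where the test-cost saving comes from.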
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
Cite this paper
Sug, H. (2006). Using Reliable Short Rules to Avoid Unnecessary Tests in Decision Trees. In: Gelbukh, A., Reyes-Garcia, C.A. (eds) MICAI 2006: Advances in Artificial Intelligence. MICAI 2006. Lecture Notes in Computer Science(), vol 4293. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11925231_57
DOI: https://doi.org/10.1007/11925231_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49026-5
Online ISBN: 978-3-540-49058-6