An Effective Feature Selection Method Using the Contribution Likelihood Ratio of Attributes for Classification

Zhang, Zhiwang; Shi, Yong; Gao, Guangxia; Chai, Yaohui

doi:10.1007/978-3-540-89376-9_16

Zhiwang Zhang²¹,
Yong Shi²²,
Guangxia Gao²³ &
…
Yaohui Chai²⁴

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4977))

Included in the following conference series:

Asia-Pacific Web Conference

933 Accesses

Abstract

Feature selection is a very crucial step in data mining process. It aims to find the most important feature subset from a given feature set without degradation of classifying information. As for the traditional feature selection method, the number of candidate feature subsets created by algorithm in an iterative computational way is exponential in the size of the initial attribute set. And relevant algorithm occupies a lot of the system resources in time and space. In this paper, we study and develop a novel feature selection method and provide its mathematic principle, which is based on the factors of attributes contributing to target attribute and their maximum information divergence value (MIDV) to select small enough feature subset and improve the classification accuracy. And then the extensive experiment shows that our proposed method is very efficient in computational performance and scalability than traditional methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Optimal and Novel Hybrid Feature Selection Framework for Effective Data Classification

Why Feature Selection in Data Mining Is Prominent? A Survey

Feature subset selection combining maximal information entropy and maximal information coefficient

Article 29 July 2019

References

Mitchell, T.: Machine Learning. McGraw-Hill, New York (1997)
MATH Google Scholar
de Angelis, V., Felici, G., Mancinelli, G.: Feature selection for data mining. In: Data mining & Knowledge discovery based on rules reduction, pp. 227–237. Springer, Heidelberg (2006)
Google Scholar
Langley, P.: Selection of relevant features in machine learning. In: Proceedings of the AAAI Fall Symposium on Relevance. AAAI Press, Menlo Park (1994)
Google Scholar
Duda, R.O., Hart, P.E.: Pattern Classification, 2nd edn., pp. 90–101. Elsevier Science, USA (2003)
Google Scholar
Fukunaga, Keinosuke: Introduction to Statistical Pattern Recognition, 2nd edn., pp. 489–503. Elsevier Academic Press (1999)
Google Scholar
Ruiz, R., Aguilar–Ruiz, J.S.: Analysis of Feature Rankings for Classification, Spain
Google Scholar
Abe, N., Kudo, M.: Entropy Criterion for Classifier-Independent Feature Selection, Japan
Google Scholar
Abe, N., Kudo, M.: A Divergence Criterion for Classifier-Independent Feature Selection, Japan
Google Scholar
Cang, S., Partridge, D.: Feature ranking and best feature subset using mutual information. Springer, London (2004)
Google Scholar
Chizi, B., Maimon, O.: Data Mining & Knowledge Discovery Handbook. Springer Science, pp. 93–109 (2005)
Google Scholar
Ian, H., Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools And Techniques, 2nd edn. Elsevier Inc., Amsterdam (2005)
Google Scholar
Han, J., Kamber, M.: Data Mining: Concepts and Techniques, pp. 10–19. Morgan Kaufmann Publishers, San Francisco (2001)
Google Scholar
Ye, N.: The Handbook of Data Mining, pp. 414–417. Lawrence Erlbaum Associates, Mahwah (2003)
Google Scholar
Nixon, M.S., Aguado, A.S.: Feature Extraction and Image Processing. Elsevier Newnes, Amsterdam (2002)
Google Scholar
Fukunaga, K.: Introduction to Statistical Pattern Recognition, 2nd edn., pp. 489–503. Elsevier Academic Press, Amsterdam (1999)
Google Scholar
Marques de Sa, J.P.: Pattern Recognition Concepts Methods and Applications, pp. 65–69. Springer, Heidelberg (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information of Graduate University of Chinese Academy of Sciences, China Chinese Academy of Sciences Research Center on Fictitious Economy and Data Science, Beijing, (100080), China
Zhiwang Zhang
Research Center on Fictitious Economy and Data Science, Chinese Academy of Sciences, Beijing 100080, China; College of Information Science and Technology, University of Nebraska at Omaha, Omaha, NE 68182, USA
Yong Shi
Foreign Language Department, Shandong Institute of Business and Technology, Yantai, Shandong, 264005, China
Guangxia Gao
School of Management of Graduate University of Chinese Academy of Sciences, China Chinese Academy of Sciences Research Center on Fictitious Economy and Data Science, Beijing, (100080), China
Yaohui Chai

Authors

Zhiwang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yong Shi
View author publications
You can also search for this author in PubMed Google Scholar
Guangxia Gao
View author publications
You can also search for this author in PubMed Google Scholar
Yaohui Chai
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Nagoya University, Nagoya, Japan
Yoshiharu Ishikawa
CAS Research Center on Data Technology and Knowledge Economy, Beijing, China
Jing He & Yong Shi &
Victoria University, Melbourne, Australia
Guandong Xu
Institute of Software, Chinese Academy of Sciences, Beijing, China
Guangyan Huang
CSIRO ICT Centre, Brisbane, QLD, Australia
Chaoyi Pang & Qing Zhang &
Northeastern University, Shenyang, China
Guoren Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, Z., Shi, Y., Gao, G., Chai, Y. (2008). An Effective Feature Selection Method Using the Contribution Likelihood Ratio of Attributes for Classification. In: Ishikawa, Y., et al. Advanced Web and Network Technologies, and Applications. APWeb 2008. Lecture Notes in Computer Science, vol 4977. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89376-9_16

Download citation

DOI: https://doi.org/10.1007/978-3-540-89376-9_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89375-2
Online ISBN: 978-3-540-89376-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

An Effective Feature Selection Method Using the Contribution Likelihood Ratio of Attributes for Classification

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Optimal and Novel Hybrid Feature Selection Framework for Effective Data Classification

Why Feature Selection in Data Mining Is Prominent? A Survey

Feature subset selection combining maximal information entropy and maximal information coefficient

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

An Effective Feature Selection Method Using the Contribution Likelihood Ratio of Attributes for Classification

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Optimal and Novel Hybrid Feature Selection Framework for Effective Data Classification

Why Feature Selection in Data Mining Is Prominent? A Survey

Feature subset selection combining maximal information entropy and maximal information coefficient

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation