Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
透過您的圖書館登入
IP:70.40.220.129
  • 期刊
  • OpenAccess

Heuristic Feature Selection with Classification Efficiency Using Soft Cluster Analysis for Biological Datasets

摘要


With a deeper investigation to deciphering the sophisticated relations among input and output variables of multi-class classification problems, the goal of this paper is to propose a new model of variable selection which maximizes the discrimination and minimizes the size of the selected feature subsets. For molecular datasets with a tremendous amount of input variables, the proposed heuristic algorithm is capable of exploring the essential factors of classification problems. Our model devotes to three accomplishments of multiclass classification tasks. Feature discretization using fuzzy clustering analysis for the improvement of feature discrimination is the first. Multivariate analysis for the investigation of information relevance and redundancy is the second achievement in this study. The third is a novel heuristic feature selection algorithm with effectiveness but without overfitting problem. Experimental results convince our model acquires significant discrimination improvement for microarray classification problems.