Abstract
Learning from given patterns is realized by learning from their appropriate representations. This is usually practiced either by defining a set of features or by measuring proximities between pairs of objects. Both approaches are problem dependent and aim at the construction of some representation space, where discrimination functions can be defined.
In most situations, some feature reduction or prototype selection is mandatory. In this paper, a pairwise selection for creating a suitable representation space is proposed. To determine an informative set of features (or prototypes), the correlations between feature pairs are taken into account. By this, some dependencies are detected, while overtraining is avoided as the criterion is evaluated in two-dimensional feature spaces. Several experiments show that for small sample size problems, the proposed algorithm can outperform traditional selection methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Blake CL and Merz CJ (1998) UCI Repository of machine learning databases, http://www.ics.uci.edu/mlearn/MLRepository.html.
Bo T and Jonassen I (2002) New feature subset selection procedures for classification of expression profiles, Genome biology 3.
Borg I and Groenen P (1997) Modern Multidimensional Scaling. Springer-Verlag.
Breiman L, Friedman JH, Olshen RA and Stone CJ (1984) Classification and regression trees, Wadsworth, California.
Duda RO, Hart PE and Stork DG (2001) Pattern Classification 2nd. edition, John Wiley & Sons.
Gower JC, A general coeficient of similarity and some of its properties, Biometrics vol. 27, 25–33, 1971.
Harol A, Pekalska E and Duin RPW (2005), Pairwise prototype selection on distance data, submitted.
van der Heijden F, Duin RPW, de Ridder D and Tax DMJ (2004) Classification, Parameter Estimation and State Estimation. An Engineering Approach using Matlab. John Wiley & Sons Ltd.
Jain AK, Duin RPW and Mao J (2000) Statistical Pattern Recognition: A Review. IEEE Trans on PAMI 22:4–37.
Kohavi R and John GH (1997) Wrappers for feature subset selection. Artificial Intelligence 97:273–324.
Li L, Weinberg CR, Darden TA and Pedersen LG (2001) Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method. Bioinformatics 17:1131–1142.
Pekalska E, Duin RPW and PaclÃk P (2005), Prototype Selection for Dissimilarity-based Classifiers. accepted to Pattern Recognition.
Pekalska E (2005) Dissimilarity representations in pattern recognition. Concepts, theory and applications. PhD thesis. Delft University of Technology.
Pudil P, Novovicova J, and Kittler J (1994) Floating search methods in feature selection. Pattern Recognition Letters 15:1119–1125.
Raudys S, and Duin RPW (1998) On expected classification error of the Fisher linear classifier with pseudo-inverse covariance matrix. Pattern Recognition Letters 19:385–392.
Skurichina M (2001), Stabilizing weak classifiers. PhD thesis. Delft University of Technology.
Somorjai RL, Dolenko B, Demko A, Mandelzweig M, Nikulin AE, Baumgartner R, Pizzi NJ (2004) Mapping high-dimensional data onto a relative distance plane an exact method for visualizing and characterizing high-dimensional patterns, Journal of Biomedical Informatics 37:366–379.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pekalska, E., Harol, A., Lai, C., Duin, R.P.W. (2005). Pairwise Selection of Features and Prototypes. In: Kurzyński, M., Puchała, E., Woźniak, M., żołnierek, A. (eds) Computer Recognition Systems. Advances in Soft Computing, vol 30. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-32390-2_31
Download citation
DOI: https://doi.org/10.1007/3-540-32390-2_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25054-8
Online ISBN: 978-3-540-32390-7
eBook Packages: EngineeringEngineering (R0)