Abstract
This paper provides a first step towards a methodology for finding near-optimal representations in classification problems by combining feature transformations from an initial family of basis functions. The original representation of the problem data may not be the most appropriate, so it may be necessary to search for a new representation space that is closer to the structure of the problem to be solved. The outcome of this search is critical to the successful solution of the problem. For instance, if the objective function has certain global statistical properties, such as periodicity, methods based on local pattern information will struggle to capture the underlying structure and, hence, to generalize. Conversely, once an appropriate representation is found, most problems can be solved by a linear method; the key is therefore to find the proper representation. As a proof of concept we present a particular problem whose class distributions overlap in a very intricate way in the space of original attributes. For this problem, the proposed algorithm finds a representation based on the trigonometric basis that yields a solution where some classical learning methods, e.g. multilayer perceptrons and decision trees, fail. The methodology consists of a discrete search within the space of basis functions and a linear mapping performed by a Fisher discriminant. We place special emphasis on the first part: finding the optimal combination of basis functions is difficult because of the non-gradient nature of the problem and the large number of possible combinations. We rely on the global search capabilities of a genetic algorithm to scan the space of function compositions.
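The two components of the methodology can be sketched in miniature: a genetic algorithm evolves bitmasks selecting trigonometric basis functions, and each candidate is scored by the Fisher criterion of its one-dimensional Fisher-discriminant projection. The synthetic dataset, the basis family (sin and cos of a few harmonics), and all GA parameters below are illustrative assumptions, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative two-class problem: the classes overlap in the original
# attribute x but separate cleanly under a periodic feature (sin(3x)).
x = rng.uniform(0, 4 * np.pi, 400)
y = (np.sin(3 * x) > 0).astype(int)

# Candidate basis: sin(kx) and cos(kx) for a few harmonics (assumed family).
FUNCS = [(k, f) for k in (1, 2, 3, 4) for f in (np.sin, np.cos)]

def basis(x, mask):
    """Map x through the basis functions selected by the bitmask."""
    chosen = [f(k * x) for bit, (k, f) in zip(mask, FUNCS) if bit]
    return np.column_stack(chosen) if chosen else None

def fisher_score(mask):
    """Fitness: Fisher criterion of the 1-D Fisher-discriminant projection."""
    X = basis(x, mask)
    if X is None:
        return 0.0
    m0, m1 = X[y == 0].mean(0), X[y == 1].mean(0)
    Sw = np.atleast_2d(np.cov(X[y == 0].T) + np.cov(X[y == 1].T))
    Sw += 1e-6 * np.eye(X.shape[1])          # regularize within-class scatter
    w = np.linalg.solve(Sw, m1 - m0)         # Fisher discriminant direction
    z = X @ w
    between = (z[y == 1].mean() - z[y == 0].mean()) ** 2
    within = z[y == 0].var() + z[y == 1].var()
    return between / within

# Plain generational GA over bitmasks (truncation selection,
# one-point crossover, bitwise mutation).
pop = rng.integers(0, 2, (30, len(FUNCS)))
for gen in range(40):
    fit = np.array([fisher_score(m) for m in pop])
    parents = pop[np.argsort(fit)[::-1][:10]]
    children = []
    for _ in range(len(pop)):
        a, b = parents[rng.integers(10)], parents[rng.integers(10)]
        cut = rng.integers(1, len(FUNCS))
        child = np.concatenate([a[:cut], b[cut:]])
        flip = rng.random(len(FUNCS)) < 0.05
        children.append(np.where(flip, 1 - child, child))
    pop = np.array(children)

best = max(pop, key=fisher_score)
```

On this toy problem the GA quickly concentrates on masks that include the sin(3x) feature, for which the Fisher projection separates the classes by sign; the same loop structure carries over when the basis family or the fitness evaluation is replaced.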
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
Cite this paper
del Valle, M., Sánchez, B., Lago-Fernández, L.F., Corbacho, F.J. (2005). Towards a Methodology to Search for Near-Optimal Representations in Classification Problems. In: Mira, J., Álvarez, J.R. (eds) Artificial Intelligence and Knowledge Engineering Applications: A Bioinspired Approach. IWINAC 2005. Lecture Notes in Computer Science, vol 3562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11499305_30
Print ISBN: 978-3-540-26319-7
Online ISBN: 978-3-540-31673-2