Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

PGGP

Published: 01 March 2016 Publication History

Abstract

Graphical abstractDisplay Omitted HighlightsA genetic program for prototype generation (PGGP) is proposed.PGGP learns to combine instances via genetic programming to generate prototypes.An extensive experimental evaluation is performed.The proposed method is compared to many other techniques.The proposed approach compares favorably with most generation methods proposed so far. Prototype generation (PG) methods aim to find a subset of instances taken from a large training data set, in such a way that classification performance (commonly, using a 1NN classifier) when using prototypes is equal or better than that obtained when using the original training set. Several PG methods have been proposed so far, most of them consider a small subset of training instances as initial prototypes and modify them trying to maximize the classification performance on the whole training set. Although some of these methods have obtained acceptable results, training instances may be under-exploited, because most of the times they are only used to guide the search process. This paper introduces a PG method based on genetic programming in which many training samples are combined through arithmetic operators to build highly effective prototypes. The genetic program aims to generate prototypes that maximize an estimate of the generalization performance of an 1NN classifier. Experimental results are reported on benchmark data to assess PG methods. Several aspects of the genetic program are evaluated and compared to many alternative PG methods. The empirical assessment shows the effectiveness of the proposed approach outperforming most of the state of the art PG techniques when using both small and large data sets. Better results were obtained for data sets with numeric attributes only, although the performance of the proposed technique on mixed data was very competitive as well.

References

[1]
A. Cervantes, I.M. Galvan, P. Isasi, AMPSO: a new particle swarm method for nearest neighborhood classification, IEEE Trans. Syst. Man Cybern. B, 39 (2009) 1082-1091.
[2]
L.P. Cordella, C.D. Stefano, F. Fontanella, A. Marcelli, Looking for prototypes by genetic programming, 2006.
[3]
T. Cover, P. Hart, Nearest neighbor pattern classification, IEEE Trans. Inform. Theory, 13 (1967) 21-27.
[4]
C. Decaestecker, Finding prototypes for nearest neighbour classification by means of gradient descent and deterministic annealing, Pattern Recogn., 30 (1997) 281-288.
[5]
H.J. Escalante, M. Sotomayor, M. Montes-y-Gomez, A.P. Lopez-Monroy, Object recognition with naive Bayes-KNN via prototype generation, Springer, 2014.
[6]
H.J. Escalante, K.M. Mendoza, M. Graff, A. Morales-Reyes, Genetic programming of prototypes for pattern classification, Springer, 2013.
[7]
P.G. Espejo, S. Ventura, F. Herrera, A survey on the application of genetic programming to classification, Trans. Syst. Man Cybern. Part C, 40 (2010 Mar) 121-144.
[8]
F. Fernandez, P. Isasi, Evolutionary design of nearest prototype classifiers, J. Heuristics, 10 (2004) 431-454.
[9]
U. Garain, Prototype reduction using an artificial immune system, Pattern Anal Appl., 11 (2008) 353-363.
[10]
S. García, J. Derrac, J.R. Cano, F. Herrera, Prototype selection for nearest neighbor classification: taxonomy and empirical study, IEEE Trans. Pattern Anal. Mach. Intell., 34 (2012) 417-435.
[11]
M. García-Limón, H.J. Escalante, E. Morales, A. Morales, Simultaneous generation of prototypes and features through genetic programming, in: Proc. of GECCO: Genetic and Evolutionary Computation Conference, 2014, pp. 517-524.
[12]
E. Han, G. Karypis, Centroid-based document classification: analysis and experimental results, Springer, 2000.
[13]
T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning, Springer New York Inc., New York, NY, USA, 2001.
[14]
S.W. Kim, B.J. Oommen, A brief taxonomy and ranking of creative prototype reduction schemes, Pattern Anal. Appl., 6 (2003) 232-244.
[15]
J. Koplowitz, T. Brown, On the relation of performance to editing in nearest neighbor rules, Pattern Recogn., 13 (1981) 251-255.
[16]
T. Kovacs, Genetics-based machine learning, in: Handbook of Natural Computing: Theory, Experiments and Applications, Springer, 2011.
[17]
M. Lozano, J.M. Sotoca, J.S. Sánchez, F. Pla, E. Pkalska, R.P.W. Duin, Experimental study on prototype optimisation algorithms for prototype-based classification in vector spaces, Pattern Recogn., 39 (2006) 1827-1838.
[18]
K.M. Mendoza, Programación genética para la generación automática de prototipos. Master's thesis, UANL, San Nicolas de los Garza, N. L., Mexico, 2012.
[19]
L. Nanni, A. Lumini, Particle swarm optimization for prototype reduction, Neurocomputing, 72 (2008) 1092-1097.
[20]
A. Olvera, J.A. Carrasco-Ochoa, J.F. Martinez-Trinidad, J. Kittler, A review of instance selection methods, Artif. Intell. Rev., 34 (2010) 133-143.
[21]
R. Poli, W.B. Langdon, N.F. McPhee, A Field Guide to Genetic Programming, 2008.
[22]
H. Takcy, T. Gungor, A high performance centroid-based classification approach for language identification, Pattern Recogn. Lett., 33 (2012) 2077-2084.
[23]
I. Triguero, J. Derrac, S. García, F. Herrera, A taxonomy and experimental study on prototype generation for nearest neighbor classification, IEEE Trans. Syst. Man Cybern. C, 42 (2012) 86-100.
[24]
I. Triguero, S. Garcia, F. Herrera, Differential evolution for optimizing the positioning of prototypes in nearest neighbor classification, Pattern Recogn., 44 (2011) 901-916.
[25]
J. Wan, V. Athitsos, P. Jangyodsuk, H.J. Escalante, Q. Ruan, I. Guyon, CSMMI: Class-Specific Maximization of Mutual Information for Action and Gesture Recognition, IEEE Trans. Image Process., 23 (2014) 3152-3165.

Cited By

View all
  • (2024)Reduction Through Homogeneous Clustering: Variations for Categorical Data and Fast Data ReductionSN Computer Science10.1007/s42979-024-03007-95:6Online publication date: 25-Jun-2024
  • (2023)Very fast variations of training set size reduction algorithms for instance-based classificationProceedings of the 27th International Database Engineered Applications Symposium10.1145/3589462.3589493(64-70)Online publication date: 5-May-2023
  • (2023)Multilabel Prototype Generation for data reduction in K-Nearest Neighbour classificationPattern Recognition10.1016/j.patcog.2022.109190135:COnline publication date: 1-Mar-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Applied Soft Computing
Applied Soft Computing  Volume 40, Issue C
March 2016
683 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 01 March 2016

Author Tags

  1. 1NN classification
  2. 68T10
  3. 68T20
  4. Genetic programming
  5. Pattern classification
  6. Prototype generation

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Reduction Through Homogeneous Clustering: Variations for Categorical Data and Fast Data ReductionSN Computer Science10.1007/s42979-024-03007-95:6Online publication date: 25-Jun-2024
  • (2023)Very fast variations of training set size reduction algorithms for instance-based classificationProceedings of the 27th International Database Engineered Applications Symposium10.1145/3589462.3589493(64-70)Online publication date: 5-May-2023
  • (2023)Multilabel Prototype Generation for data reduction in K-Nearest Neighbour classificationPattern Recognition10.1016/j.patcog.2022.109190135:COnline publication date: 1-Mar-2023
  • (2023)A Constructive Method for Data Reduction and Imbalanced SamplingAlgorithms and Architectures for Parallel Processing10.1007/978-981-97-0798-0_28(476-489)Online publication date: 20-Oct-2023
  • (2023)Addressing Class Imbalance in Multilabel Prototype Generation for k-Nearest Neighbor ClassificationPattern Recognition and Image Analysis10.1007/978-3-031-36616-1_2(15-27)Online publication date: 27-Jun-2023
  • (2022)Fast data reduction by space partitioning via convex hull and MBR computationPattern Recognition10.1016/j.patcog.2022.108553126:COnline publication date: 1-Jun-2022
  • (2021)A model to estimate the Self-Organizing Maps grid dimension for Prototype GenerationIntelligent Data Analysis10.3233/IDA-20512325:2(321-338)Online publication date: 1-Jan-2021
  • (2019)A Neighbor Prototype Selection Method Based on CCHPSO for Intrusion DetectionSecurity and Communication Networks10.1155/2019/12834952019Online publication date: 20-Aug-2019
  • (2017)An iterative genetic programming approach to prototype generationGenetic Programming and Evolvable Machines10.1007/s10710-016-9279-318:2(123-147)Online publication date: 1-Jun-2017

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media