Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Advertisement

Feature selection for ensembles applied to handwriting recognition

  • Original Paper
  • Published:
International Journal of Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Abstract

Feature selection for ensembles has shown to be an effective strategy for ensemble creation due to its ability of producing good subsets of features, which make the classifiers of the ensemble disagree on difficult cases. In this paper we present an ensemble feature selection approach based on a hierarchical multi-objective genetic algorithm. The underpinning paradigm is the “overproduce and choose”. The algorithm operates in two levels. Firstly, it performs feature selection in order to generate a set of classifiers and then it chooses the best team of classifiers. In order to show its robustness, the method is evaluated in two different contexts:supervised and unsupervised feature selection. In the former, we have considered the problem of handwritten digit recognition and used three different feature sets and multi-layer perceptron neural networks as classifiers. In the latter, we took into account the problem of handwritten month word recognition and used three different feature sets and hidden Markov models as classifiers. Experiments and comparisons with classical methods, such as Bagging and Boosting, demonstrated that the proposed methodology brings compelling improvements when classifiers have to work with very low error rates. Comparisons have been done by considering the recognition rates only.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Hansen, L., Salomon, O.: Neural network ensembles. IEEE Trans. Pattern Anal. Mach. Intell. 12(10), 993–1001 (1990)

    Article  Google Scholar 

  2. Hashem, S.: Optimal linear combinations of neural networks. Neural Networks 10(4), 599–614 (1997)

    Article  PubMed  Google Scholar 

  3. Krogh, A., Vedelsby, J.: Neural networks ensembles, cross validation, and active learning. In: Tesauro, G. et al. (eds.). Advances in Neural Information Processing Systems 7, pp. 231–238. MIT Press (1995)

  4. Optiz, D.W.: Feature selection for ensembles. In: Proceedings of the 16th International Conference on Artificial Intelligence, pp. 379–384 (1999)

  5. Ho, T.K.: The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Mach. Intell. 20(8), 832–844 (1998)

    Article  Google Scholar 

  6. Last, M., Bunke, H., Kandel, A.: A feature-based serial approach to classifier combination. Pattern Anal. Appl. 5, 385–398 (2002)

    Article  MathSciNet  Google Scholar 

  7. Tsymbal, A., Puuronen, S., Patterson, D.W.: Ensemble feature selection with the simple Bayesian classification. Inf. Fusion 4, 87–100 (2003)

    Article  Google Scholar 

  8. Gerra-Salcedo, C., Whitley, D.: Genetic approach to feature selection for ensemble creatin. In: Proceedings of the Genetic and Evolutionary Computation Conference, pp. 236–243. Orlando-USA (1999)

  9. De Stefano, C., Cioppa, A.D., Marcelli, A.: Exploiting reliability for dynamic selection of classifiers by means of genetic algorithms. In: Proceedings of the 7th International Conference on Document Analysis and Recognition, pp. 671–675. Edinbugh-Scotland IEEE Computer Society (2003)

  10. Emmanouilidis, C., Hunter, A., MacIntyre, J.: A multiobjective evolutionary setting for feature selection and a commonality-based crossover operator. In: Proceedings of the Congress on Evolutionary Computation, vol. 1, pp. 309–316 (2000)

  11. Oliveira, L.S., Sabourin, R., Bortolozzi, F., Suen, C.Y.: A methodology for feature selection using multi-objective genetic algorithms for handwritten digit string recognition. Int. J. Pattern Recog. Artif. Intell. 17(6), 903–930 (2003)

    Article  Google Scholar 

  12. Siedlecki, W., Sklansky, J.: A note on genetic algorithms for large scale on feature selection. Pattern Recog. Lett. 10, 335–347 (1989)

    Article  MATH  Google Scholar 

  13. Kudo, M., Sklansky, J.: Comparision of algorithms that select features for pattern classifiers. Pattern Recog. 33(1), 25–41 (2000)

    Article  Google Scholar 

  14. Partridge, D., Yates, W.B.: Engineering multiversion neural-net systems. Neural Comput. 8(4), 869–893 (1996)

    Article  PubMed  Google Scholar 

  15. Giacinto, G., Roli, F.: Design of effective neural network ensemble for image classification purposes. Image Vision Comput. J. 9–10, 697–705 (2001)

    Google Scholar 

  16. Kuncheva, L.I., Whitaker, C.J.: Measures of diversity in classifier ensembles. Mach. Learn. 51, 181–207 (2003)

    Google Scholar 

  17. Ruta, D., Gabrys, B.: Analysis of the correlation between majority voting error and the diversity measures in multiple classifier systems. In: Proceedings of the 4th International Symposium on Soft Computing. Paisley, UK (2001)

  18. Ruta, D.: Multilayer selection-fusion model for pattern classification. In: Proceedings of the IASTED Artificial Intelligence and Application Conference. Insbruck, Austria (2004)

  19. Kuncheva, L.: That elusive diversity in classifier ensembles. In: Proceedings of the ibPRIA, LNCS 2652, pp. 1126–1138. Mallorca, Spain (2003)

  20. Jain, A.K., Duin, R.P.W., Mao, J.: Statistical pattern recognition: A review. IEEE Trans. Pattern Anal. Mach. Intell. 22(1), 4–37 (2000)

    Article  Google Scholar 

  21. Kuncheva, L., Bezdek, J.C., Duin, R.P.W.: Decision templates for multiple classifier fusion: An experimental comparison. Pattern Recog. 34(2), 299–314 (2001)

    Article  MATH  Google Scholar 

  22. Efron, B., Tibshirani, R.: An Introduction to the Bootstrap. Chapman and Hall (1993)

  23. Breiman, L.: Stacked regressions. Mach. Learn. 24(1), 49–64 (1996)

    MATH  MathSciNet  Google Scholar 

  24. Freund, Y., Schapire, R.: Experiments with a new boosting algorithm. In: Proceedings of the 13th International Conference on Machine Learning, pp. 148–156. Bary-Italy (1996)

  25. Tumer, K., Oza, N.C.: Input decimated ensembles. Pattern Anal. Appl. 6, 65–77 (2003)

    Article  MathSciNet  MATH  Google Scholar 

  26. Kuncheva, L., Jain, L.C.: Designing classifier fusion systems by genetic algorithms. IEEE Trans. Evol. Comput. 4(4), 327–336 (2000)

    Article  Google Scholar 

  27. Gunter, S., Bunke, H.: Creation of classifier ensembles for handwritten word recogntion using feature selection algorithms. In: Proceedings of the 8th IWFHR, pp. 183–188. Niagara-on-the-Lake, Canada (2002)

  28. Deb, K.: Multi-Objective Optimization using Evolutionary Algorithms. Wiley, 2nd edn. April (2002)

  29. Srinivas, N., Deb, K.: Multiobjective optimization using nondominated sorting in genetic algorithms. Evol. Comput. 2(3), 221–248 (1995)

    Article  Google Scholar 

  30. Deb, K., Goldberg, D.E.: An investigation of niche and species formation in genetic function. In: Proceedings of the 3rd International Conference on Genetic Algorithms, pp. 42–50 (1989)

  31. Oliveira, L.S., Sabourin, R., Bortolozzi, F., Suen, C.Y.: Automatic recognition of handwritten numerical strings: A recognition and verification strategy. IEEE Trans. Pattern Anal. Mach. Intell. 24(11), 1438–1454 (2002)

    Article  Google Scholar 

  32. Oh, I.-S., Suen, C.Y.: Distance features for neural network-based recognition of handwritten characters. Int. J. Doc. Anal. Recog. 1(2), 73–88 (1998)

    Article  Google Scholar 

  33. Chim, Y.C., Kassim, A.A., Ibrahim, Y.: Dual classifier system for handprinted alphanumeric character recognition. Pattern Anal. Appl. 1(3), 155–162 (1998)

    Article  MATH  Google Scholar 

  34. Bishop, C.M.: Neural Networks for Pattern Recognition. Oxford University Press, Oxford, U.K. (1995)

    Google Scholar 

  35. Fumera, G., Roli, F., Giacinto, G.: Reject option with multiple thresholds. Pattern Recog. 33(12), 2099–2101 (2000)

    Article  Google Scholar 

  36. Oliveira, J.J. Jr., Carvalho, J.M., Freitas, C.O.A., Sabourin, R.: Evaluating NN and HMM classifiers for handwritten word recognition. In: Proceedings of the 15th Brazilian Symposium on Computer Graphics and Image Processing, pp. 210–217. IEEE Computer Society Fortaleza, Brazil (2002)

  37. Morita, M., El Yacoubi, A., Sabourin, R., Bortolozzi, F., Suen, C.Y.: Handwritten month word recognition on Brazilian bank cheques. In: Proceedings of the 6th ICDAR, pp. 972–976 (2001)

  38. John, G., Kohavi, R., Pfleger, K.: Irrelevant features and the subset selection problems. In: Proceedings of the 11th International Conference on Machine Learning, pp. 121–129 (1994)

  39. Moody, J., Utans, J.: Principled architecture selection for neural networks: Application to corporate bond rating prediction. In: Moody, J., Hanson, S.J., Lippmann, R.P. (eds.). Advances in Neural Information Processing Systems 4. Morgan Kaufmann (1991)

  40. Yuan, H., Tseng, S.S., Gangshan, W., Fuyan, Z.: A two-phase feature selection method using both filter and wrapper. In: Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, vol. 2, pp. 132–136 (1999)

  41. Kuncheva, L.I., Whitaker, C.J.: Ten measures of diversity in classifier ensembles: limits for two classifiers. In: Proceedings of the IEE Workshop on Intelligent Sensor Processing, pp. 1–10 (2001)

  42. Kittler, J., Hatef, M., Duin, R., Matas, J.: On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3), 226–239 (1998)

    Article  Google Scholar 

  43. Huang, X.D., Ariki, Y., Jack, M.A.: Hidden Markov Models for Speech Recognition. Edinburgh University Press (1990)

  44. Dy, J.G., Brodley, C.E.: Feature subset selection and order identification for unsupervised learning. In: Proceedings of the 17th International Conference on Machine Learning. Stanford University, CA, July (2000)

  45. Kim, Y.S., Street, W.N., Menczer, F.: Feature selection in unsupervised learning via evolutionary search. In: Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 365–369 (2000)

  46. Morita, M., Sabourin, R., Bortolozzi, F., Suen, C.Y.: Unsupervised feature selection using multi-objective genetic algorithms for handwritten word recognition. In: Proceedings of the 7th International Conference on Document Analysis and Recognition, pp. 666–670. IEEE Computer Society, Edinbugh-Scotland (2003)

  47. Davis, L.: Handbook on Genetic Algorithms. Van Nostrand Reinhold (1991)

  48. Cantu-Paz, E.: Efficient and Accurate Parallel Genetic Algorithms. Kluwer Academic Publishers (2000)

  49. Miki, M., Hiroyasu, T., Kaneko, K., Hatanaka, K.: A parallel genetic algorithm with distributed environment scheme. In: Proceedings of the International Conference on System, Man, and Cybernetics, vol. 1, pp. 695–700 (1999)

  50. Morita, M., Oliveira, L.S., Sabourin, R., Bortolozzi, F., Suen, C.Y.: An HMM-MLP hybrid system to recognize handwritten dates. In: Proceedings of the International Joint Conference on Neural Networks, pp. 1–6. IEEE Computer Society, Honolulu, USA (2002)

  51. Tumer, K., Ghosh, J.: Error correlation and error reduction in ensemble classifiers. Connect. Sci. 8(3–4), 385–404 (1996)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luiz S. Oliveira.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Oliveira, L.S., Morita, M. & Sabourin, R. Feature selection for ensembles applied to handwriting recognition. IJDAR 8, 262–279 (2006). https://doi.org/10.1007/s10032-005-0013-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10032-005-0013-6

Keywords