Abstract
In document recognition, it is often important to obtain high accuracy or reliability and to reject patterns that cannot be classified with high confidence. This is the case for applications such as the processing of financial documents, in which errors can be very costly and are therefore far less tolerable than rejections. This paper presents a new approach based on Linear Discriminant Analysis (LDA) to reject less reliable classifier outputs. Rejection can be considered a two-class problem of either accepting or not accepting the classification result, and to implement it an LDA-based measurement is used to determine a new rejection threshold. This measurement (LDAM) is designed to take into consideration the confidence values of the classifier outputs and the relations between them, and it represents a more comprehensive measurement than traditional rejection measurements such as First Rank Measurement and First Two Ranks Measurement. Experiments are conducted on the CENPARMI database of numerals, the CENPARMI Arabic Isolated Numerals Database, and the numerals in the NIST Special Database 19. The results show that LDAM is more effective than the traditional measurements: it achieves higher reliability while maintaining a high recognition rate on these databases of very different origins and sizes.
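The idea of rejection as a two-class problem can be illustrated with a minimal sketch. It is not the authors' implementation, and the abstract does not define the measurements, so everything here is an assumption: First Rank Measurement is read as the top confidence value, First Two Ranks Measurement as the gap between the top two, and the LDA-based measurement as a Fisher projection of the confidence values sorted in descending order. The function names (`frm`, `ftrm`, `fit_lda`, `ldam`, `evaluate`), the ridge term, the toy data, and the definitions of recognition rate (correct / all) and reliability (correct / accepted) are likewise illustrative choices.

```python
def frm(conf):
    """First Rank Measurement (assumed): the highest confidence value."""
    return max(conf)

def ftrm(conf):
    """First Two Ranks Measurement (assumed): gap between the top two values."""
    top = sorted(conf, reverse=True)
    return top[0] - top[1]

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= f * m[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def fit_lda(accept, reject, ridge=1e-6):
    """Fisher direction w = Sw^-1 (mean_accept - mean_reject).

    accept / reject: confidence vectors of correctly / wrongly classified
    training samples. The ridge keeps Sw invertible when the confidences
    sum to one (an assumption of this sketch, not taken from the paper).
    """
    acc = [sorted(c, reverse=True) for c in accept]
    rej = [sorted(c, reverse=True) for c in reject]
    d = len(acc[0])

    def mean(rows):
        return [sum(r[j] for r in rows) / len(rows) for j in range(d)]

    ma, mr = mean(acc), mean(rej)
    sw = [[ridge if i == j else 0.0 for j in range(d)] for i in range(d)]
    for rows, mu in ((acc, ma), (rej, mr)):
        for r in rows:
            dv = [r[j] - mu[j] for j in range(d)]
            for i in range(d):
                for j in range(d):
                    sw[i][j] += dv[i] * dv[j]
    return solve(sw, [ma[j] - mr[j] for j in range(d)])

def ldam(conf, w):
    """LDA-based measurement (assumed form): projection of sorted confidences."""
    return sum(wi * fi for wi, fi in zip(w, sorted(conf, reverse=True)))

def evaluate(scores, correct, threshold):
    """Return (recognition rate, reliability) when accepting score >= threshold."""
    accepted = [c for s, c in zip(scores, correct) if s >= threshold]
    n_correct = sum(accepted)
    recognition_rate = n_correct / len(scores)
    # Convention of this sketch: with nothing accepted, no error was made.
    reliability = n_correct / len(accepted) if accepted else 1.0
    return recognition_rate, reliability

# Toy three-class confidence vectors (made up for illustration only).
accept = [[0.90, 0.05, 0.05], [0.15, 0.80, 0.05], [0.10, 0.20, 0.70]]
reject = [[0.50, 0.45, 0.05], [0.40, 0.15, 0.45], [0.35, 0.60, 0.05]]
w = fit_lda(accept, reject)
print("LDAM, confident output:", ldam([0.85, 0.10, 0.05], w))
print("LDAM, ambiguous output:", ldam([0.48, 0.47, 0.05], w))
```

Unlike the two rank-based scores, the projection weighs every sorted confidence value at once, which is one way to read the abstract's phrase "the relations between them". The rejection threshold on the projected value would then be chosen on held-out data to trade reliability against recognition rate.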
Cite this article
He, C.L., Lam, L. & Suen, C.Y. Rejection measurement based on linear discriminant analysis for document recognition. IJDAR 14, 263–272 (2011). https://doi.org/10.1007/s10032-011-0154-8
DOI: https://doi.org/10.1007/s10032-011-0154-8