Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Synchronous Multi-Stream Hidden Markov Model for offline Arabic handwriting recognition without explicit segmentation

Published: 19 November 2016 Publication History

Abstract

Arabic handwriting recognition is still a challenging task due especially to the unlimited variation in human handwriting, the large variety of Arabic character shapes, the presence of ligature between characters and overlapping of the components. In this paper, we propose an offline Arabic-handwritten recognition system for Tunisian city names. A review of the literature shows that the Hidden Markov Model (HMM) adopting the sliding window approach are the mainly used models, which gives good results when a relevant feature-extraction process is performed. However, these models are utilized especially to model one dimensional signal. Consequently, to model bi-dimensional signals or multiple features, a solution based on combining multi-classifiers and then a post-treatment selecting the best hypothesis is applied. The problem considered in this case consists in searching the best way to combine the contribution of these classifiers. In this study, we put forward an extension of the HMM, which can surmount this problem. Our proposed system is based on a synchronous multi-stream HMM which has the advantage of efficiently modelling the interaction between multiple features. These features are composed of a combination of statistical and structural ones, which are extracted over the columns and rows using a sliding window approach. In fact, two word models are implemented based on the holistic and analytical approaches without any explicit segmentation. In the first approach, all the classes share the same architecture nevertheless, the parameters are different. In the second approach, each class has its own model by concatenating their components models. The results carried out on the IFN/ENIT database show that the analytical approach performs better than the holistic one and that the data fusion model is more efficient than the state fusion model.

References

[1]
A. Lawgali, A survey on Arabic character recognition, Int. J. Signal Process. Image Process. Pattern Recognit., 8 (2015) 401-426.
[2]
M. Shatnawi, Off-line handwritten Arabic character recognition: a survey, in: Proceedings of the International Conference on Image Processing, Computer Vision, and Pattern Recognition (IPCV), 2015.
[3]
Y.M. Alginahi, A survey on Arabic character segmentation, Int. J. Doc. Anal. Recognit. (IJDAR), 16 (2013) 105-126.
[4]
L. Likforman-Sulem, M. Sigelle, Recognition of degraded characters using dynamic Bayesian networks, Pattern Recognit., 41 (2008) 3092-3103.
[5]
N. Murru, R. Rossini, A Bayesian approach for initialization of weights in back propagation neural net with application to character recognition, Neurocomputing (2016).
[6]
S. Naz, A.I. Umar, R. Ahmad, Offline cursive Urdu-Nastaliq script recognition using multidimensional recurrent neural networks, Neurocomputing (2015).
[7]
Q. Guo, F. Wang, J. Lei, Convolutional feature learning and Hybrid CNN-HMM for scene number recognition, Neurocomputing (2015).
[8]
G. Zhai, J. Chen, C. LI, Pattern recognition approach to identify loose particle material based on modified MFCC and HMMs, Neurocomputing, 155 (2015) 135-145.
[9]
D.D. Mauá, A. Antonucci, C.P. de Campos, Hidden Markov models with set-valued parameters, Neurocomputing (2015).
[10]
S. Naz, A.I. Umar, S.H. Shirazi, Segmentation techniques for recognition of Arabic-like scripts: a comprehensive survey, Educ. Inf. Technol. (2015) 1-17.
[11]
A. Giménez, I. Khoury, J. Andrés-Ferrer, Handwriting word recognition using windowed Bernoulli HMMs, Pattern Recognit. Lett., 35 (2014) 149-156.
[12]
M. Hamdani, H. El Abed, M. Kherallah, Combining multiple HMMs using on-line and off-line features for off-line Arabic handwriting recognition, in: Proceedings of the International Conference on Document Analysis and Recognition, 2009, pp. 201-205.
[13]
J.H. AlKhateeb, Word-based handwritten Arabic scripts recognition using dynamic Bayesian network, in: Proceedings of the International Conference on Information Technology, Amman-Jordan, 2011.
[14]
R. El-Hajj Mohamad, L. Likforman-Sulem, C. Mokbel, Combining slanted-frame classifiers for improved HMM-based Arabic handwriting recognition, IEEE Trans. Pattern Anal. Mach. Intell., 31 (2009) 1165-1177.
[15]
M. Glodek, F. Honold, T. Geier, Fusion paradigms in cognitive technical systems for human-computer interaction, Neurocomputing, 161 (2015) 17-37.
[16]
Y. Kessentini, T. Paquet, A.B. Hamadou, Off-line handwritten word recognition using multi-stream Hidden Markov Models, Pattern Recognit. Lett., 31 (2010) 60-70.
[17]
S. Poria, E. Cambria, N. Howard, Fusing audio, visual and textual clues for sentiment analysis from multimodal content, Neurocomputing, 174 (2016) 50-59.
[18]
O. Missaoui, H. Frigui, P. Gader, Multi-stream continuous Hidden Markov Models with application to landmine detection, EURASIP J. Adv. Signal Process., 1 (2013) 1-23.
[19]
N. Raman, S.J. Maybank, Activity Recognition using a supervised non-parametric hierarchical HMM, Neurocomputing (2016).
[20]
K. Jayech, N. Trimech, M.A. Mahjoub, Dynamic hierarchical Bayesian network for Arabic handwritten word recognition, in: Proceedings of the International Conference on Information and Communication Technology and Accessibility, 2013, pp. 1-6.
[21]
S.M. Touj, N. Essoukri Ben Amara, H. Amiri, Arabic handwritten words recognition based on a planar Hidden Markov Model, Int. Arab J. Inf. Technol., 2 (2005) 318-325.
[22]
K.P. Murphy, Dynamic Bayesian Networks: Representation, Inference and Learning, Doctoral Dissertation, University of California, Berkeley, 2002.
[23]
M. Mahjoub, N. Ghanmy, K. Jayech, Multiple models of Bayesian networks applied to offline recognition of Arabic handwritten city names, Int. J. Imaging Robot., 9 (2013) 84-105.
[24]
J. Al Abodi, L. Xue, An effective approach to offline Arabic handwriting recognition, Comput. Electr. Eng., 40 (2014) 1883-1901.
[25]
O.A. Arqub, Z. Abo-Hammour, Numerical solution of systems of second-order boundary value problems using continuous genetic algorithm, Inf. Sci., 279 (2014) 396-415.
[26]
A. Benouareth, A. Ennaji, M. Sellami, Semi-continuous HMMs with explicit state duration for unconstrained Arabic word modelling and recognition, Pattern Recognit. Lett., 29 (2008) 1742-1752.
[27]
M.T. Parvez, S.A. Mahmoud, Arabic handwriting recognition using structural and syntactic pattern attributes, Pattern Recognit., 46 (2013) 141-154.
[28]
K. Jayech, M.A. Mahjoub, N. Essoukri Ben, Amara, Arabic handwriting recognition based on synchronous multi-stream HMM without explicit segmentation, Hybrid. Artif. Intell. Syst. (2015) 22-24.
[29]
V. Edwin, C.D. Maciel, Efficient methods for learning Bayesian network super-structures, Neurocomputing, 123 (2014) 3-12.
[30]
A. Jarraya, L. Philippe, A. Masmoudi, Discrete exponential Bayesian networks: definition, learning and application for density estimation, Neurocomputing, 137 (2014) 142-149.
[31]
P.A.P. Ramirez, I.B. Utne, Use of dynamic Bayesian networks for life extension assessment of ageing systems, Reliab. Eng. Syst. Saf., 133 (2015) 119-136.
[32]
R. Donat, P. Leray, L. Bouillaut, A dynamic Bayesian network to represent discrete duration models, Neurocomputing, 73 (2010) 570-577.
[33]
H.H. Avilés-Arriaga, L. Sucar-Succar, C. Mendoza-Durán, A comparison of dynamic naive Bayesian classifiers and Hidden Markov Models for gesture recognition, J. Appl. Res. Technol., 9 (2011) 81-102.
[34]
M. Pechwitz, V. Maergner, H. El Abed, Comparison of two different feature sets for offline recognition of handwritten Arabic words, in: Proceedings of the International Workshop on Frontiers in Handwriting Recognition, 2006.
[35]
J.H. AlKhateeb, J. Jiang, J. Ren, Multiclass classification of unconstrained handwritten Arabic words using machine learning approaches, Open. Signal Process. J., 2 (2009) 21-28.
[36]
L. Dinges, A. Al-Hamadi, M. Elzobi, Offine automatic segmentation based recognition of handwritten Arabic words, International Journal of Signal Processing, Image Process. Pattern Recognit., 4 (2011) 131-143.
[37]
A. Ameen Khan, N.U. Reddy, M. Rao, Speaker recognition system using combined vector quantization and discrete Hidden Markov model, Int. J. Comput. Eng., 2 (2012) 692-696.
[38]
G. Abandah, F. Jamour, Recognizing handwritten Arabic script through efficient skeleton-based grapheme segmentation algorithm, in: Proceedings of the International Conference on Intelligent Systems Design and Applications, 2010, pp. 977-982.
[39]
P. Dreuw, G. Heigold, H. Ney, Confidence-based discriminative training for model adaptation in offline Arabic handwriting recognition, in: Proceedings of the International Conference on Document Analysis and Recognition, 2009, pp. 596-600.
[40]
A. Graves, J. Schmidhuber, Offline handwriting recognition with multidimensional recurrent neural networks, Adv. Neural Inf. Process. Syst. (2009) 545-552.
[41]
F. Menasri, Contributions à la reconnaissance de l'écriture arabe manuscrite (Thèse de doctorat), Université Paris Descartes, Paris, 2008.
[42]
M.P. Schambach, Model length adaptation of an HMM based cursive word recognition system, in: Proceedings of the International Conference on Document Analysis and Recognition, 2003.

Cited By

View all
  • (2023)Recent advances of ML and DL approaches for Arabic handwriting recognitionInternational Journal of Hybrid Intelligent Systems10.3233/HIS-23000519:1,2(61-78)Online publication date: 1-Jan-2023
  • (2023)Improved Learning for Online Handwritten Chinese Text Recognition with Convolutional Prototype NetworkDocument Analysis and Recognition - ICDAR 202310.1007/978-3-031-41685-9_3(38-53)Online publication date: 21-Aug-2023
  • (2021)Densely connected layer to improve VGGnet-based CRNN for Arabic handwriting text line recognitionInternational Journal of Hybrid Intelligent Systems10.3233/HIS-21000917:3-4(113-127)Online publication date: 1-Jan-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Neurocomputing
Neurocomputing  Volume 214, Issue C
November 2016
1063 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 19 November 2016

Author Tags

  1. Arabic handwriting recognition
  2. Dynamic Bayesian network
  3. Implicit segmentation
  4. Multi-Stream Hidden Markov Model

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Recent advances of ML and DL approaches for Arabic handwriting recognitionInternational Journal of Hybrid Intelligent Systems10.3233/HIS-23000519:1,2(61-78)Online publication date: 1-Jan-2023
  • (2023)Improved Learning for Online Handwritten Chinese Text Recognition with Convolutional Prototype NetworkDocument Analysis and Recognition - ICDAR 202310.1007/978-3-031-41685-9_3(38-53)Online publication date: 21-Aug-2023
  • (2021)Densely connected layer to improve VGGnet-based CRNN for Arabic handwriting text line recognitionInternational Journal of Hybrid Intelligent Systems10.3233/HIS-21000917:3-4(113-127)Online publication date: 1-Jan-2021
  • (2021)Arabic Handwritten Word Recognition Based on Stationary Wavelet Transform Technique using Machine LearningACM Transactions on Asian and Low-Resource Language Information Processing10.1145/347439121:3(1-21)Online publication date: 13-Dec-2021
  • (2021)Holistic word descriptor for lexicon reduction in handwritten arabic documentsPattern Recognition10.1016/j.patcog.2021.108072119:COnline publication date: 1-Nov-2021
  • (2020)Automatic processing of Historical Arabic DocumentsPattern Recognition10.1016/j.patcog.2019.107144100:COnline publication date: 1-Apr-2020
  • (2020)Recognition of Off-line Handwritten Uyghur Words Using Bayesian Networks with Grapheme NodesSN Computer Science10.1007/s42979-020-00308-71:5Online publication date: 4-Sep-2020
  • (2019)Hybrid one-class classifier ensemble based on fuzzy integral for open-lexicon handwritten Arabic word recognitionPattern Analysis & Applications10.1007/s10044-018-0735-y22:1(99-113)Online publication date: 1-Feb-2019
  • (2019)An improved discriminative region selection methodology for online handwriting recognitionInternational Journal on Document Analysis and Recognition10.1007/s10032-018-0314-122:1(1-14)Online publication date: 1-Mar-2019
  • (2018)Information-dense actions as contextsNeurocomputing10.1016/j.neucom.2018.05.056311:C(164-175)Online publication date: 15-Oct-2018

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media