Abstract
Text segmentation is an essential pre-processing stage for many systems such as text recognition and word spotting. However, few methods have been published for Arabic text segmentation. In Arabic handwritten documents, separating text into words is challenging due to the enormous different Arabic handwriting styles. In this paper, we present a new segmentation methodology of an Arabic handwritten text line into words. Our proposed approach of text segmentation utilizes the knowledge of Arabic writing characteristics. This method shows promising results.
Chapter PDF
Similar content being viewed by others
References
Huang, C., Srihari, S.: Word Segmentation of Off-line Handwritten Documents. In: Proceedings of the Document Recognition and Retrieval (DRR) XV, IST/SPIE Annual Symposium, San Jose, CA, USA, vol. 6815 (2008)
Olivier, G., Miled, H., Romeo, K., Lecourtier, Y.: Segmentation and Coding of Arabic Handwritten Words. In: Proceedings of 13th International Conference of Pattern Recognition (ICPR 1996), vol. 3, pp. 264–268 (1996)
Amin, A.: Recognition of Printed Arabic Text based on Global Features and Decision Tree Learning Techniques. Pattern Recognition 33(8), 1309–1323 (2000)
Miled, H., Amara, N.E.B.: Planar Markov Modeling for Arabic Writing Recognition: Advancement State. In: Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR 2001), Seattle, USA, pp. 69–73 (2001)
Westall, J.M., Narasimha, M.S.: Vertex Directed Segmentation of Handwritten Numerals. Pattern Recognition 26(10), 1,473–1,186 (1993)
AlKhateeb, J.H., Ren, J., Ipson, S.S., Jiang, J.: Knowledge-based Baseline Detection and Optimal Thresholding for Words Segmentation in Efficient Pre-processing of handwritten Arabic text. In: Proceedings of 5th International Conference on Information Technology: New Generations (ITNG 2008), Las Vegas, Nevada, pp. 1158–1159 (2008)
Kchaou, M., Kanoun, S., Ogier, J.: Segmentation and Word Spotting Methods for Printed and Handwritten Arabic Texts: A Comparative Study. In: Proceedings of 13th International Conference on Frontiers in Handwriting Recognition (ICFHR 2012), Bari, Italy, pp. 274–279 (2012)
AlKhateeb, J.H., Jiang, J., Ren, J., Ipson, S.: Component-based Segmentation of Words from Handwritten Arabic text. In: Proceedings of World Academy of Science, Engineering and Technology (WASET), Vienna, Austria, vol. 31 (2008) ISSN: 1307- 6884
Elanwar, R.I., Rashwan, M., Mashali, S.: Arabic Online Word Extraction from Handwritten Text Using SVM-RBF Classifiers Decision Fusion. In: Proceedings of the 4th WSEAS International Conference on Nanotechnology, Cambridge, UK, pp. 68–73 (2012)
Pechwitz, M., Maddouri, S.S., M"argner, V., Ellouze, N., Amiri, H.: IFN/ENIT- Database of Handwritten Arabic Words. In: 7th Colloque International Francophone sur l’Ecrit et le Document (CIFED), Hammamet, Tunis, vol. 2, pp. 127–136 (2002)
Al-Ohali, Y., Cheriet, M., Suen, C.Y.: Databases for Recognition of Handwritten Arabic Cheques. Pattern Recognition 36(1), 111–121 (2003)
Jamal, A.T., Suen, C.Y.: Removal of Secondary Components of Arabic Handwritten Words Using Morphological Reconstruction. In: Proceedings of the 2nd International Conference on Information Technology (ICIT), Dubai, UAE (2014)
Manmatha, R., Rath, T.M.: Indexing Handwritten Historical Documents - Recent Progress. In: Proceedings of the Symposium on Document Image Understanding Technology, Greenbelt, USA, pp. 77–85 (2003)
Louloudis, G., Gatos, B., Pratikakis, I., Halatsis, C.: Text Line and Word Segmentation of Handwritten Documents. Pattern Recognition 42(12), 3169–3183 (2009)
Nikolaos, S., Gatos, B., Louloudis, G., Pal, U., Alaei, A.: ICDAR 2013 Handwriting Segmentation Contest. In: Proceedings of 12th International Conference on Document Analysis and Recognition (ICDAR), Washington DC, USA, pp. 1402–1406 (2013)
Al-Khayat, M., Lam, L., Suen, C.Y., Yin, F., Liu, C.-L.: Arabic Handwritten Text Line Extraction by Applying an Adaptive Mask to Morphological Dilation. In: Proceedings of the 10th IAPR International Workshop on Document Analysis Systems (DAS), Gold Coast, Australia, pp. 100–104 (2012)
Aghbari, Z., Brook, S.: HAH manuscripts: A Holistic Paradigm for Classifying and Retrieving Historical Arabic Handwritten Documents: Expert Systems with Applications. An International Journal 36(8), 10942–10951 (2009)
Srihari, S., Srinivasan, H., Babu, P., Bhole, C.: Handwritten Arabic Word Spotting Using the CEDARABIC Document Analysis System. In: Proceedings of Symposium on Document Image Understanding Technology (SDIUT), pp. 123–132. College Park, USA (2005)
Alamri, H., Sadri, J., Suen, C.Y., Nobile, N.: A Novel Comprehensive Database for Arabic Off-line Handwriting Recognition. In: Proceedings of 11th International Conference on Frontiers in Handwriting Recognition (ICFHR 2008), Montreal, Canada, pp. 664–669 (2008)
Chang, C., Lin, C.: LIBSVM: A Library for Support Vector Machines (2001), http://www.csie.ntu.edu.tw/cjlin/libsvm
Gatos, B., Pratikakis, I., Kesidis, A.L., Perantonis, S.J.: Efficient Off-Line Cursive Handwriting Word Recognition. In: The Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition (IWFHR 2006), La Baule, France, pp. 121–125 (2006)
Liu, C.-L., Nakashima, K., Sako, H., Fujisawa, H.: Handwritten Digit Recognition: Investigation of Normalization and Feature Extraction Techniques. Pattern Recognition 37(2), 265–279 (2004)
Zhang, T.Y., Suen, C.Y.: A Fast Parallel Algorithm for Thinning Digital Patterns. Communications of the ACM 27(3), 236–239 (1984)
Sagheer, M.W.: Novel Word Recognition and Word Spotting Systems for Offline Urdu Handwriting. Master Thesis, Concordia University, Montreal, Canada (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Jamal, A.T., Nobile, N., Suen, C.Y. (2014). End-Shape Recognition for Arabic Handwritten Text Segmentation. In: El Gayar, N., Schwenker, F., Suen, C. (eds) Artificial Neural Networks in Pattern Recognition. ANNPR 2014. Lecture Notes in Computer Science(), vol 8774. Springer, Cham. https://doi.org/10.1007/978-3-319-11656-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-11656-3_21
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11655-6
Online ISBN: 978-3-319-11656-3
eBook Packages: Computer ScienceComputer Science (R0)