Computer Science > Computer Vision and Pattern Recognition
[Submitted on 17 Oct 2014 (this version), latest version 17 Oct 2015 (v3)]
Title:Large Vocabulary Arabic Online Handwriting Recognition System
View PDFAbstract:Arabic handwriting is a consonantal and cursive writing. The analysis of Arabic script is further complicated due to obligatory dots/strokes that are placed above or below most letters and usually written delayed in order. Due to ambiguities and diversities of the different writing styles, recognition systems are generally based on a set of possible words called lexicon (vocabulary). When the lexicon is small, recognition accuracy is more important as the recognition time is minimal. On the other hand, recognition speed as well as the accuracy are critical issues when handling large lexicons. Arabic language is rich in morphology and syntax which makes its lexicon large. Therefore, a practical online handwriting recognition system should be able to handle the large lexicon of the Arabic language with reasonable performance in terms of both accuracy and time.
In this paper, we introduce a fully-fledged Hidden Markov Model (HMM) based system for Arabic online handwriting recognition that provides solutions for most of the difficulties inherent in recognizing the Arabic script. A new preprocessing technique for handling the delayed strokes is introduced. We use advanced modeling techniques for building our recognition system from the training data to provide more detailed representation for the differences between the writing units, minimize the variances between writers in the training data, enhance the models discrimination power and have a better representation for the features space. The system results are enhanced using an additional post-processing step using a higher order language model. The system performance is evaluated using two databases covering small and large lexicons. Our proposed system outperforms state-of-art systems for the small lexicon database and shows promising results (accuracy and time) when supporting large vocabulary.
Submission history
From: Ibrahim Abdelaziz [view email][v1] Fri, 17 Oct 2014 11:09:35 UTC (286 KB)
[v2] Sat, 8 Nov 2014 14:31:48 UTC (248 KB)
[v3] Sat, 17 Oct 2015 09:47:28 UTC (248 KB)
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
Connected Papers (What is Connected Papers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.