Trainning Tesseract
Trainning Tesseract
Trainning Tesseract
net/publication/277142272
CITATION
3 authors:
Mohammed Oumsis
High School of Technology-Salé, Mohammed V-Agdal University, Rab…
45 PUBLICATIONS 130 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Fadoua Ataa Allah on 11 June 2015.
Abstract: - The Optical Character Recognition is the operation of converting a text image into an editable
text file. Several tools have been developed as OCR systems. Techniques used in each system vary from
one system to another, therefore the accuracy changes. In this paper, we present an example of available
OCR tools, and we train TESSERACT tool on the Amazigh language transcribed in Latin characters.
5. Training Tesseract on Amazigh This interface (Figure 5) allows adding text file
language containing characters to train, define the font
desired and specify noise degree in order to
5.1. Why Tesseract? generate boxes.