MOJ-DB

A database of Arabic historical subwords Under LICENSE-CC-BY-NC-ND-4.0 Original Paper : https://doi.org/10.1016/j.patrec.2022.04.040

The proposed database contains 560000 subwords distributed on 5600 different classes. It was built using 64 pages extracted from 10 books written in the 17th and 16th centuries. MOJ-DB database is divided into three sets; 70%,20%, and 10% for training, testing, and validation, respectively. Ground truth is established iteratively to guarantee minimal error. It includes information about the subword as of the sourcebook and page. We conducted several experiments to verify the robustness of the proposed database as well as the validity of the segmentation process. The database is freely available for the public research community. It can be used for word and subword recognition, word spotting, subword extraction, and database construction.

To get access to this material, please contact the author abdelhay.zoizou@usmba.ac.ma / zoizou.abdelhay@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.gitattributes		.gitattributes
LICENCE.md		LICENCE.md
README.md		README.md
description.txt		description.txt
gitattributes		gitattributes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MOJ-DB

About

Releases

Packages

License

Abdelhay-Z/MOJ-DB

Folders and files

Latest commit

History

Repository files navigation

MOJ-DB

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages