DOI: 10.1145/2596695.2596712
Research article

Making Arabic PDF books accessible using gamification

Published: 07 April 2014

Abstract

Most online Arabic books are not accessible to Arab people with visual impairments. These readers cannot read the books online because the books are usually scanned images of the originals. Some text-based books also suffer from PDF encoding problems. One solution is to use Arabic OCR to convert scanned books into text; however, Arabic OCR is still in its early stages and suffers from many limitations. In this paper, we propose using human recognition skills to compensate for the limitations of OCR by incorporating the concepts of crowdsourcing and gamification. Our proposed system takes the form of a mobile recall game that presents players with word images segmented from the books to be converted into text. The players' answers are checked using techniques similar to those used in word spotting. We initially implemented two components of the system: the segmentation component, and the feature extraction and matching component. For the feature extraction and matching component, which is used to verify the players' answers, we performed four tests to choose a similarity measure threshold for accepting an entered word as a correct answer. Future work will consider other means of ensuring input correctness.
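The threshold-based answer check described in the abstract can be illustrated with a minimal sketch. This is not the paper's method: the abstract does not specify the features or the similarity measure, so the sketch substitutes a simple column projection profile and cosine similarity as stand-ins. The function names, the binary-image representation, the assumption that both images have the same width, and the threshold value of 0.9 are all hypothetical.

```python
from math import sqrt


def projection_profile(image):
    """Return the ink count of each column of a binary image.

    `image` is a list of rows, each a list of 0/1 pixels (hypothetical format).
    """
    return [sum(column) for column in zip(*image)]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def accept_answer(segmented_image, rendered_answer_image, threshold=0.9):
    """Accept the player's answer if its rendered image is similar enough.

    Assumes both images have the same width. The threshold is a placeholder,
    not a value reported by the paper.
    """
    similarity = cosine_similarity(
        projection_profile(segmented_image),
        projection_profile(rendered_answer_image),
    )
    return similarity >= threshold


# Toy 2x4 images: one word image, and one with ink in different columns.
word = [[0, 1, 1, 0],
        [0, 1, 0, 0]]
other = [[1, 0, 0, 1],
         [1, 0, 0, 1]]

print(accept_answer(word, word))
print(accept_answer(word, other))
```

In a sketch like this, the threshold trades false acceptances against false rejections, which is presumably why the authors ran tests to choose it rather than fixing it in advance.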


Published In

W4A '14: Proceedings of the 11th Web for All Conference
April 2014
192 pages
ISBN: 9781450326513
DOI: 10.1145/2596695
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Arabic
  2. PDF documents
  3. accessibility
  4. eBooks

Qualifiers

  • Research-article

Conference

W4A '14
Sponsor:
  • ACM
  • Ability Magazine
  • IW3C2
  • TPG

Acceptance Rates

W4A '14 Paper Acceptance Rate 6 of 14 submissions, 43%;
Overall Acceptance Rate 171 of 371 submissions, 46%

Cited By

  • (2024) Gamification in Crowdsourcing Applications. Encyclopedia of Computer Graphics and Games, pp. 819-824. DOI: 10.1007/978-3-031-23161-2_46. Online publication date: 5-Jan-2024
  • (2017) Accessibility of Portable Document Format in Education Repositories. Proceedings of the 9th International Conference on Education Technology and Computers, pp. 239-242. DOI: 10.1145/3175536.3175574. Online publication date: 20-Dec-2017
  • (2016) Expense Control. Proceedings of the 21st International Conference on Intelligent User Interfaces, pp. 31-42. DOI: 10.1145/2856767.2856790. Online publication date: 7-Mar-2016
  • (2016) Gamification Solutions to Enhance Software User Engagement—A Systematic Review. International Journal of Human-Computer Interaction 32:8, pp. 613-642. DOI: 10.1080/10447318.2016.1183330. Online publication date: 2-May-2016
  • (2015) Gamification in Crowdsourcing Applications. Encyclopedia of Computer Graphics and Games, pp. 1-6. DOI: 10.1007/978-3-319-08234-9_46-1. Online publication date: 12-Dec-2015
