Toward accessible comics for blind and low vision readers

Rigaud, Christophe; Burie, Jean-Christophe; Petit, Samuel

Computer Science > Artificial Intelligence

arXiv:2407.08248 (cs)

[Submitted on 11 Jul 2024 (v1), last revised 10 Sep 2024 (this version, v2)]

Title:Toward accessible comics for blind and low vision readers

Authors:Christophe Rigaud (L3I), Jean-Christophe Burie (L3I), Samuel Petit (Comix AI)

View PDF

Abstract:This work explores how to fine-tune large language models using prompt engineering techniques with contextual information for generating an accurate text description of the full story, ready to be forwarded to off-the-shelve speech synthesis tools. We propose to use existing computer vision and optical character recognition techniques to build a grounded context from the comic strip image content, such as panels, characters, text, reading order and the association of bubbles and characters. Then we infer character identification and generate comic book script with context-aware panel description including character's appearance, posture, mood, dialogues etc. We believe that such enriched content description can be easily used to produce audiobook and eBook with various voices for characters, captions and playing sound effects.

Comments:	Accepted to MANPU 2024 (Athens, Greece, August 30, 2024)
Subjects:	Artificial Intelligence (cs.AI)
Report number:	Published at MANPU 2024 (Athens, Greece, August 30, 2024)
Cite as:	arXiv:2407.08248 [cs.AI]
	(or arXiv:2407.08248v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2407.08248

Submission history

From: Christophe Rigaud [view email] [via CCSD proxy]
[v1] Thu, 11 Jul 2024 07:50:25 UTC (1,182 KB)
[v2] Tue, 10 Sep 2024 07:59:21 UTC (2,618 KB)

Computer Science > Artificial Intelligence

Title:Toward accessible comics for blind and low vision readers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Toward accessible comics for blind and low vision readers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators