Transformer-based HTR for Historical Documents

Ströbel, Phillip Benjamin; Clematide, Simon; Volk, Martin; Hodel, Tobias

arXiv:2203.11008 (cs)

[Submitted on 21 Mar 2022]

Abstract: We apply the TrOCR framework to real-world historical manuscripts and show that TrOCR per se is a strong model, ideal for transfer learning. Although TrOCR has been trained on English only, it adapts fairly easily, and with little training material, to other languages that use the Latin alphabet. We compare TrOCR against a SOTA HTR framework (Transkribus) and show that it can beat such systems. This finding is essential since Transkribus performs best when it has access to baseline information, which is not needed at all to fine-tune TrOCR.

Comments:	This is an abstract submitted to and accepted at ComHum 2022 in Lausanne. We will elaborate on these initial findings in the paper that we will submit after the conference.
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
Cite as:	arXiv:2203.11008 [cs.CV]
	(or arXiv:2203.11008v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.11008

Submission history

From: Phillip Benjamin Ströbel
[v1] Mon, 21 Mar 2022 14:23:10 UTC (1,746 KB)
