Comparative Error Analysis in Neural and Finite-state Models for Unsupervised Character-level Transduction

Ryskina, Maria; Hovy, Eduard; Berg-Kirkpatrick, Taylor; Gormley, Matthew R.

Computer Science > Computation and Language

arXiv:2106.12698 (cs)

[Submitted on 24 Jun 2021]

Title:Comparative Error Analysis in Neural and Finite-state Models for Unsupervised Character-level Transduction

Authors:Maria Ryskina, Eduard Hovy, Taylor Berg-Kirkpatrick, Matthew R. Gormley

View PDF

Abstract:Traditionally, character-level transduction problems have been solved with finite-state models designed to encode structural and linguistic knowledge of the underlying process, whereas recent approaches rely on the power and flexibility of sequence-to-sequence models with attention. Focusing on the less explored unsupervised learning scenario, we compare the two model classes side by side and find that they tend to make different types of errors even when achieving comparable performance. We analyze the distributions of different error classes using two unsupervised tasks as testbeds: converting informally romanized text into the native script of its language (for Russian, Arabic, and Kannada) and translating between a pair of closely related languages (Serbian and Bosnian). Finally, we investigate how combining finite-state and sequence-to-sequence models at decoding time affects the output quantitatively and qualitatively.

Comments:	Accepted to SIGMORPHON 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2106.12698 [cs.CL]
	(or arXiv:2106.12698v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2106.12698

Submission history

From: Maria Ryskina [view email]
[v1] Thu, 24 Jun 2021 00:09:24 UTC (5,488 KB)

Computer Science > Computation and Language

Title:Comparative Error Analysis in Neural and Finite-state Models for Unsupervised Character-level Transduction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Comparative Error Analysis in Neural and Finite-state Models for Unsupervised Character-level Transduction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators