Code-Switching without Switching: Language Agnostic End-to-End Speech Translation

Huber, Christian; Ugan, Enes Yavuz; Waibel, Alexander

Computer Science > Computation and Language

arXiv:2210.01512 (cs)

[Submitted on 4 Oct 2022 (v1), last revised 9 Nov 2022 (this version, v2)]

Title:Code-Switching without Switching: Language Agnostic End-to-End Speech Translation

Authors:Christian Huber, Enes Yavuz Ugan, Alexander Waibel

View PDF

Abstract:We propose a) a Language Agnostic end-to-end Speech Translation model (LAST), and b) a data augmentation strategy to increase code-switching (CS) performance. With increasing globalization, multiple languages are increasingly used interchangeably during fluent speech. Such CS complicates traditional speech recognition and translation, as we must recognize which language was spoken first and then apply a language-dependent recognizer and subsequent translation component to generate the desired target language output. Such a pipeline introduces latency and errors. In this paper, we eliminate the need for that, by treating speech recognition and translation as one unified end-to-end speech translation problem. By training LAST with both input languages, we decode speech into one target language, regardless of the input language. LAST delivers comparable recognition and speech translation accuracy in monolingual usage, while reducing latency and error rate considerably when CS is observed.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2210.01512 [cs.CL]
	(or arXiv:2210.01512v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.01512

Submission history

From: Christian Huber [view email]
[v1] Tue, 4 Oct 2022 10:34:25 UTC (172 KB)
[v2] Wed, 9 Nov 2022 16:52:45 UTC (170 KB)

Computer Science > Computation and Language

Title:Code-Switching without Switching: Language Agnostic End-to-End Speech Translation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Code-Switching without Switching: Language Agnostic End-to-End Speech Translation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators