Aug 27, 2021 · In this paper, we propose to jointly learn representations during pretraining from two different modalities: speech and text.
Unspoken text is complementary to un-transcribed speech in self-supervised learning. It is also much easier to collect than un-transcribed speech. Pretraining ...
Self-supervised pretraining for Automated Speech Recognition (ASR) has shown varied degrees of success. In this paper, we propose to jointly learn ...
Aug 30, 2021 · We demonstrate that this novel pretraining method yields Word Error Rate (WER) reductions of 10% relative on the well-benchmarked, Librispeech ...
The TTS4ASR line of work [23, 24, 25] uses an approach similar to dual learning, in which a TTS model is used to provide supervision for unpaired audio before ...
Aug 31, 2021 · In this tutorial I explain the paper "Injecting Text in Self-Supervised Speech Pre-Training" by Zhehuai Chen, Yu Zhang, Andrew Rosenberg, ...
Apr 28, 2022 · Combines pretraining on untranscribed speech with pretraining on text that has no paired speech, by using TTS. ASR also [benefits from] self-supervision ...
Dec 14, 2023 · Injecting text into the self-supervised speech pre-training task has been widely studied, including ...
Self-supervised learning from speech signals aims to learn the latent structure inherent in the signal, while self-supervised learning from text attempts to ...
May 22, 2022 · Abstract. We describe a method to jointly pre-train speech and text in an encoder-decoder modeling framework for speech translation and ...