Optimizing Alignment of Speech and Language Latent Spaces for End-To-End Speech Recognition and Understanding.

AllBooks Images Shopping Maps Videos News

Optimizing Alignment of Speech and Language Latent Spaces for End-to ...

Oct 23, 2021 · The modality switch training randomly swaps speech and text embeddings based on the forced alignment result to learn a joint representation ...

optimizing alignment of speech and language latent spaces for

ieeexplore.ieee.org › iel7

ABSTRACT. The advances in attention-based encoder-decoder (AED) networks have brought great progress to end-to-end (E2E) automatic speech recognition (ASR).

Optimizing Alignment of Speech and Language Latent Spaces for End ...

www.semanticscholar.org › paper › Opti...

This paper proposes an embedding aligner and modality switch training to better align the speech and text latent spaces and proves its effectiveness on ...

Optimizing Alignment of Speech and Language Latent Spaces for End-to ...

www.researchgate.net › ... › Speech

The modality switch training randomly swaps speech and text embeddings based on the forced alignment result to learn a joint representation space. Experimental ...

Optimizing Alignment of Speech and Language Latent Spaces for End ...

www.researchgate.net › publication › 36...

... Our technique decreases Librispeech ASR WER by 14% to 19%. We also tested its influence on spoken language understanding (SLU) and saw a 2.5% to 2.8% F1 ...

[PDF] optimizing alignment of speech and language latent spaces for end-to ...

nlp.csie.ntust.edu.tw › files › meeting

Nov 11, 2021 · OPTIMIZING ALIGNMENT OF SPEECH AND LANGUAGE. LATENT SPACES FOR END-TO-END SPEECH. RECOGNITION AND UNDERSTANDING.

Optimizing Alignment of Speech and Language Latent Spaces for End-to ...

slides.hanklu.tw › optimize-speech-and-l...

Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding ; ASR results. Adding text encoder trained with ...

ddlBoJack/Awesome-Speech ... - GitHub

github.com › Awesome-Speech-Pretraining

Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding - W Wang et al, INTERSPEECH 2022; STPT: Unified ...

[PDF] arXiv:2303.10949v1 [eess.AS] 20 Mar 2023

arxiv.org › pdf

Mar 20, 2023 · Qian, and Michael Zeng, “Optimizing alignment of speech and language latent spaces for end-to-end speech recognition and understanding,” in ...

Shujie Liu at Microsoft Research

www.microsoft.com › my-publications

Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding. ICASSP 2022. Zhengyang Chen, Sanyuan Chen, Yu ...