Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition

Wang, Xiong; Sun, Sining; Xie, Lei; Ma, Long

arXiv:2106.09236 (cs)

[Submitted on 17 Jun 2021]

Abstract: End-to-end models are favored in automatic speech recognition (ASR) because of their simplified system structure and superior performance. Among these models, Transformer and Conformer have achieved state-of-the-art recognition accuracy, with self-attention playing a vital role in capturing important global information. However, the time and memory complexity of self-attention grows quadratically with the length of the sentence. In this paper, a prob-sparse self-attention mechanism is introduced into Conformer to sparsify the computation of self-attention, in order to accelerate inference and reduce memory consumption. Specifically, we adopt a Kullback-Leibler divergence based sparsity measurement for each query to decide whether to compute the attention function for that query. By using the prob-sparse attention mechanism, we achieve an 8% to 45% inference speed-up and a 15% to 45% reduction in memory usage for the self-attention module of Conformer Transducer while maintaining the same level of error rate.
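
The per-query selection described in the abstract can be illustrated with a small sketch. The abstract does not state the formula, so the code below assumes the sparsity measurement used in the Informer paper, where prob-sparse attention was first proposed: the log-sum-exp of a query's scaled dot products minus their mean, which equals the KL divergence between a uniform distribution and that query's attention distribution up to an additive constant. The function names, the `n_active` parameter, and the choice to give unselected queries the mean of the values are all assumptions made for illustration, not details confirmed by this paper.

```python
import math


def scaled_scores(q, keys):
    """Scaled dot products between one query and every key."""
    scale = math.sqrt(len(q))
    return [sum(a * b for a, b in zip(q, k)) / scale for k in keys]


def sparsity_score(q, keys):
    """Hypothetical helper: log-sum-exp minus mean of the scaled scores.

    Assumed form (Informer-style). It equals
    KL(uniform || softmax(scores)) + ln(len(keys)), so a query whose
    attention is close to uniform gets the lowest score.
    """
    scores = scaled_scores(q, keys)
    peak = max(scores)  # subtract the max for numerical stability
    lse = peak + math.log(sum(math.exp(s - peak) for s in scores))
    return lse - sum(scores) / len(scores)


def softmax_attend(q, keys, values):
    """Ordinary softmax attention output for a single query."""
    scores = scaled_scores(q, keys)
    peak = max(scores)
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[c] for w, v in zip(weights, values)) / total
            for c in range(dim)]


def prob_sparse_attention(queries, keys, values, n_active):
    """Attend only for the n_active queries with the highest sparsity score.

    Assumption: the remaining queries receive the mean of the values,
    which is what uniform attention would produce.
    """
    ranked = sorted(range(len(queries)),
                    key=lambda i: sparsity_score(queries[i], keys),
                    reverse=True)
    active = set(ranked[:n_active])
    dim = len(values[0])
    mean_value = [sum(v[c] for v in values) / len(values) for c in range(dim)]
    outputs = [softmax_attend(q, keys, values) if i in active
               else list(mean_value)
               for i, q in enumerate(queries)]
    return outputs, active
```

This sketch still computes every query-key score in order to rank the queries, so it shows only the selection rule and not the speed-up itself; a practical implementation would need a cheaper estimate of the score to realize the savings the abstract reports.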

Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2106.09236 [cs.SD]
	(or arXiv:2106.09236v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2106.09236

Submission history

From: Xiong Wang
[v1] Thu, 17 Jun 2021 04:04:04 UTC (528 KB)
