A Novel Method for Rapid Speaker Adaptation Using Reference Support Speaker Selection

Wang, Jian; Yang, Zhen; Lei, Jianjun; Guo, Jun

doi:10.1007/11940098_47

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4285)

Included in the following conference series:

International Conference on Computer Processing of Oriental Languages

Abstract

In this paper, we propose a novel method for rapid speaker adaptation based on speaker selection, called reference support speaker selection (RSSS). Speakers who are acoustically close to the test speaker are selected from a set of reference speakers using our proposed algorithm. A single-pass re-estimation procedure, conditioned on the selected speakers, is then presented. The proposed method can quickly obtain a better reference speaker subset because the selection is determined dynamically according to reference support vectors. The adaptation strategy was evaluated on a large-vocabulary speech recognition task, and the experiments confirm the effectiveness of the proposed method.
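The general shape of selection-based adaptation described in the abstract can be illustrated with a minimal sketch. It is not the RSSS algorithm itself: the abstract states that selection is driven by reference support vectors but gives no details, so the sketch substitutes a plain Euclidean nearest-neighbour selection. The function names, the per-speaker vectors, and the `(count, sum)` statistics layout are all hypothetical, chosen only for illustration.

```python
import math


def select_close_speakers(test_vec, reference_vecs, k):
    """Return ids of the k reference speakers nearest to test_vec.

    ASSUMPTION: Euclidean distance between per-speaker vectors stands in
    for the paper's support-vector-based selection, which is not described
    in the abstract.
    """
    ranked = sorted(
        reference_vecs,
        key=lambda sid: math.dist(test_vec, reference_vecs[sid]),
    )
    return ranked[:k]


def reestimate_mean(selected, stats):
    """Re-estimate one Gaussian mean in a single pass over stored statistics.

    stats maps speaker id -> (occupancy count, per-dimension sum of frames).
    Pooling the statistics of the selected speakers avoids revisiting
    their raw training data. The layout is a hypothetical illustration.
    """
    total_count = sum(stats[sid][0] for sid in selected)
    if total_count == 0:
        raise ValueError("selected speakers carry no occupancy")
    dim = len(stats[selected[0]][1])
    pooled = [sum(stats[sid][1][d] for sid in selected) for d in range(dim)]
    return [value / total_count for value in pooled]


# Toy data: three reference speakers in a 2-dimensional space.
reference_vecs = {
    "spk_a": [0.0, 0.0],
    "spk_b": [1.0, 1.0],
    "spk_c": [5.0, 5.0],
}
stats = {
    "spk_a": (10.0, [0.0, 0.0]),
    "spk_b": (30.0, [30.0, 30.0]),
    "spk_c": (20.0, [100.0, 100.0]),
}

chosen = select_close_speakers([0.9, 1.2], reference_vecs, k=2)
adapted_mean = reestimate_mean(chosen, stats)
```

Here the distant speaker `spk_c` is excluded, so the adapted mean is computed only from the two speakers closest to the test speaker.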

Author information

Authors and Affiliations

School of Information Engineering, Beijing University of Posts and Telecommunications, 100876, Beijing, China
Jian Wang, Zhen Yang, Jianjun Lei & Jun Guo

Editor information

Editors and Affiliations

Graduate School of Information Science, Nara Institute of Science and Technology, 630-0192, Takayama, Ikoma, Nara, Japan
Yuji Matsumoto
Dept of ECE, University of Illinois at Urbana Champaign, IL 61801, Urbana, USA
Richard W. Sproat
Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong
Kam-Fai Wong
State Key Lab of Intelligent Tech. & Sys., Tsinghua University,
Min Zhang

About this paper

Cite this paper

Wang, J., Yang, Z., Lei, J., Guo, J. (2006). A Novel Method for Rapid Speaker Adaptation Using Reference Support Speaker Selection. In: Matsumoto, Y., Sproat, R.W., Wong, KF., Zhang, M. (eds) Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead. ICCPOL 2006. Lecture Notes in Computer Science (LNAI), vol 4285. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11940098_47

DOI: https://doi.org/10.1007/11940098_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49667-0
Online ISBN: 978-3-540-49668-7
eBook Packages: Computer Science, Computer Science (R0)
