Abstract
In this paper, we propose a novel method for rapid speaker adaptation based on speaker selection, called reference support speaker selection (RSSS). Speakers who are acoustically close to the test speaker are selected from a set of reference speakers using the proposed algorithm. A single-pass re-estimation procedure conditioned on the selected speakers is then presented. The proposed method can quickly obtain a better reference speaker subset because the selection is determined dynamically according to reference support vectors. The adaptation strategy was evaluated on a large-vocabulary speech recognition task, and the experiments confirm the effectiveness of the proposed method.
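The general select-then-re-estimate idea can be illustrated with a minimal sketch. The abstract does not give the details of RSSS, so everything here is an assumption made for illustration: each speaker is represented by a fixed-length vector, closeness is plain Euclidean distance (not the support-vector-based selection of the paper), and re-estimation is a count-weighted average of the selected speakers' mean vectors. The names `select_speakers` and `reestimate_mean` are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_speakers(test_vec, reference_vecs, k):
    """Return the ids of the k reference speakers closest to the test speaker.

    Assumption: closeness is Euclidean distance between speaker vectors.
    This stands in for the paper's support-vector-based selection.
    """
    ranked = sorted(
        reference_vecs,
        key=lambda sid: euclidean(test_vec, reference_vecs[sid]),
    )
    return ranked[:k]


def reestimate_mean(selected_ids, speaker_means, speaker_counts):
    """Count-weighted average of the selected speakers' mean vectors.

    Assumption: a single pass over stored per-speaker statistics
    (a mean vector and a frame count) is enough to build the adapted mean.
    """
    total = sum(speaker_counts[s] for s in selected_ids)
    dim = len(speaker_means[selected_ids[0]])
    return [
        sum(speaker_counts[s] * speaker_means[s][d] for s in selected_ids) / total
        for d in range(dim)
    ]


# Toy data, invented for this example only.
reference = {"spk_a": [0.0, 0.0], "spk_b": [1.0, 1.0], "spk_c": [5.0, 5.0]}
counts = {"spk_a": 100, "spk_b": 300, "spk_c": 50}

chosen = select_speakers([0.2, 0.1], reference, k=2)
adapted = reestimate_mean(chosen, reference, counts)
print(chosen, adapted)
```

In this toy setup the distant speaker `spk_c` is excluded, and the adapted mean is pulled toward `spk_b` because it contributes more frames.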
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, J., Yang, Z., Lei, J., Guo, J. (2006). A Novel Method for Rapid Speaker Adaptation Using Reference Support Speaker Selection. In: Matsumoto, Y., Sproat, R.W., Wong, KF., Zhang, M. (eds) Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead. ICCPOL 2006. Lecture Notes in Computer Science, vol 4285. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11940098_47
DOI: https://doi.org/10.1007/11940098_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49667-0
Online ISBN: 978-3-540-49668-7
eBook Packages: Computer Science (R0)