Abstract
Control access to secret or personal information by using the speaker voice transmitted by long distance communication systems, such as the telephone system, requires accuracy and robustness of the identification or identity verification system, since the speech signal is distorted during the transmission process. Taking in consideration these requirements, a robust text independent speaker identifications system is proposed in which the speaker features are extracted using the Lineal Prediction Cepstral Coefficients (LPCEPSTRAL) and the Gaussian Mixture Models, which provides the features distribution and estimates the optimum model for each speaker, is used for identification. The proposed system, was evaluate using a data-base of 80 different speakers, with a pronoun phrase of 3-5s and digits in Japanese language stored during 4 months. Evaluation results show that proposed system achieves more than 90% of recognition rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Douglas A. Reynolds, “Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models”, IEEE Transactions on Speech and Audio Processing, Vol. 3,No.1, January 1995.
Richard J. Mammone, Xiaoyu Zhang, Ravi P. Ramachandran, “Robust Speaker Recognition”, IEEE Signal Processing Magazine, September 1996.
Douglas O’ Shaughnessy, “Sppech Communication”
Lawrence Rabiner, Biing-Hwang Juang, “Fundamentals Of Speech Recognition”, Prentice Hall, New Jersey, 1993.
Joseph Picone, “Signal Modeling Techniques In Speech Recognition”, Procceding of the IEEE, Jun 3, 1993.
Steve Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, P. Woodland, “The HTK Book (for HTK Version 3.0) ”, Microsoft Corporation, July, 2000.
E. Simancas, M. Nakano Miyatake and H. Perez.Meana, “Speaker Verification Using Pitch and Melspec Information”, To appear in The Journal of Telecommunications and Radio Engineering, 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Simancas-Acevedo, E., Kurematsu, A., Nakano Miyatake, M., Perez-Meana2, H. (2001). Speaker Recognition Using Gaussian Mixtures Models. In: Mira, J., Prieto, A. (eds) Bio-Inspired Applications of Connectionism. IWANN 2001. Lecture Notes in Computer Science, vol 2085. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45723-2_34
Download citation
DOI: https://doi.org/10.1007/3-540-45723-2_34
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42237-2
Online ISBN: 978-3-540-45723-7
eBook Packages: Springer Book Archive