Abstract
The best performing systems in the area of automatic speaker recognition have focused on using short-term, low-level acoustic information, such as cepstral features. Recently, various works have demonstrated that high-level features convey more speaker information and can be added to the low-level features in order to increase the robustness of the system. This paper describes a text-independent speaker recognition system exploiting high-level information provided by ALISP (Automatic Language Independent Speech Processing), a data-driven segmentation. This system, denoted here as ALISP n-gram system, captures the speaker specific information only by analyzing sequences of ALISP units. The ALISP n-gram system was fused with an acoustic ALISP-based Gaussian Mixture Models (GMM) system exploiting the speaker discriminating properties of individual speech classes. The resulting fused system reduced the error rate over the individual systems on the NIST 2004 Speaker Recognition Evaluation data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Reynolds, D., Andrews, W., Campbell, J., Navratil, J., Peskin, B., Adami, A., Jin, Q., Klusacek, D., Abramson, J., Mihaescu, R., Godfrey, J., Jones, J., Xiang, B.: The supersid project: Exploiting high-level information for high-accuracy speaker recognition. In: Proc. ICASSP (2003)
Doddington, G.: Speaker recognition based on idiolectal differences between speakers. Eurospeech 4, 2517–2520 (2001)
Andrews, W., Kohler, M., Campbell, J., Godfrey, J.: Phonetic, idiolectal, and acoustic speaker recognition. In: Speaker Odyssey Workshop (2001)
Chollet, G., Černocký, J., Constantinescu, A., Deligne, S., Bimbot, F.: Towards ALISP: a proposal for Automatic Language Independent Speech Processing. In: Ponting, K. (ed.) NATO ASI: Computational models of speech pattern processing. Springer, Heidelberg (1999)
El-Hannani, A., Petrovska-Delacrétaz, D.: Improving speaker verification system using alisp-based specific GMMs. Submitted to AVBPA (2005)
Haykin, S.: Neural Networks: A Comprehensive Foundation. IEEE Computer Society Press, Los Alamitos (1994)
Kittler, J., Hatef, M., Duin, R., Matas, J.: On combining classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 226–239 (1998)
El-Hannani, A., Petrovska-Delacrétaz, D., Chollet, G.: Linear and non-linear fusion of alisp-based and GMM systems for text-independent speaker verification. In proc. of ODYSSEY 2004, The Speaker and Language Recognition Workshop (2004)
Magrin-Chagnolleau, I., Gravier, G., Blouet, R.: Overview of the 2000-2001 elisa consortium research activities. In: Speaker Odyssey Workshop (2001)
Blouet, R., Mokbel, C., Mokbel, H., Sanchez, E., Chollet, G., Greige, H.: Becars: A free software for speaker verification. In: Proc. Odyssey (2004)
Martin, A., Doddington, G., Kamm, T., Ordowski, M., Przybocki, M.: The det curve in assessment of detection task performance. In: Proc. Eurospeech 1997, vol. 4, pp. 1895–1898 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
El Hannani, A., Petrovska-Delacrétaz, D. (2006). Exploiting High-Level Information Provided by ALISP in Speaker Recognition. In: Faundez-Zanuy, M., Janer, L., Esposito, A., Satue-Villar, A., Roure, J., Espinosa-Duro, V. (eds) Nonlinear Analyses and Algorithms for Speech Processing. NOLISP 2005. Lecture Notes in Computer Science(), vol 3817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11613107_4
Download citation
DOI: https://doi.org/10.1007/11613107_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31257-4
Online ISBN: 978-3-540-32586-4
eBook Packages: Computer ScienceComputer Science (R0)