Abstract
In recent years, speaker verification technologies have received an extensive amount of attention. Designing and developing machines that could communicate with humans are believed to be one of the primary motivations behind such developments. Speaker verification technologies are applied to numerous fields such as security, Biometrics, and forensics.
In this paper, the authors study the effects of different languages on the performance of the automatic speaker verification (ASV) system. The MirasVoice speech corpus (MVSC), a bilingual English and Farsi speech corpus, is used in this study. This study collects results from both an I-vector based ASV system and a GMM-UBM based ASV system. The experimental results show that a mismatch between the enrolled data used for training and verification data can lead to a significant decrease in the overall system efficiency. This study shows that it is best to use an i-vector based framework with data from the English language used in the enrollment phase to improve the robustness of the ASV systems. The achieved results in this study indicate that this can narrow the degradation gap caused by the language mismatch.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bhattacharjee, U., Sarmah, K.: GMM-UBM based speaker verification in multilingual environments. IJCSI Int. J. Comput. Sci. Issues 9(6), 2 (2012)
Bhattacharjee, U., Kshirod S.: A multilingual speech database for speaker recognition. In: 2012 IEEE International Conference on Signal Processing, Computing and Control (ISPCC), pp. 1–5. IEEE (2012)
Pandey, B., Ranjan, A., Kumar, R., Shukla, A.: Multilingual speaker recognition using ANFIS. In: 2nd International Conference on Signal Processing Systems (ICSPS), vol. 3, pp. V3-714–V3-718 (2010)
Deng, L., Droppo, J., Yu, D., Acero, A.: Learning methods in multilingual speech recognition. In: Speech Research Group Microsoft Research Redmond, WA, 98052
Vaheb, A., Choobbasti, A.J., Mortazavi Najafabadi, S.H.E., Safavi, S.: MirasVoice: a bilingual (English-Farsi) speech corpus. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC), Miyazaki, Japan (2018)
Misra, A., Hansen, J.: Spoken language mismatch in speaker verification: an investigation with NIST-SRE and CRSS Bi-ling Corpora. In: IEEE the Spoken Language Technology Workshop (SLT) (2014)
Dehak, N., Dehak, R., Kenny, P., Brummer, N., Ouellet, P., Dumouchel, P.: Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification. In: Proceedings of Interspeech, pp. 1559–1562 (2009)
Garcia-Romero, D., Carol, Y.E.-W.: Analysis of i-vector length normalization in speaker recognition systems. In: Twelfth Annual Conference of the International Speech Communication Association (2011)
Safavi, S.: Speaker characterization using adult and children’s speech. Ph.D. dissertation, University of Birmingham (2015)
Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)
Kleynhans, N.T., Barnard, E.: Language dependence in multilingual speaker verification. In: PRASA (2005)
Safavi, S., Hanani, A., Russell, M., Jancovic, P., Carey, M.: Contrasting the effects of different frequency bands on speaker and accent identification. IEEE Sig. Process. Lett. 19(12), 829–832 (2012)
Kenny, P., Ouellet, Q., Dehak, N., Gupta, V., Dumouchel, P.: A study of interspeaker variability in speaker verification. IEEE Trans. Audio Speech Lang. Process. 16(5), 980–988 (2008)
Safavi, S., Russell, M., Jancovic, P.: Identification of age-group from children’s speech by computers and humans. In: INTERSPEECH, pp. 243–247 (2014)
Auckenthaler, R., Carey, M.J., Mason, J.S.D.: Language dependency in text-independent speaker verification. In: ICASSP 2001, May 2001
Wu, Z., Evans, N., Kinnunen, T., Yamagishi, J., Alegre, F., Lia, H.: Spoofing and countermeasures for speaker verification: a survey. Speech Commun. 66, 130–153 (2015)
Qing, X., Chen, K.: On use of GMM for multilingual speaker verification: an empirical study. In: Proceedings of ISCSLP, pp. 263–266 (2000)
Harald, H., Draxler, C., van den Heuvel, H., Johansen, F.T., Sanders, E.P., Tropf, H.S.: Speechdat multilingual speech databases for teleservices: across the finish line (1999)
Stuker, S., Schultz, T., Metze, F., Waibel, A.: Multilingual articulatory features: acoustics, speech, and signal processing. In: Proceedings of IEEE International Conference (ICASSP 2003), no. 7 (2003)
Safavi, S., Russell, M., Jancovic, P.: Automatic speaker, age-group and gender identification from children’s speech. Comput. Speech Lang. 50, 141–156 (2018)
Campell Jr., J.P.: Speaker recognition: a tutorial. Proc. IEEE 85, 1437–1462 (1997)
Safavi, S., Meng, L.: Comparison of two scoring method within i-vector framework for speaker recognition from children’s speech. In: ICMI Workshop on Child Computer Interaction (WOCCI), Glasgow, Scotland, November 2017
Acknowledgements
The authors of this study would like to thank all the volunteers and especially the people at Miras Technologies international Miras Technologies International, http://miras-tech.com/ for all their efforts to help gather the MirasVoice speech corpus [5]. This work was partially supported by the H2020 Project entitled “AudioCommons” funded by the European Commission with Grand Agreement number 688382.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Vaheb, A., Choobbasti, A.J., Najafabadi, S.H.E.M., Safavi, S. (2018). Investigating Language Variability on the Performance of Speaker Verification Systems. In: Karpov, A., Jokisch, O., Potapova, R. (eds) Speech and Computer. SPECOM 2018. Lecture Notes in Computer Science(), vol 11096. Springer, Cham. https://doi.org/10.1007/978-3-319-99579-3_73
Download citation
DOI: https://doi.org/10.1007/978-3-319-99579-3_73
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99578-6
Online ISBN: 978-3-319-99579-3
eBook Packages: Computer ScienceComputer Science (R0)