Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Investigating Language Variability on the Performance of Speaker Verification Systems

  • Conference paper
  • First Online:
Speech and Computer (SPECOM 2018)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11096))

Included in the following conference series:

  • 1500 Accesses

Abstract

In recent years, speaker verification technologies have received an extensive amount of attention. Designing and developing machines that could communicate with humans are believed to be one of the primary motivations behind such developments. Speaker verification technologies are applied to numerous fields such as security, Biometrics, and forensics.

In this paper, the authors study the effects of different languages on the performance of the automatic speaker verification (ASV) system. The MirasVoice speech corpus (MVSC), a bilingual English and Farsi speech corpus, is used in this study. This study collects results from both an I-vector based ASV system and a GMM-UBM based ASV system. The experimental results show that a mismatch between the enrolled data used for training and verification data can lead to a significant decrease in the overall system efficiency. This study shows that it is best to use an i-vector based framework with data from the English language used in the enrollment phase to improve the robustness of the ASV systems. The achieved results in this study indicate that this can narrow the degradation gap caused by the language mismatch.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://github.com/miras-tech/MirasVoice.

References

  1. Bhattacharjee, U., Sarmah, K.: GMM-UBM based speaker verification in multilingual environments. IJCSI Int. J. Comput. Sci. Issues 9(6), 2 (2012)

    Google Scholar 

  2. Bhattacharjee, U., Kshirod S.: A multilingual speech database for speaker recognition. In: 2012 IEEE International Conference on Signal Processing, Computing and Control (ISPCC), pp. 1–5. IEEE (2012)

    Google Scholar 

  3. Pandey, B., Ranjan, A., Kumar, R., Shukla, A.: Multilingual speaker recognition using ANFIS. In: 2nd International Conference on Signal Processing Systems (ICSPS), vol. 3, pp. V3-714–V3-718 (2010)

    Google Scholar 

  4. Deng, L., Droppo, J., Yu, D., Acero, A.: Learning methods in multilingual speech recognition. In: Speech Research Group Microsoft Research Redmond, WA, 98052

    Google Scholar 

  5. Vaheb, A., Choobbasti, A.J., Mortazavi Najafabadi, S.H.E., Safavi, S.: MirasVoice: a bilingual (English-Farsi) speech corpus. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC), Miyazaki, Japan (2018)

    Google Scholar 

  6. Misra, A., Hansen, J.: Spoken language mismatch in speaker verification: an investigation with NIST-SRE and CRSS Bi-ling Corpora. In: IEEE the Spoken Language Technology Workshop (SLT) (2014)

    Google Scholar 

  7. Dehak, N., Dehak, R., Kenny, P., Brummer, N., Ouellet, P., Dumouchel, P.: Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification. In: Proceedings of Interspeech, pp. 1559–1562 (2009)

    Google Scholar 

  8. Garcia-Romero, D., Carol, Y.E.-W.: Analysis of i-vector length normalization in speaker recognition systems. In: Twelfth Annual Conference of the International Speech Communication Association (2011)

    Google Scholar 

  9. Safavi, S.: Speaker characterization using adult and children’s speech. Ph.D. dissertation, University of Birmingham (2015)

    Google Scholar 

  10. Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)

    Article  Google Scholar 

  11. Kleynhans, N.T., Barnard, E.: Language dependence in multilingual speaker verification. In: PRASA (2005)

    Google Scholar 

  12. Safavi, S., Hanani, A., Russell, M., Jancovic, P., Carey, M.: Contrasting the effects of different frequency bands on speaker and accent identification. IEEE Sig. Process. Lett. 19(12), 829–832 (2012)

    Article  Google Scholar 

  13. Kenny, P., Ouellet, Q., Dehak, N., Gupta, V., Dumouchel, P.: A study of interspeaker variability in speaker verification. IEEE Trans. Audio Speech Lang. Process. 16(5), 980–988 (2008)

    Article  Google Scholar 

  14. Safavi, S., Russell, M., Jancovic, P.: Identification of age-group from children’s speech by computers and humans. In: INTERSPEECH, pp. 243–247 (2014)

    Google Scholar 

  15. Auckenthaler, R., Carey, M.J., Mason, J.S.D.: Language dependency in text-independent speaker verification. In: ICASSP 2001, May 2001

    Google Scholar 

  16. Wu, Z., Evans, N., Kinnunen, T., Yamagishi, J., Alegre, F., Lia, H.: Spoofing and countermeasures for speaker verification: a survey. Speech Commun. 66, 130–153 (2015)

    Article  Google Scholar 

  17. Qing, X., Chen, K.: On use of GMM for multilingual speaker verification: an empirical study. In: Proceedings of ISCSLP, pp. 263–266 (2000)

    Google Scholar 

  18. Harald, H., Draxler, C., van den Heuvel, H., Johansen, F.T., Sanders, E.P., Tropf, H.S.: Speechdat multilingual speech databases for teleservices: across the finish line (1999)

    Google Scholar 

  19. Stuker, S., Schultz, T., Metze, F., Waibel, A.: Multilingual articulatory features: acoustics, speech, and signal processing. In: Proceedings of IEEE International Conference (ICASSP 2003), no. 7 (2003)

    Google Scholar 

  20. Safavi, S., Russell, M., Jancovic, P.: Automatic speaker, age-group and gender identification from children’s speech. Comput. Speech Lang. 50, 141–156 (2018)

    Article  Google Scholar 

  21. Campell Jr., J.P.: Speaker recognition: a tutorial. Proc. IEEE 85, 1437–1462 (1997)

    Article  Google Scholar 

  22. Safavi, S., Meng, L.: Comparison of two scoring method within i-vector framework for speaker recognition from children’s speech. In: ICMI Workshop on Child Computer Interaction (WOCCI), Glasgow, Scotland, November 2017

    Google Scholar 

Download references

Acknowledgements

The authors of this study would like to thank all the volunteers and especially the people at Miras Technologies international Miras Technologies International, http://miras-tech.com/ for all their efforts to help gather the MirasVoice speech corpus [5]. This work was partially supported by the H2020 Project entitled “AudioCommons” funded by the European Commission with Grand Agreement number 688382.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Amir Vaheb .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Vaheb, A., Choobbasti, A.J., Najafabadi, S.H.E.M., Safavi, S. (2018). Investigating Language Variability on the Performance of Speaker Verification Systems. In: Karpov, A., Jokisch, O., Potapova, R. (eds) Speech and Computer. SPECOM 2018. Lecture Notes in Computer Science(), vol 11096. Springer, Cham. https://doi.org/10.1007/978-3-319-99579-3_73

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-99579-3_73

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-99578-6

  • Online ISBN: 978-3-319-99579-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics