Regeneration of Speech in Voice-Loss Patients

  • Conference paper
13th International Conference on Biomedical Engineering

Part of the book series: IFMBE Proceedings ((IFMBE,volume 23))

Abstract

This paper considers the regeneration of natural-sounding speech from whisper-speech produced by patients with vocal tract lesions affecting the glottis. Such reconstruction is important for both total and partial laryngectomy patients, because it can improve on the monotonous, robotic sound typical of electrolarynx devices.

Reconstruction of speech from whispers has been demonstrated previously; however, the resulting speech does not exhibit particularly high intelligibility and, more importantly, sounds unnatural. The authors conjecture that limited pitch variation in the reconstructed speech contributes most to this lack of naturalness.

In this paper, a method for pitch contour variation in reconstructed speech is presented. The method extracts voice factors that are important to ‘naturalness’ from the whispered signal and applies them to the reconstructed speech. It builds on our previously published work, which implemented an analysis-by-synthesis approach to voice reconstruction using a modified CELP codec.

Copyright information

© 2009 International Federation of Medical and Biological Engineering

About this paper

Cite this paper

Sharifzadeh, H.R., McLoughlin, I.V., Ahmadi, F. (2009). Regeneration of Speech in Voice-Loss Patients. In: Lim, C.T., Goh, J.C.H. (eds) 13th International Conference on Biomedical Engineering. IFMBE Proceedings, vol 23. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92841-6_262

  • DOI: https://doi.org/10.1007/978-3-540-92841-6_262

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-92840-9

  • Online ISBN: 978-3-540-92841-6

  • eBook Packages: Engineering, Engineering (R0)
