Abstract
This paper considers regeneration of natural sounding speech from whisper-speech, produced by patients with vocal tract lesions affecting the glottis. Such reconstruction is important for both total and partial laryngectomy patients to improve on the monotonous robotized sound typical of electrolarynx devices.
Reconstruction of speech from whispers has been demonstrated previously, however the resulting speech does not exhibit particularly high intelligibility, and more importantly, sounds un-natural. It is the conjecture of the authors that limited pitch variations in the reconstructed speech contributes most to that lack of naturalness.
In this paper, a method for pitch contour variation in reconstructed speech is presented. This method extracts voice factors which are important to ‘naturalness’ from the whispered signal and applies these to the reconstructed speech. The method is based upon our previous published work which implemented an analysis-by-synthesis approach to voice reconstruction using a modified CELP codec.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Vary P, Martin R (2006) Digital speech transmission, John Wiley & Sons Ltd, West Sussex
Pietruch R, Michalska M, Konopka W, Grzanka A (2006) Methods for formant extraction in speech of patients after total laryngectomy, Biomedical Signal Processing and Control, Vol. 1, pp. 107–112
Plack C J, Oxenham A J (2005) Pitch: neural coding and perception, Springer Handbook of Auditory Research, New York
Morris R W, Clements M A (2002) Reconstruction of speech from whispers, Medical Engineering and Physics, vol. 24, pp. 515–520
Ahmadi F, McLoughlin I V, Sharifzadeh H R (2008) Analysis-bysynthesis method for whisper-speech reconstruction, IEEE Asia Pacific Conference on Circuits and Systems (APCCAS 2008), China
Atal B S (1982) Predictive coding of speech at low bit rates, IEEE Transaction on Communications, pp. 600–614
Weitzman R S, Sawashima M, Hirose H (1976) Devoiced and whispered vowels in Japanese, Annual Bulletin, Research Institute of Logopedics and Phoniatrics, vol. 10, pp. 61–79
Solomon N P, McCall G N, Trosset M W et al. (1989) Laryngeal configuration and constriction during two types of whispering, Journal of Speech and Hearing Research, vol. 32, pp 161–174
Esling J H (1984) Laryngographic study of phonation type and laryngeal configuration, Journal of the International Phonetic Association, vol. 14, pp. 56–73
Tartter V C (1989) What’s in whisper?, Journal of Acoustical Society of America, vol. 86, pp. 1678–1683
Fant G (1960) Acoustic theory of speech production, Mouton & Co, The Hague
Thomas I B (1969) Perceived pitch of whispered vowels, Journal of the Acoustical Society of America, vol. 46, pp. 468–470
Catford J C (1977) Fundamental problems in phonetics, Edinburgh University Press, Edinburgh
Stevens H E (2003) The representation of normally-voiced and whispered speech sounds in the temporal aspects of auditory nerve responses, PhD Thesis, University of Illinois
Lehiste I (1970) Suprasegmentals, MIT Press, Cambridge
Kallail K J, Emanuel F W (1985) The identifiability of isolated whispered and phonated vowel samples, Journal of Phonetics, vol. 13, pp. 11–17
Klatt D H, Klatt L C (1990) Analysis, synthesis, and perception of voice quality, variations among male and female talkers, Journal of Acoustical Society of America, vol. 87, pp. 820–857
Stevens K N (1998) Acoustic phonetics, The MIT Press, Cambridge
Goalic A, Saoudi S (1995) An intrinsically reliable and fast algorithm to compute the line spectrum pairs in low bit-rate CELP coding, In proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 728–731
McLoughlin I V (2007) Line spectral pairs, Signal Processing Journal, pp. 448–467
McLoughlin I V, Chance R J (1997) LSP-based speech modification for intelligibility enhancement, In proceedings of 13th International Conference on DSP, vol. 2, pp. 591–594
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 International Federation of Medical and Biological Engineering
About this paper
Cite this paper
Sharifzadeh, H.R., McLoughlin, I.V., Ahmadi, F. (2009). Regeneration of Speech in Voice-Loss Patients. In: Lim, C.T., Goh, J.C.H. (eds) 13th International Conference on Biomedical Engineering. IFMBE Proceedings, vol 23. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92841-6_262
Download citation
DOI: https://doi.org/10.1007/978-3-540-92841-6_262
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92840-9
Online ISBN: 978-3-540-92841-6
eBook Packages: EngineeringEngineering (R0)