Regeneration of Speech in Voice-Loss Patients

Sharifzadeh, H. R.; McLoughlin, I. V.; Ahmadi, F.

doi:10.1007/978-3-540-92841-6_262

H. R. Sharifzadeh³,
I. V. McLoughlin³ &
F. Ahmadi³

Part of the book series: IFMBE Proceedings ((IFMBE,volume 23))

189 Accesses

Abstract

This paper considers regeneration of natural sounding speech from whisper-speech, produced by patients with vocal tract lesions affecting the glottis. Such reconstruction is important for both total and partial laryngectomy patients to improve on the monotonous robotized sound typical of electrolarynx devices.

Reconstruction of speech from whispers has been demonstrated previously, however the resulting speech does not exhibit particularly high intelligibility, and more importantly, sounds un-natural. It is the conjecture of the authors that limited pitch variations in the reconstructed speech contributes most to that lack of naturalness.

In this paper, a method for pitch contour variation in reconstructed speech is presented. This method extracts voice factors which are important to ‘naturalness’ from the whispered signal and applies these to the reconstructed speech. The method is based upon our previous published work which implemented an analysis-by-synthesis approach to voice reconstruction using a modified CELP codec.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Enhancing Voice Quality in Vocal Tract Rehabilitation Device

Speech Driven by Artificial Larynx: Potential Advancement Using Synthetic Pitch Contours

Voice Synthesizer for Partially Paralyzed Patients

References

Vary P, Martin R (2006) Digital speech transmission, John Wiley & Sons Ltd, West Sussex
Book Google Scholar
Pietruch R, Michalska M, Konopka W, Grzanka A (2006) Methods for formant extraction in speech of patients after total laryngectomy, Biomedical Signal Processing and Control, Vol. 1, pp. 107–112
Article Google Scholar
Plack C J, Oxenham A J (2005) Pitch: neural coding and perception, Springer Handbook of Auditory Research, New York
Google Scholar
Morris R W, Clements M A (2002) Reconstruction of speech from whispers, Medical Engineering and Physics, vol. 24, pp. 515–520
Article Google Scholar
Ahmadi F, McLoughlin I V, Sharifzadeh H R (2008) Analysis-bysynthesis method for whisper-speech reconstruction, IEEE Asia Pacific Conference on Circuits and Systems (APCCAS 2008), China
Google Scholar
Atal B S (1982) Predictive coding of speech at low bit rates, IEEE Transaction on Communications, pp. 600–614
Google Scholar
Weitzman R S, Sawashima M, Hirose H (1976) Devoiced and whispered vowels in Japanese, Annual Bulletin, Research Institute of Logopedics and Phoniatrics, vol. 10, pp. 61–79
Google Scholar
Solomon N P, McCall G N, Trosset M W et al. (1989) Laryngeal configuration and constriction during two types of whispering, Journal of Speech and Hearing Research, vol. 32, pp 161–174
Article Google Scholar
Esling J H (1984) Laryngographic study of phonation type and laryngeal configuration, Journal of the International Phonetic Association, vol. 14, pp. 56–73
Article Google Scholar
Tartter V C (1989) What’s in whisper?, Journal of Acoustical Society of America, vol. 86, pp. 1678–1683
Article Google Scholar
Fant G (1960) Acoustic theory of speech production, Mouton & Co, The Hague
Google Scholar
Thomas I B (1969) Perceived pitch of whispered vowels, Journal of the Acoustical Society of America, vol. 46, pp. 468–470
Article Google Scholar
Catford J C (1977) Fundamental problems in phonetics, Edinburgh University Press, Edinburgh
Google Scholar
Stevens H E (2003) The representation of normally-voiced and whispered speech sounds in the temporal aspects of auditory nerve responses, PhD Thesis, University of Illinois
Google Scholar
Lehiste I (1970) Suprasegmentals, MIT Press, Cambridge
Google Scholar
Kallail K J, Emanuel F W (1985) The identifiability of isolated whispered and phonated vowel samples, Journal of Phonetics, vol. 13, pp. 11–17
Article Google Scholar
Klatt D H, Klatt L C (1990) Analysis, synthesis, and perception of voice quality, variations among male and female talkers, Journal of Acoustical Society of America, vol. 87, pp. 820–857
Article Google Scholar
Stevens K N (1998) Acoustic phonetics, The MIT Press, Cambridge
Google Scholar
Goalic A, Saoudi S (1995) An intrinsically reliable and fast algorithm to compute the line spectrum pairs in low bit-rate CELP coding, In proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 728–731
Google Scholar
McLoughlin I V (2007) Line spectral pairs, Signal Processing Journal, pp. 448–467
Google Scholar
McLoughlin I V, Chance R J (1997) LSP-based speech modification for intelligibility enhancement, In proceedings of 13th International Conference on DSP, vol. 2, pp. 591–594
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Engineering, Nanyang Technological University, Singapore
H. R. Sharifzadeh, I. V. McLoughlin & F. Ahmadi

Authors

H. R. Sharifzadeh
View author publications
You can also search for this author in PubMed Google Scholar
I. V. McLoughlin
View author publications
You can also search for this author in PubMed Google Scholar
F. Ahmadi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Division of Bioengineering & Department of Mechanical Engineering Faculty of Engineering, National University of Singapore, 7 Engineering Drive 1 Block E3A #04-15, Singapore, 117574
Chwee Teck Lim
Department of Orthopaedic Surgery, YLL School of Medicine & Division of Bioengineering, Faculty of Engineering & NUS Tissue Engineering Program, Life Sciences Institute, Level 4, DSO (Kent Ridge) Building 27 Medical Drive, Singapore, 117510
James C. H. Goh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sharifzadeh, H.R., McLoughlin, I.V., Ahmadi, F. (2009). Regeneration of Speech in Voice-Loss Patients. In: Lim, C.T., Goh, J.C.H. (eds) 13th International Conference on Biomedical Engineering. IFMBE Proceedings, vol 23. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92841-6_262

Download citation

DOI: https://doi.org/10.1007/978-3-540-92841-6_262
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92840-9
Online ISBN: 978-3-540-92841-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Regeneration of Speech in Voice-Loss Patients

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enhancing Voice Quality in Vocal Tract Rehabilitation Device

Speech Driven by Artificial Larynx: Potential Advancement Using Synthetic Pitch Contours

Voice Synthesizer for Partially Paralyzed Patients

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Regeneration of Speech in Voice-Loss Patients

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enhancing Voice Quality in Vocal Tract Rehabilitation Device

Speech Driven by Artificial Larynx: Potential Advancement Using Synthetic Pitch Contours

Voice Synthesizer for Partially Paralyzed Patients

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation