Abstract
A tone codebook mapping method is proposed to obtain a better performance in voice conversion of Mandarin speech than the conventional conversion method which deals mainly with short-time spectral envelopes. The pitch contour of the whole Mandarin syllable is used as a unit type for pitch conversion. The syllable pitch contours are first extracted from the source and target utterances. Time normalization and moving average filtering are then performed on them. These preprocessed pitch contours are classified to generate the source and target tone codebooks, and by associating them, a Mandarin tone mapping codebook is finally obtained in terms of speech alignment. Experiment results show that the proposed method for voice conversion can deliver a satisfactory performance in Mandarin speech.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Moulines, E., Sagisaka, Y.: Voice conversion: state of the art and perspectives. Special Issue of Speech Communication 16(2), 125–126 (1995)
Abe, M., Nakamura, S., Shikano, K., Kuwabara, H.: Voice Conversion through Vector Quantization. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, NY, USA, pp. 655–658 (1988)
Stylianou, Y., Cappe, O., Moulines, E.: Continuous Probabilistic Transform for Voice Conversion. IEEE Transaction on Speech and Audio Processing 6(2), 131–142 (1998)
Türk, O.: New Methods for Voice Conversion (MS thesis). Boğaziçi University, Turkey (2003)
Zhou, T.: Modern Chinese Phonetics. Beijing Normal University Press, Beijing (1990)
Chu, M.: Research on Chinese TTS system with high intelligibility and naturalness (Doctoral thesis). Institute of Acoustic, Chinese Academy of Sciences, Beijing (1995)
Zhu, T., Gao, W.: Data Mining for Learning Mandarin Prosodic Models. Chinese Journal of Computer 23(11), 1179–1183 (2000)
Kain, A., Macon, M.: Spectral Voice Conversion for Text-to-Speech Synthesis. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, Seattle, USA, May 1998, pp. 285–288 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zuo, G., Chen, Y., Ruan, X., Liu, W. (2006). Mandarin Voice Conversion Using Tone Codebook Mapping. In: Yeung, D.S., Liu, ZQ., Wang, XZ., Yan, H. (eds) Advances in Machine Learning and Cybernetics. Lecture Notes in Computer Science(), vol 3930. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11739685_101
Download citation
DOI: https://doi.org/10.1007/11739685_101
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33584-9
Online ISBN: 978-3-540-33585-6
eBook Packages: Computer ScienceComputer Science (R0)