Abstract
In this paper, we introduce a speech database for analyzing the emotions present in speech signals. The database is recorded in Telugu by professional artists from All India Radio (AIR), Vijayawada, India. The speech corpus is collected by simulating eight different emotions using neutral (emotion-free) statements. The database is named the Indian Institute of Technology Kharagpur Simulated Emotion Speech Corpus (IITKGP-SESC). It is intended to be useful for characterizing the emotions present in speech. Further, the emotion-specific knowledge present in speech at different levels can be acquired by developing emotion-specific models using features from the vocal tract system, the excitation source, and prosody. This paper describes the design, acquisition, post-processing, and evaluation of IITKGP-SESC. The quality of the emotions in the database is evaluated using subjective listening tests. Finally, statistical models are developed using prosodic features, and these models are used to classify the emotions and so assess how well they can be discriminated.
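The abstract does not say which statistical models or which prosodic features are used, so the following is only a minimal sketch of the general idea of classifying emotions from prosodic features. It assumes one diagonal-covariance Gaussian per emotion and a three-value feature vector (mean pitch, utterance duration, mean energy). The labels `emotion_a` and `emotion_b`, the function names, and all numeric values are hypothetical placeholders, not data from IITKGP-SESC.

```python
import math
from collections import defaultdict


def fit_gaussians(samples):
    """Fit one diagonal Gaussian per label.

    samples: list of (label, feature_vector) pairs.
    Returns {label: (means, variances)}.
    """
    grouped = defaultdict(list)
    for label, vec in samples:
        grouped[label].append(vec)

    models = {}
    for label, vecs in grouped.items():
        n = len(vecs)
        dims = len(vecs[0])
        means = [sum(v[d] for v in vecs) / n for d in range(dims)]
        # A small variance floor avoids division by zero for constant features.
        variances = [
            max(sum((v[d] - means[d]) ** 2 for v in vecs) / n, 1e-6)
            for d in range(dims)
        ]
        models[label] = (means, variances)
    return models


def log_likelihood(vec, model):
    """Log-density of vec under a diagonal Gaussian (means, variances)."""
    means, variances = model
    return sum(
        -0.5 * (math.log(2 * math.pi * var) + (x - mu) ** 2 / var)
        for x, mu, var in zip(vec, means, variances)
    )


def classify(vec, models):
    """Return the label whose model gives vec the highest log-likelihood."""
    return max(models, key=lambda label: log_likelihood(vec, models[label]))


# Hypothetical training data: [mean pitch (Hz), duration (s), mean energy].
train = [
    ("emotion_a", [260.0, 1.6, 0.80]),
    ("emotion_a", [250.0, 1.7, 0.75]),
    ("emotion_a", [270.0, 1.5, 0.85]),
    ("emotion_b", [180.0, 2.4, 0.30]),
    ("emotion_b", [175.0, 2.5, 0.35]),
    ("emotion_b", [185.0, 2.3, 0.25]),
]

models = fit_gaussians(train)
predicted = classify([255.0, 1.65, 0.78], models)
print(predicted)
```

In practice the feature vectors would come from a pitch and energy extraction stage, and the per-emotion model could be replaced by any other classifier without changing the overall train-then-score structure.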
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Koolagudi, S.G., Maity, S., Kumar, V.A., Chakrabarti, S., Rao, K.S. (2009). IITKGP-SESC: Speech Database for Emotion Analysis. In: Ranka, S., et al. Contemporary Computing. IC3 2009. Communications in Computer and Information Science, vol 40. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03547-0_46
DOI: https://doi.org/10.1007/978-3-642-03547-0_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03546-3
Online ISBN: 978-3-642-03547-0
eBook Packages: Computer Science (R0)