Exploring the Significance of Low Frequency Regions in Electroglottographic Signals for Emotion Recognition

Ajay, S. G.; Pravena, D.; Govind, D.; Pradeep, D.

doi:10.1007/978-3-319-67934-1_28

S. G. Ajay²⁰,
D. Pravena²⁰,
D. Govind²⁰ &
…
D. Pradeep²⁰

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 678))

Included in the following conference series:

International Symposium on Signal Processing and Intelligent Recognition Systems

1651 Accesses
1 Citations

Abstract

Electroglottographic (EGG) signals are acquired directly from the glottis. Hence EGG signals effectively represent the excitation source part of the human speech production system. Compared to speech signals, EGG signals are smooth and carry perceptually relevant emotional information. The work presented in this paper includes a sequence of experiments conducted on the emotion recognition system developed by the Gaussian Mixture Modeling (GMM) of perceptually motivated Mel Frequency Cepstral Coefficients (MFCC) features extracted from the EGG. The conclusions drawn from these experiments are two folds. (1) The 13 static MFCC features showed improved emotion recognition performance than 39 MFCC features with dynamic coefficients (by adding $\varDelta $ and $\varDelta $ $\varDelta $). (2) Low frequency regions in the EGG are emphasized by increasing the number of Mel filters for MFCC computation found to improve the performance of emotion recognition for EGG. These experimental results are verified on the EGG data available in the classic German emotional speech database (EmoDb) for four emotions such as (Anger, Happy, Boredom and Fear) apart from Neutral signals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Significance of incorporating excitation source parameters for improved emotion recognition from speech and electroglottographic signals

Article 17 August 2017

Automatic Emotion Recognition from Cochlear Implant-Like Spectrally Reduced Speech

Robust Emotion Recognition using Pitch Synchronous and Sub-syllabic Spectral Features

References

Albornoz, E.M., Milone, D.H., Rufiner, H.L.: Spoken emotion recognition using hierarchical classifiers. Comput. Speech Lang. 25, 556–570 (2011)
Article Google Scholar
Ananthapadmanabha, T.V., Yegnanarayana, B.: Epoch extraction from linear prediction residual for identification of closed glottis interval. IEEE Trans. Acoust. Speech Sig. Process. 27(4), 309–319 (1979)
Article Google Scholar
Burkhardt, F., Paeschke, A., Rolfes, M., Sendlemeier, W., Weiss, B.: A database of German emotional speech. In: Proceedings of INTERSPEECH, pp. 1517–1520 (2005)
Google Scholar
Eyben, F., Wöllmer, M., Schuller, B.: Opensmile: the Munich versatile and fast open-source audio feature extractor, pp. 1459–1462 (2010)
Google Scholar
Govind, D., Prasanna, S.R.M.: Expressive speech synthesis: a review. Int. J. Speech Technol. 16(2), 237–260 (2013)
Article Google Scholar
Henrich, N., DAlessandro, C., Doval, B., Castellengo, M.: On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation. J. Acoust. Soc. Am. 115(3), 1321–32 (2004)
Article Google Scholar
Kandali, A.B., Routray, A., Basu, T.K.: Emotion recognition from Assamese speeches using MFCC features and GMM classifier. In: IEEE Region 10 Conference (2008)
Google Scholar
Kitzing, P.: Clinical applications of electroglottography. J. Voice 4(3), 238–249 (1990)
Article Google Scholar
Koolagudi, S.G., Rao, K.S.: Two stage emotion recognition based on speaking rate. Int. J. Speech Technol. 14, 35–48 (2011)
Article Google Scholar
Koolagudi, S.G., Rao, K.S.: Emotion recognition from speech using source, system, and prosodic features. Int. J. Speech Technol. 15, 265–289 (2012)
Article Google Scholar
Neiberg, D., Elenius, K., Laskowski, K.: Emotion recognition in spontaneous speech using GMMS. In: INTERSPEECH (2006)
Google Scholar
Pati, D., Prasanna, S.R.M.: Processing of linear prediction residual in spectral and cepstral domains for speaker information. Int. J. Speech Technol. 18(3), 333–350 (2015)
Article Google Scholar
Prasanna, S.R.M., Govind, D.: Analysis of excitation source information in emotional speech. In: Proceedings INTERSPEECH, pp. 781–784 (2010)
Google Scholar
Pravena, D., Nandhakumar, S., Govind, D.: Significance of natural elicitation in developing simulated full blown speech emotion databases, pp. 261–265 (2016)
Google Scholar
Raviram, P., Umarani, S.D., Wahidabanu, R.S.D.: Isolated word recognition using enhanced MFCC and IIFS. In: Proceedings of the International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA), vol. 199, pp. 273–283. Springer (2013)
Google Scholar
Vondra, M., Vch, R.: Recognition of emotions in German speech using Gaussian mixture models. Multimodal Sig. 5398, 256–263 (2009)
Google Scholar
Young, S.J., Young, S.: The HTK hidden Markov model toolkit: design and philosophy (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

Centre for Computational Engineering and Networking (CEN), Amrita School of Engineering, Amrita Vishwa Vidyapeetham, Coimbatore, 641112, Tamilnadu, India
S. G. Ajay, D. Pravena, D. Govind & D. Pradeep

Authors

S. G. Ajay
View author publications
You can also search for this author in PubMed Google Scholar
D. Pravena
View author publications
You can also search for this author in PubMed Google Scholar
D. Govind
View author publications
You can also search for this author in PubMed Google Scholar
D. Pradeep
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. G. Ajay .

Editor information

Editors and Affiliations

School of CS/IT, Indian Institute of Information Technology and Management, Trivandrum, Kerala, India
Sabu M. Thampi
Department of Electrical and Computer Engineering, Ryerson University, Toronto, Ontario, Canada
Sri Krishnan
Department of Computer Science, University of Salamanca, Salamanca, Salamanca, Spain
Juan Manuel Corchado Rodriguez
Electronics and Communication Sciences Unit, Indian Statistical Institute, Kolkata, West Bengal, India
Swagatam Das
Department of Systems and Computer Networks, Wroclaw University of Science and Technology, Wroclaw, Poland
Michal Wozniak
Faculty of Engineering and Technology, Liverpool John Moores University, Liverpool, United Kingdom
Dhiya Al-Jumeily

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ajay, S.G., Pravena, D., Govind, D., Pradeep, D. (2018). Exploring the Significance of Low Frequency Regions in Electroglottographic Signals for Emotion Recognition. In: Thampi, S., Krishnan, S., Corchado Rodriguez, J., Das, S., Wozniak, M., Al-Jumeily, D. (eds) Advances in Signal Processing and Intelligent Recognition Systems. SIRS 2017. Advances in Intelligent Systems and Computing, vol 678. Springer, Cham. https://doi.org/10.1007/978-3-319-67934-1_28

Download citation

DOI: https://doi.org/10.1007/978-3-319-67934-1_28
Published: 27 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67933-4
Online ISBN: 978-3-319-67934-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Exploring the Significance of Low Frequency Regions in Electroglottographic Signals for Emotion Recognition

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Significance of incorporating excitation source parameters for improved emotion recognition from speech and electroglottographic signals

Automatic Emotion Recognition from Cochlear Implant-Like Spectrally Reduced Speech

Robust Emotion Recognition using Pitch Synchronous and Sub-syllabic Spectral Features

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Exploring the Significance of Low Frequency Regions in Electroglottographic Signals for Emotion Recognition

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Significance of incorporating excitation source parameters for improved emotion recognition from speech and electroglottographic signals

Automatic Emotion Recognition from Cochlear Implant-Like Spectrally Reduced Speech

Robust Emotion Recognition using Pitch Synchronous and Sub-syllabic Spectral Features

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation