Back close

Significance of exploring pitch only features for the recognition of spontaneous emotions from speech signals

Publication Type : Conference Paper

Publisher : 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Campus : Coimbatore

School : School of Engineering

Center : Computational Engineering and Networking

Verified : No

Year : 2017

Abstract : The emotional database can be classified as spontaneous and simulated emotions. Spontaneous emotions can be identified based on the two parameters 1) Arousal and 2) Valence values represented in a two dimensional plane. Arousal measures how calming or exciting the information is, whereas valence measures postive or negative affectivity of information. The objective of the paper is to predict the arousal and valence values from the speech signal which in turn provides the spontaneous emotion information. This paper also provides the significance of using pitch contours obtained from the speech signals having the spontaneous emotion information and is used as features for predicting the arousal and valence values using Deep learning based LSTM models. During testing pitch contours are extracted from the speech signals and used as the features for predicting the arousal and valence values so as to predict the emotions. In this paper a spontaneous database, REmote COLlaborative Affective interaction (RECOLA) database is used. The arousal and valence values predicted in this work has low RMSE. The effectiveness of using pitch to predict 2D values of emotional wheel is also compared on the full blown simulated German emotional database (German EmoDb) and better results were obtained on simulated database also.

Admissions Apply Now