Publication Type:

Conference Paper

Source:

Signal Processing and Communications (SPCOM), 2012 International Conference on (2012)

Keywords:

average pitch period, Databases, EGG recordings, electroglotto graph recordings, emotional speech signals, epoch estimation performance, epoch extraction, Estimation, fixed window length, Frequency estimation, German emotional speech corpus, Hindi emotional speech database, low pass filtering, low-pass filters, Market research, modified zero frequency filtering method, modified ZFF method, Resonant frequency, signal segments, Speech, speech fixed blocks, speech processing, zero frequency resonator, ZFR

Abstract:

This work proposes a modified zero frequency filtering (ZFF) method for epoch extraction from emotional speech. Epochs refers the instants of maximum excitation of the vocal tract. In the conventional ZFF method, the epochs are estimated by trend removing the output of the zero frequency resonator (ZFR) using the window length equal to the average pitch period of the utterance. Use of this fixed window length for the epoch estimation causes spurious or missed estimation from the speech signals having rapid pitch variations like in emotional speech. This work therefore proposes a refined ZFF method for epoch estimation by trend removing the output of ZFR using the variable windows obtained by finding the average pitch periods for every fixed blocks of speech and low pass filtering the resulting trend removed signal segments using the estimated pitch as the cutoff frequency. The epoch estimation performance is evaluated for five different emotions in the German emotional speech corpus having simultaneous electro-glotto graph (EGG) recordings. The improved epoch estimation performance indicates the robustness of the proposed method against rapid pitch variations in emotional speech signals. The effectiveness of the proposed method is also confirmed by the improved epoch estimation performance on the Hindi emotional speech database.

Cite this Research Publication

D. Govind and Prasanna, S. R. M., “Epoch extraction from emotional speech”, in Signal Processing and Communications (SPCOM), 2012 International Conference on, 2012.