Publication Type:

Journal Article

Source:

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Springer Verlag, Volume 10458 LNAI, p.777-787 (2017)

ISBN:

9783319664286

URL:

https://www.scopus.com/inward/record.uri?eid=2-s2.0-85029521841&doi=10.1007%2f978-3-319-66429-3_78&partnerID=40&md5=8c4a3584ee3abc6c8e3c320700f3d4ce

Abstract:

The paper deals with speech emotion conversion using Waveform Similarity Overlap Add (WSOLA) and subsequent linear prediction analysis for spectral transformation. Duration modification is done by taking the ratio between segment durations of neutral and target speech. After performing modification using WSOLA, the duration modified source speech is time aligned with target and further subjected to linear prediction analysis to yield the LP coefficients. The target emotion is re-synthesised by using the prosody manipulated residual and LPCs from source. The waveform similarity property of WSOLA is exploited to give output with minimal distortion. The proposed algorithm is subjectively and objectively evaluated along with popular TD-PSOLA algorithm. The correlation between synthesised and real target shows an average improvement of 60% across all emotions with the proposed technique. © Springer International Publishing AG 2017.

Notes:

cited By 0; Conference of 19th International Conference on Speech and Computer, SPECOM 2017 ; Conference Date: 12 September 2017 Through 16 September 2017; Conference Code:197479

Cite this Research Publication

S. Vekkot and Tripathi, S., “Vocal emotion conversion using wsola and linear prediction”, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10458 LNAI, pp. 777-787, 2017.

207
PROGRAMS
OFFERED
6
AMRITA
CAMPUSES
15
CONSTITUENT
SCHOOLS
A
GRADE BY
NAAC, MHRD
8th
RANK(INDIA):
NIRF 2018
150+
INTERNATIONAL
PARTNERS