Publication Type:

Journal Article

Source:

Advances in Intelligent Systems and Computing, Springer Verlag, Volume 384, Kochi; India, p.147-152 (2016)

ISBN:

9783319230351

URL:

https://www.scopus.com/inward/record.uri?eid=2-s2.0-84945959940&partnerID=40&md5=45ce346fa61ec125eec59ba4d53c1ad3

Abstract:

Phone recognizers serve as the preprocessing unit for speech recognition systems and phonetic engines. Even though, most of the state of the art speech recognition achieve relatively better accuracy at the sentence level, the phone level recognition performance falls way below the sentence level performance. The increased recognition rates at the sentence levels are achieved with help of refined language models used for the language under consideration. Therefore, the objective of the present work is to improve the phoneme level accuracy of the hidden markov model(HMM) based acoustic phone models by combining excitation source features with the conventional mel frequency cepstral coefficients (MFCC) for American English. TIMIT and CMU Arctic database, is used for the experiments in the present work. The average spectral energy around the zero-frequency region of each frame is used as the excitation source feature to combine with the 13 MFCC features. The effectiveness of the phoneme recognition is confirmed by a 0.5% increase in the phone recognition accuracy against the state of the art HMM-GMM acoustic models with MFCC features. © Springer International Publishing Switzerland 2016.

Cite this Research Publication

P. M. Hisham, Pravena, D., Pardhu, Y., Gokul, V., Abhitej, B., and D., D. Govind, “Improved phone recognition using excitation source features”, Advances in Intelligent Systems and Computing, vol. 384, pp. 147-152, 2016.

207
PROGRAMS
OFFERED
6
AMRITA
CAMPUSES
15
CONSTITUENT
SCHOOLS
A
GRADE BY
NAAC, MHRD
8th
RANK(INDIA):
NIRF 2018
150+
INTERNATIONAL
PARTNERS