Auditory-based wavelet packet filterbank for speech recognition using neural network

Publication Type : Conference Paper

Publisher : Proceedings of the 15th International Conference on Advanced Computing and Communications, ADCOM 2007

Source : Proceedings of the 15th International Conference on Advanced Computing and Communications, ADCOM 2007, Guwahati, p.666-671 (2007)

Url : http://www.scopus.com/inward/record.url?eid=2-s2.0-52249120641&partnerID=40&md5=6f50a8c3435970fb3374c1895321e8a2

ISBN : 9780769530598

Keywords : Artificial intelligence, Auditory features, Auditory systems, Auditory-based, Back-propagation neural network, Backpropagation, Backpropagation algorithms, Crack propagation, Dynamic time warping, Feature extraction, Feature vectors, Filter banks, Hidden Markov models, Hidden-Markov model, image classification, Inner ear, International conferences, Layered neural network, Markov processes, Mel-frequency cepstral coefficient, neural network, Neural networks, Quantitative modeling, Recognition methods, Recognition process, Recognition rates, Recognition systems, Robust speech, Robust speech recognition, Signal processing, Speaker dependent, Speech, Speech analysis, speech processing, Speech recognition, Speech recognition systems, System performances, System's performance, Vectors, Vocal tract models, Wave filters, Wavelet Packet

Campus : Coimbatore

School : School of Engineering

Department : Electronics and Communication

Year : 2007

Abstract : A major problem of most speech recognition systems is their unsatisfactory robustness in noise. Human inner ear based 'feature extraction ' leads to very robust speech understanding in noise. This 'Model of Auditory Periphery' is acting as front-end model of this speech recognition process. This paper describes two quantitative models for signal processing in auditory system (i) Gamma Tone Filter Bank (GTFB) and (U) Wavelet Packet (WP) as frontends for robust speech recognition. The auditory feature vectors had been used to train neural network. The classification of the feature vectors was done by the neural network using Back Propagation (BP) algorithm. The system performance was measured by recognition rate with various signal-tonoise ratios over -10 to 10 dB. The proposed system's performance was compared with various types of front-ends and recognition methods such as auditory features with Hidden Markov Model (HMM) amp; Layered Neural Network (LRNN), auditory features with Mel Frequency Cepstral Coefficient (MFCC) amp; LRNN and vocal tract model: MFCC amp; HMM, Dynamic time warping (DTW). The performances of proposed models with gamma tone filter bank and wavelet packet as front-ends were also compared. It had been identified that proposed system with wavelet packet as front-end and Back Propagation Neural Network (BPNN) as the recognition method is having good recognition rate over -10 to 10 dB. Both speaker independent and speaker dependent recognition systems had been designed, implemented and tested. © 2007 IEEE.

Cite this Research Publication : Gandhiraj R. and P.S. Sathidevi, “Auditory-based wavelet packet filterbank for speech recognition using neural network”, in Proceedings of the 15th International Conference on Advanced Computing and Communications, ADCOM 2007, Guwahati, 2007, pp. 666-671.

About Amrita Vishwa Vidyapeetham

Rankings

Accreditation

Governance

Chancellor

Leadership

Press Media

Newsletters

Amritapuri
Campus

Amaravati
Campus

Bengaluru
Campus

Chennai
Campus

Coimbatore
Campus

Faridabad
Campus

Kochi
Campus

Mysuru
Campus

Nagercoil
Campus

Haridwar
(Proposed Campus)

Research

Centers

Patents

Publication