Back close

Deep Learning Based Language Identification System From Speech

Publication Type : Conference Paper

Publisher : IEEE

Source : 2019 International Conference on Intelligent Computing and Control Systems (ICCS) Pages 1094-1097

Url : https://ieeexplore.ieee.org/document/9065370

Campus : Amritapuri

School : School of Engineering

Department : Electronics and Communication

Year : 2019

Abstract : The paper outlines a language identification system from speech with the help of deep learning. In a multilingual country like India, language identification systems have a wide variety of applications. For this, a custom speech database of seven Indian languages from IIIT-Indic has been taken. Preprocessing is done in order to avoid the noise interference and the disturbances in the audio signal. After subsequent framing and windowing, the spectrograms are extracted and fed to a Convolutional Neural Network (CNN). Utterances in languages: Bengali, Hindi, Marathi, Malayalam, Kannada, Tamil and Telugu are utilized in this study. A mixture of speech utterances from binary combination of these languages is used in this analysis. The performance of the system is measured in the form of accuracy, precision, recall, F1 score and confusion matrix. For the data set analyzed, a maximum accuracy of 99.5% is achieved for Bengali vs Kannada and 99% for Kannada vs Malayalam.

Cite this Research Publication : A. N..P and P. S.S., "Deep Learning Based Language Identification System From Speech," 2019 International Conference on Intelligent Computing and Control Systems (ICCS), Madurai, India, 2019, pp. 1094-1097, doi: 10.1109/ICCS45141.2019.9065370.

Admissions Apply Now