Publication Type : Conference Paper
Publisher : ICAITPR
Source : In 2022 First International Conference on Artificial Intelligence Trends and Pattern Recognition (ICAITPR), pp. 1-6. IEEE, 2022.
Url : https://ieeexplore.ieee.org/document/9844177
Campus : Coimbatore
School : School of Engineering
Department : Center for Computational Engineering and Networking (CEN)
Year : 2022
Abstract : Classifying the speech of non-native English speakers is challenging due to various features that distinguish accents. Accents vary by sex, age, formality, social status, geographical area, mother tongue, quality of the voice, phoneme, and prosody. This paper proposes a novel, well-structured database of non-native Indian English speaker accents, referred to as IndicAccentDB. IndicAccentDB contains speech samples from 6 different states to address the unbalanced dataset (gender-bias) and speaker mismatch problems observed in the past. The proposed work also discusses the requirements for creating the IndicAccentDB database and pre-processing tasks performed on the dataset. Furthermore, we experimented with accent classification models, namely 1D-CNN, Support Vector Machines, Random forest, Decision tree, ResNet18, ResNet50, and xResNet18, using MFCC and Mel-Spectrogram features to build the robust Multi-Accent Recognition System (MARS). At last, we evaluated the performance of proposed models on the novel database and compared the results using evaluation metrics like precision, accuracy, F1-score, and recall. Based on our findings, xResNet18 was able to identify the accent classes with significant accuracy.
Cite this Research Publication : Darshana, S., H. Theivaprakasham, G. Jyothish Lal, B. Premjith, V. Sowmya, and Kp Soman. "MARS: A Hybrid Deep CNN-based Multi-Accent Recognition System for English Language." In 2022 First International Conference on Artificial Intelligence Trends and Pattern Recognition (ICAITPR), pp. 1-6. IEEE, 2022.