Back close

Cross-lingual Speech Emotion Recognition for Mental Health Counselling and Aid

Publication Type : Journal Article

Publisher : Elsevier BV

Source : Procedia Computer Science

Url : https://doi.org/10.1016/j.procs.2025.04.375

Keywords : Speech Emotion Recognition, Multi-Lingual, Mental Health, Beagle Bone Black, PCA

Campus : Chennai

School : School of Engineering

Year : 2025

Abstract : Emotion recognition technology, particularly in speech analysis, holds promise for enhancing human-computer interactions and refining mental health assessments. However, current models primarily serve monolingual databases, posing challenges in multilingual contexts. This research addresses this limitation by using underrepresented linguistic datasets—German, Italian, and Urdu—achieving high emotion prediction accuracies of 99.09%, 99.15%, and 98.125%, respectively. Our methodology encompasses three distinct approaches: employing machine learning classifiers directly on Mel Frequency Cepstral Coefficient (MFCC) features from audio files, integrating PCA with MFCC spectrograms for feature reduction, and leveraging pre-trained models for feature extraction from MFCC spectrograms. The impacts of all three approaches are closely observed, and our findings highlight the effectiveness of our approach in discerning emotions across languages, laying the groundwork for more inclusive and culturally sensitive mental health interventions worldwide. Furthermore, a hardware implementation on the Beagle Bone Black board achieves 80% real-time accuracy, showcasing the system’s potential for fast, low-latency emotion recognition in mental health and interactive applications.

Cite this Research Publication : Kanwaljeet Kaur, Aishwarya N, Ganesh Kumar Chellamani, Cross-lingual Speech Emotion Recognition for Mental Health Counselling and Aid, Procedia Computer Science, Elsevier BV, 2025, https://doi.org/10.1016/j.procs.2025.04.375

Admissions Apply Now