Tamil Speech Recognition Using XLSR Wav2Vec2.0 & CTC Algorithm

Publication Type : Conference Paper

Publisher : IEEE

Source : In 2022 13th International Conference on Computing Communication and Networking Technologies (ICCCNT) (pp. 1-6). IEEE.

Url : https://ieeexplore.ieee.org/abstract/document/9984422

Campus : Bengaluru

School : School of Engineering

Department : Electronics and Communication

Year : 2022

Abstract : Automatic Speech Recognition (ASR) is a promising research topic with many real-world applications, such as virtual assistants and aids for the physically challenged. Tamil speech recognition can be challenging because the language has many dialects, slang forms and accents. This paper proposes an ASR system based on cross-lingual transfer learning combined with the Connectionist Temporal Classification (CTC) algorithm. The pretrained XLSR Wav2Vec2.0 model from Facebook AI is used. The dataset used in this work is Common Voice Tamil, a crowd-sourced dataset provided by Mozilla. Our system achieves a Word Error Rate of 0.58 and a Character Error Rate of 0.11.
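
As a rough illustration of the terms in the abstract, the sketch below shows how greedy CTC decoding collapses frame-level predictions and how Word Error Rate and Character Error Rate are conventionally computed from edit distance. It is not the authors' code. The function names, the blank index of 0, and the choice to count characters as Unicode code points are assumptions made for this example; for Tamil text, a CER based on grapheme clusters would give different numbers.

```python
from typing import List, Sequence


def ctc_greedy_collapse(frame_ids: Sequence[int], blank: int = 0) -> List[int]:
    """Collapse per-frame label ids: merge consecutive repeats, then drop blanks.

    Assumption: `blank` is the index of the CTC blank token (0 here).
    """
    collapsed: List[int] = []
    previous = None
    for label in frame_ids:
        if label != previous and label != blank:
            collapsed.append(label)
        previous = label
    return collapsed


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance (substitutions, insertions, deletions all cost 1)."""
    prev_row = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr_row = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr_row.append(min(prev_row[j] + 1,          # deletion
                                curr_row[j - 1] + 1,      # insertion
                                prev_row[j - 1] + cost))  # substitution or match
        prev_row = curr_row
    return prev_row[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word Error Rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character Error Rate over Unicode code points (an assumption; see lead-in)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

Under this definition, a WER of 0.58 corresponds to roughly 58 word-level edits per 100 reference words, and a CER of 0.11 to roughly 11 character-level edits per 100 reference characters.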

Cite this Research Publication : Akhilesh, A., Brinda, P., Keerthana, S., Gupta, D., & Vekkot, S. (2022, October). Tamil Speech Recognition Using XLSR Wav2Vec2.0 & CTC Algorithm. In 2022 13th International Conference on Computing Communication and Networking Technologies (ICCCNT) (pp. 1-6). IEEE.
