Tamil Speech Recognition Using XLSR Wav2Vec2. 0 & CTC Algorithm

Publication Type : Conference Paper

Publisher : IEEE

Source : In 2022 13th International Conference on Computing Communication and Networking Technologies (ICCCNT) (pp. 1-6). IEEE.

Url : https://ieeexplore.ieee.org/abstract/document/9984422

Campus : Bengaluru

School : School of Engineering

Department : Electronics and Communication

Year : 2022

Abstract : Automatic Speech Recognition is a promising research topic with lots of real-world applications like virtual assistants, aids for physically challenged etc. Tamil language speech recognition could be potentially challenging due to the fact that there are many possible dialects, slangs and accents. This paper proposes an ASR system based on cross-lingual transfer learning in combination with CTC algorithm. The pretrained model from Facebook AI viz. XLSR Wav2Vec2.0 is used. The dataset used in this work is Common Voice Tamil, which is a crowd-sourced dataset provided by Mozilla. Our system achieves a Word Error Rate of 0.58 and Character Error Rate of 0.11.

Cite this Research Publication : Akhilesh, A., Brinda, P., Keerthana, S., Gupta, D., & Vekkot, S. (2022, October). Tamil Speech Recognition Using XLSR Wav2Vec2. 0 & CTC Algorithm. In 2022 13th International Conference on Computing Communication and Networking Technologies (ICCCNT) (pp. 1-6). IEEE.

About Amrita Vishwa Vidyapeetham

Rankings

Accreditation

Governance

Chancellor

Leadership

Press Media

Newsletters

Amritapuri
Campus

Amaravati
Campus

Bengaluru
Campus

Chennai
Campus

Coimbatore
Campus

Faridabad
Campus

Kochi
Campus

Mysuru
Campus

Nagercoil
Campus

Haridwar
(Proposed Campus)

Research

Centers

Patents

Publication