Back close

Creation of GIF dataset and implementation of a speech-to-sign language translator in Telugu

Publication Type : Conference Paper

Publisher : IEEE

Source : In 2022 IEEE North Karnataka Subsection Flagship International Conference (NKCon) (pp. 1-7). IEEE

Url : https://ieeexplore.ieee.org/document/10127067

Campus : Bengaluru

School : School of Engineering

Department : Electronics and Communication

Year : 2022

Abstract : The paper proposes the design and implementation of a model that translates live voice or audio recordings of a native Indian regional language (Telugu) to text and further matches it to sign language animations from the pre-defined Graphics Interchange Format (GIF) dataset. The speech is converted to text using Google API. If the text matches with the words in GIF dataset, it generates the respective GIF. A GUI interface is also developed which displays ISL text alphabetically if the given input is outside the range of GIF dataset. Alongside, the performance of four powerful deep learning networks viz. RNN, LSTM, Bi-LSTM, and GRU, has been compared and trained over a Telugu multi-speaker speech dataset for speech recognition. The proposed models yielded accuracies of 51% for RNN, 68% for LSTM, 90% for Bi-LSTM and 88% for GRU. On evaluating the models, Google API-based speech to sign language conversion has been most promising and consistent with precision and recall scores of 94% each and a word error rate of 4.1%.

Cite this Research Publication : Reddy, B. R., Reddy, D. S. T., Preetham, S., & Vekkot, S. (2022, November). Creation of GIF dataset and implementation of a speech-to-sign language translator in Telugu. In 2022 IEEE North Karnataka Subsection Flagship International Conference (NKCon) (pp. 1-7). IEEE.

Admissions Apply Now