Publication Type : Conference Paper
Publisher : Springer Nature Singapore
Source : Smart Innovation, Systems and Technologies
Url : https://doi.org/10.1007/978-981-19-8669-7_16
Campus : Coimbatore
School : School of Computing
Department : Computer Science and Engineering
Year : 2023
Abstract :
India is one of the countries with an increasing deaf population. The needs and problems of deaf communities are not adequately addressed. The majority of the previous approaches on Indian Sign Language (ISL) Recognition are based on convolutional neural networks (CNNs), whose performance depends on factors like lighting and the clothing of the sign language performer. The facial key points along with arm and body key points play a significant role in interpreting the ISL. The proposed approach identifies the facial key points and other body key points using the OpenPose model to generate pose video, which removes information unfavorable to CNN networks. The generated pose video is used by the pre-trained CNN model to extract spatial features and recurrent neural networks (RNNs) to extract temporal features necessary for ISL interpretation. We have used pre-trained models like Inception V3, DenseNet121 and MobileNetV2 to extract spatial features along with GRU, LSTM and bidirectional LSTM networks to extract the temporal features. Our model using the pre-trained MobileNetV2 model combined with GRU gave an accuracy of 99%. We also extended our approach using a combination of pose videos with background information.
Cite this Research Publication : Athul Mathew Konoor, S. Padmavathi, A Hybrid Approach on Lexical Indian Sign Language Recognition, Smart Innovation, Systems and Technologies, Springer Nature Singapore, 2023, https://doi.org/10.1007/978-981-19-8669-7_16