Back close

Code_Conquerors@DravidianLangTech 2025: Multimodal Misogyny Detection in Dravidian Languages Using Vision Transformer and BERT

Publication Type : Conference Proceedings

Publisher : Association for Computational Linguistics

Source : Proceedings of the Fifth Workshop on Speech, Vision, and Language Technologies for Dravidian Languages

Url : https://doi.org/10.18653/v1/2025.dravidianlangtech-1.49

Campus : Coimbatore

School : School of Artificial Intelligence

Year : 2025

Abstract :

This research focuses on misogyny detection in Dravidian languages using multimodal techniques. It leverages advanced machine learning models, including Vision Transformers (ViT) for image analysis and BERT-based transformers for text processing. The study highlights the challenges of working with regional datasets and addresses these with innovative preprocessing and model training strategies. The evaluation reveals significant improvements in detection accuracy, showcasing the potential of multimodal approaches in combating online abuse in underrepresented languages.

Cite this Research Publication : Pathange Omkareshwara Rao, Harish Vijay V, Ippatapu Venkata Srichandra, Neethu Mohan, Sachin Kumar S, Code_Conquerors@DravidianLangTech 2025: Multimodal Misogyny Detection in Dravidian Languages Using Vision Transformer and BERT, Proceedings of the Fifth Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, Association for Computational Linguistics, 2025, https://doi.org/10.18653/v1/2025.dravidianlangtech-1.49

Admissions Apply Now