Back close

BUCC2020: Bilingual Dictionary Induction using Cross-lingual Embedding

Publication Type : Conference Paper

Publisher : BUCC@LREC 2020

Source : BUCC@LREC 2020, pp. 65-68

Url : https://aclanthology.org/2020.bucc-1.11/

Campus : Coimbatore

School : School of Engineering

Department : Center for Computational Engineering and Networking (CEN)

Year : 2020

Abstract : This paper presents a deep learning system for the BUCC 2020 shared task: Bilingual dictionary induction from comparable corpora. We have submitted two runs for this shared Task, German (de) and English (en) language pair for “closed track” and Tamil (ta) and English (en) for the “open track”. Our core approach focuses on quantifying the semantics of the language pairs, so that semantics of two different language pairs can be compared or transfer learned. With the advent of word embeddings, it is possible to quantify this. In this paper, we propose a deep learning approach which makes use of the supplied training data, to generate cross-lingual embedding. This is later used for inducting bilingual dictionary from comparable corpora.

Cite this Research Publication : Sanjanasri J.P., Vijay Krishna Menon, K. P. Soman, “BUCC2020: Bilingual Dictionary Induction using Cross-lingual Embedding”, BUCC@LREC 2020, pp. 65-68

Admissions Apply Now