Back close

English to Tamil Transliteration using WEKA

Publication Type : Journal Article

Publisher : International Journal of Recent Trends in Engineering

Source : International Journal of Recent Trends in Engineering, Volume 1, Issue 1, p.498 (2009)

Campus : Coimbatore

School : School of Engineering

Center : Computational Engineering and Networking

Department : Electronics and Communication

Year : 2009

Abstract : Machine transliteration has gained prime importance as a supporting tool for Machine translation and cross language information retrieval especially when proper names and technical terms are involved. The performance of machine translation and cross-language information retrieval depends extremely on accurate transliteration of named entities. Hence the transliteration model must aim to preserve the phonetic structure of words as closely as possible. In this paper, the transliteration problem is modeled as classification problem and trained using C4.5 decision tree classifier, in WEKA Environment. The training was implemented with features extracted from a parallel corpus. This technique was demonstrated for English to Tamil Transliteration and achieved exact Tamil transliterations for 84.82% of English names. Possible equivalent transliterations were also generated by the model. It is found that the transliteration accuracy is increased when the top five ranked transliterations were considered

Cite this Research Publication : Dr. Soman K. P., MS, V., VP, A., and G, S., “English to Tamil Transliteration using WEKA”, International Journal of Recent Trends in Engineering, vol. 1, no. 1, p. 498, 2009.

Admissions Apply Now