Back close

English to Tamil Transliteration using Sequence Labeling Approach

Publication Type : Journal Article

Publisher : International Conference on Asian Language Processing

Source : International Conference on Asian Language Processing, Thailand (2008)

Url : http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.727.1122&rep=rep1&type=pdf

Campus : Coimbatore

School : School of Engineering

Center : Computational Engineering and Networking

Department : Electronics and Communication

Year : 2008

Abstract : Machine transliteration is an automatic method that converts words/characters in one alphabetical system to corresponding phonetically equivalent words/characters in another alphabetical system. Machine Transliteration has been used extensively to assist machine translation, data mining, information retrieval and more recently in popular web portals, SMS and chat systems. In this paper, we propose a new method where transliteration problem is modeled as a sequence labeling problem and proceed to solve this by using Support Vector Machines (SVM). We have applied this technique for transliterating English to Tamil and achieved exact Tamil transliterations for 80% of English names. We get an accuracy of 88% when we choose from the first five ranked transliterations.

Cite this Research Publication : M. S. Vijaya, Loganathan, R., Shivapratap, G., Ajith, V. P., and Dr. Soman K. P., “English to Tamil Transliteration using Sequence Labeling Approach”, International Conference on Asian Language Processing, Thailand, 2008.

Admissions Apply Now