<p>Dravidian languages are highly agglutinative and morphologically rich. Language processing for these languages requires more annotated data than for European or Indo-European languages. In this paper we present a comparison of Statistical Machine Translation (SMT) models built with linguistic and non-linguistic data for English-to-Kannada translation. The experiments show an improvement in BLEU score for the factored MT system over the baseline MT system for English-to-Kannada SMT. A Kannada font can take ten different forms in representing a word, and any change of font variant within a word changes the meaning of the word. We model Kannada lemma words, their morphological variants, and PoS tags as factors in our MT system.</p>
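As a rough illustration of what treating lemma and PoS as factors can look like, the sketch below serializes tokens into a pipe-delimited `surface|lemma|pos` form, the convention used by factored models in toolkits such as Moses. The abstract does not name the toolkit or the exact factor layout, so the separator, the factor order, the `FactoredToken` and `to_factored` names, and the English sample tokens are all assumptions made for illustration.

```python
from typing import Iterable, NamedTuple


class FactoredToken(NamedTuple):
    """One token with its factors (hypothetical layout: surface, lemma, PoS)."""
    surface: str
    lemma: str
    pos: str


def to_factored(tokens: Iterable[FactoredToken], sep: str = "|") -> str:
    """Join each token's factors with `sep`, and tokens with single spaces."""
    return " ".join(sep.join(token) for token in tokens)


# Illustrative English tokens only; not taken from the paper's corpus.
sentence = [
    FactoredToken("houses", "house", "NNS"),
    FactoredToken("are", "be", "VBP"),
    FactoredToken("big", "big", "JJ"),
]

print(to_factored(sentence))
```

A baseline system would see only the surface forms, while a factored system can also condition on the lemma and PoS columns, which is one way to share statistics across the many inflected forms of a single lemma.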
Cited by 0; Conference: 2016 International Conference on Inventive Computation Technologies, ICICT 2016; Conference Date: 26 August 2016 through 27 August 2016; Conference Code: 125861
K. Ma Shivakumar, Shivaraju, Na, Sreekanta, Vb, and Dr. Deepa Gupta, “Comparative study of factored SMT with baseline SMT For English to Kannada”, in Proceedings of the International Conference on Inventive Computation Technologies, ICICT 2016, 2017, vol. 1.