Tree adjoining Grammar (TAG) is a rich formalism for capturing syntax and some limited semantics of Natural languages. The XTAG project has contributed a very comprehensive TAG for English Language. Although TAGs have been proposed nearly 40 years ago by Joshi et al, 1975, their usage and application in the Indian Languages have been very rare, predominantly due to their complexity and lack of resources. In this paper we discuss a new TAG system and methodology of development for Tamil Language that can be extended for other Indian languages. The trees are developed synchronously with a minimalistic grammar obtained by careful pruning of XTAG English Grammar. We also apply Chomskian minimalism on these TAG trees, so as to make them simple and easily parsable. Furthermore we have also developed a parser that can parse simple sentences using the above mentioned grammar, and generating a TAG derivation that can be used for dependency resolution. Due to the synchronous nature of these TAG pairs they can be readily adapted for Formalism based Machine Translation (MT) from English to Tamil and vice versa. © 2015 IEEE.
cited By 0; Conference of International Conference on Advances in Computing, Communications and Informatics, ICACCI 2015 ; Conference Date: 10 August 2015 Through 13 August 2015; Conference Code:115835
Vijay Krishna Menon, S. Rajendran, and Dr. Soman K. P., “A synchronised tree adjoining Grammar for English to Tamil Machine Translation”, in 2015 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2015 (Fourth International Symposium on Natural Language Processing (NLP'15)), SCMS Group of Institutions, Corporate Office CampusPrathap Nagar , Muttom, Aluva, Kochi, Kerala; India, 2015, pp. 1497-1501.