Publication Type:

Journal Article

Source:

Indian Journal of Science and Technology, Volume 9, Number 45 (2016)

URL:

http://www.indjst.org/index.php/indjst/article/view/106480

Keywords:

Bilingual, English-Malayalam, machine translation, Parallel sentence, Templates

Abstract:

This paper aims in providing an overview about a part of Natural Language Generation – Parallel sentence generation which involves the generation of the English sentence as well as its Malayalam translated version. Methods/Analysis: A template based sentence generator approach is followed here. A system is proposed which takes input from a manually created bilingual dictionary and fills the slots in the template for parallel sentence generation. Finding: Using the proposed method, we have generated a total of 25,208 parallel sentences. This can be used in bilingual Machine Translation dictionary. Application/Improvement: In the proposed case use only four templates but by increasing the number of templates and by updating the dictionary, we can increase the size of the parallel corpus that can be generated.

Cite this Research Publication

S. Sudhakaran, Jose, S., Dr. M. Anand Kumar, and Soman, K. P., “Knowledge based Approach for English-Malayalam Parallel Corpus Generation”, Indian Journal of Science and Technology, vol. 9, 2016.

207
PROGRAMS
OFFERED
6
AMRITA
CAMPUSES
15
CONSTITUENT
SCHOOLS
A
GRADE BY
NAAC, MHRD
8th
RANK(INDIA):
NIRF 2018
150+
INTERNATIONAL
PARTNERS
  • Amrita on Social Media

  • Contact us

    Amrita Vishwa Vidyapeetham,
    Amritanagar,
    Coimbatore - 641 112,
    Tamil Nadu, India.
    • Fax                 : +91 (422) 268 6274
    • Coimbatore   : +91 (422) 268 5000
    • Amritapuri    : +91 (476) 280 1280
    • Bengaluru     : +91 (080) 251 83700
    • Kochi              : +91 (484) 280 1234
    • Mysuru          : +91 (821) 234 3479
    • Chennai         : +91 (44 ) 276 02165
    • Contact Details »