Back close

Hierarchal POS tagging for Tamil language using Machine learning approach

Publication Type : Journal Article

Source : 2011

Url : http://www.ldcil.org/Download/POSANIL2011/9Hierarchal%20POS%20tagging%20for%20Tamil%20language%20using%20Machine%20learning%20approach.pdf

Campus : Coimbatore

School : Computational Engineering and Networking

Center : Computational Engineering and Networking

Department : Center for Computational Engineering and Networking (CEN)

Year : 2011

Abstract : This paper presents the intricacies involved in developing a hierarchal POS tagger generator using SVMTool for Tamil language. Tamil, a Dravidian language has a very rich morphological structure which is agglutinative. Tamil words are made up of lexical roots followed by one or more affixes, mostly suffixes. So tagging a word in a language like Tamil is very complex. We try to resolve this complexity by identifying the categorical ambiguities and developing three hierarchaltag sets at word grammatical category and grammatical feature level. These tag sets were used to annotate the corpora and trained using the SVMTool (An Open source tool available at http://www.lsi.upc.es/~nlp/SVMTool ) to generate the POS tagger model. The results obtained in each level were encouraging.

Cite this Research Publication : V. Dhanalakshmi and M Kumar, A., “Hierarchal POS tagging for Tamil language using Machine learning approach”, 2011.

Admissions Apply Now