Publication Type:

Journal Article

Source:

Proceedings of SemEval, pp. 1022–1027 (2016)

URL:

https://pdfs.semanticscholar.org/31c5/905e6eeb0c4e5a78646fb746a3c42fb4ec5b.pdf

Abstract:

<p>The complex word identification task focuses on identifying the words in an English sentence that are difficult for non-native speakers, that is, speakers whose native language is not English. It is a subtask of lexical simplification. We experimented with word embedding features, orthographic word features, similarity features and POS tag features, which improve classification performance. In addition to the SemEval 2016 results, we evaluated on the training data while varying the vector dimension size to find the size that gives the best performance. The SVM learning algorithm attains constant and maximum accuracy with a linear classifier. Our two systems achieve G-scores of 0.43 and 0.54 on identifying complex words.</p>

Cite this Research Publication

S. P. Sanjay, Dr. M. Anand Kumar, and Soman, K. P., “AmritaCEN at SemEval-2016 Task 11: Complex Word Identification using Word Embedding”, Proceedings of SemEval, pp. 1022–1027, 2016.
