Publication Type : Conference Paper
Publisher : CEUR Workshop Proceedings, CEUR-WS.
Source : CEUR Workshop Proceedings, CEUR-WS, Volume 1737, p.201-204 (2016)
Keywords : Classification (of information), Decision trees, Distributional semantics, Document matrices, Factorization, Health science, Information Retrieval, Information retrieval systems, Matrix algebra, Nonnegative matrix factorization, Search engines, Semantics, Text classification, Text processing
Campus : Coimbatore
School : School of Engineering
Center : Computational Engineering and Networking
Department : Electronics and Communication
Year : 2016
Abstract : This paper describes about the our proposed system in the Consumer Health Information Search (CHIS) task. The objective of the task 1 is to classify the sentences in the document into relevant or irrelevant with respect to the query and task 2 is analysing the sentiment of the sentences in the documents with respect to the given query. In this proposed approach distributional representation of text along with its statistical and distance measures are carried over to perform the given tasks as a text classification problem. In our experiment, Non - Negative Matrix Factorization utilized to get the distributed representation of the document as well as queries, distance and correlation measures taken as the features and Random Forest Tree utilized to perform the classification. The proposed approach yields 70.19% in task 1 and 34.64% in task 2 as an average accuracy.
Cite this Research Publication : H. B. Barathi Ganesh, Dr. M. Anand Kumar, and Dr. Soman K. P., “Distributional semantic representation in health care text classification”, in CEUR Workshop Proceedings, 2016, vol. 1737, pp. 201-204.