Back close

A concept based clustering model for document similarity

Publication Type : Conference Paper

Publisher : 2014 International Conference on Data Science Engineering (ICDSE)

Source : 2014 International Conference on Data Science Engineering (ICDSE), 2014, pp. 118-123

Keywords : accuracy, Analytical models, belief network, belief networks, Clustering algorithms, concept based clustering model, concept based Extended DB scan algorithm, concept mining model, DBSCAN algorithm, document handling, Document similarity, Graph model, Graph theory, Nanofluidics, Nanomaterials, pattern clustering, Probabilistic network, probability, Semantics, triplet representation

Campus : Amritapuri

School : Department of Computer Science and Engineering, School of Engineering

Center : Computational Linguistics and Indic Studies

Department : Computer Science

Verified : Yes

Year : 2014

Abstract : A lot of research work has been done in the area of concept mining and document similarity in past few years. But all these works were based on the statistical analysis of keywords. The major challenge in this area involves the preservation of semantics of the terms or phrases. Our paper proposes a graph model to represent the concept in the sentence level. The concept follows a triplet representation. A modified DB scan algorithm is used to cluster the extracted concepts. This cluster forms a belief network or probabilistic network. We use this network for extracting the most probable concepts in the document. In this paper we also proposes a new algorithm for document similarity.

Cite this Research Publication : G. Veena and Lekha, N. K., “A concept based clustering model for document similarity”, in 2014 International Conference on Data Science Engineering (ICDSE), 2014, pp. 118-123

Admissions Apply Now