Publication Type:

Conference Paper

Source:

Third International Symposium on Intelligent Informatics (ISI’14), Advances in Intelligent and Soft Computing (Springer) Series, GCET, Greater Noida, India (2015)

URL:

http://link.springer.com/chapter/10.1007%2F978-3-319-11218-3_31

Abstract:

A lot of research work has been done in the area of concept mining and document similarity in past few years. But all these works were based on the statistical analysis of keywords. The major challenge in this area involves the preservation of semantics of the terms or phrases. Our paper proposes a graph model to represent the concept in the sentence level. The concept follows a triplet representation. A modified DB scan algorithm is used to cluster the extracted concepts. This cluster forms a belief network or probabilistic network. We use this network for extracting the most probable concepts in the document. In this paper we also proposes a new algorithm for document similarity. For the belief network comparison an extended chameleon Algorithm is also proposed here.

Cite this Research Publication

G. Veena and Lekha, N. K., “An extended chameleon algorithm for document clustering”, in Third International Symposium on Intelligent Informatics (ISI’14), GCET, Greater Noida, India, 2015.

207
PROGRAMS
OFFERED
5
AMRITA
CAMPUSES
15
CONSTITUENT
SCHOOLS
A
GRADE BY
NAAC, MHRD
9th
RANK(INDIA):
NIRF 2017
150+
INTERNATIONAL
PARTNERS