Back close

Hypergraph based clustering for document similarity using FP growth algorithm

Publication Type : Conference Proceedings

Publisher : IEEE

Source : 2019 International Conference on Intelligent Computing and Control Systems (ICCS)

Url : https://ieeexplore.ieee.org/abstract/document/9065630

Campus : Amritapuri

School : School of Computing

Center : Algorithms and Computing Systems

Year : 2019

Abstract : Modelling multiple documents for different applications is a major field of research due to the tremendous growth in Web data. To find the document similarity, we require clustering to determine the grouping of unlabelled data. Graph models have the capability or knowledge of capturing the structural information in texts. It organizes high dimensional data in such a way that the user can effortlessly access the desired information. In this paper, we use a hypergraph with the help of an association rule mining to model a collection of text documents and find similarity between them using a hypergraph partitioning algorithm. Here we use FP-Growth algorithm to find the association relationship which is a recursive elimination scheme. We then uses a spectral clustering algorithm which uses eigenvalues and vectors which is found out from the matrices to find similar documents. Experiment shows that this algorithm gave better clusters compared to others which commonly take higher eigenvectors.

Cite this Research Publication : N. Ramakrishnan, M. Nair J., D. Jayaprakash, H. Ananthakrishnan and S. Rani S., "Hypergraph based clustering for document similarity using FP growth algorithm," 2019 International Conference on Intelligent Computing and Control Systems (ICCS), Madurai, India, 2019

Admissions Apply Now