Topic Modeling is a statistical model, which derives the latent theme from large collection of text. In this work we developed a topic model for BBC news corpus to find the screened regional from the corpus. We have implemented the topic modeling algorithms Latent Dirichlet Allocation (LDA), Latent Semantic Analysis (LSA) and three different machine learning approaches (Naive Bayes, K-NN and K-means). We compared the performance of topic modeling algorithms with machine learning approaches using the measures precision and recall. Our results show that topic modeling algorithms work better for corpus with multiple topic distribution.
cited By 0
T. Rajasundari, Subathra P., and Kumar, P. N., “Performance Analysis of Topic Modeling Algorithms for News Articles”, Journal of Advanced Research in Dynamical and Control Systems, vol. 2017, pp. 175-183, 2017.