Publication Type:

Book Chapter


A Machine Learning Approach to Cluster the users of Stack Overflow Forum, Springer India, Volume 325, Number Advances in Intelligent Systems and Computing , New Delhi, p.411–418 (2015)





Online question and answer (Q&A) forums are emerging as excellent learning platforms for learners with varied interests. In this paper, we present our results on the clustering of Stack Overflow users into four clusters, namely naive users, surpassing users, experts, and outshiners. This clustering is based on various metrics available on the forum. We use the X-means and expectation maximization clustering algorithms and compare the results. The results have been validated using internal, external, and relative validation techniques. The objective of this clustering is to be able to trace and predict the activity of a user on this forum. According to our results, majority of users (71 % of 40,000 users under consideration) fall in the ‘experts’ category. This indicates that the users in Stack Overflow are of high quality thereby making the forum an excellent platform for users to learn about computer programming.

Cite this Research Publication

J. Anusha, V. Rekha, S., and P. Bhagavathi Sivakumar, “Artificial Intelligence and Evolutionary Algorithms in Engineering Systems”, in A Machine Learning Approach to Cluster the users of Stack Overflow Forum, vol. 325, P. L. Suresh, Dash, S. Subhransu, and Panigrahi, K. Bijaya New Delhi: Springer India, 2015, pp. 411–418.