Back close

Finding the Duplicate Questions in Stack Overflow using Word Embeddings

Publication Type : Conference Paper

Publisher : Procedia Computer Science

Source : Procedia Computer Science 171 (2020): 2729-2733, DOI: 10.1016/j.procs.2020.04.296.

Url : https://www.sciencedirect.com/science/article/pii/S1877050920312898

Campus : Amritapuri

School : School of Computing

Center : Computational Linguistics and Indic Studies

Year : 2020

Abstract : Searching query in the web applications may or may not yield anticipated results, constrained by the questions asked. User may not feel comfortable by seeing a bunch of questions which may not be relevant to his question. Henceforth we device a technique by finding the similar queries or searches to an entered query. For this purpose, Word Embeddings are used on questions asked in Stack Overflow. We use Cosine Similarity to find the similarity between two sentences as every word in the sentence is given Embedding i.e., a vector containing of numbers. We rank the queries based on this similarity. Our goal is to make the best similar question to the query asked to get the best rank.

Cite this Research Publication : Babu, Jameer, and S. Thara. "Finding the Duplicate Questions in Stack Overflow using Word Embeddings." Procedia Computer Science 171 (2020): 2729-2733, DOI: 10.1016/j.procs.2020.04.296.

Admissions Apply Now