Golden Retriever: Question Retrieval System

Duplicate questions get posted on online Q&A forums because users may not be aware that similar questions already exist. Our proposed system, Golden Retriever, recommends existing questions that are semantically related to an incoming question. It combines normalized TF-IDF, relevance heuristics, and semantic relatedness. On the ICHI Healthcare Data Analytics Challenge dataset, this approach shows good results compared with existing techniques such as Latent Semantic Indexing, Language Model, and Semantic Similarity.
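
To make the retrieval idea concrete, the following is a minimal sketch of ranking existing questions against an incoming one using L2-normalized TF-IDF vectors and cosine similarity. It illustrates only the TF-IDF ingredient and is not the Golden Retriever implementation: the relevance heuristics and semantic relatedness components are omitted, the smoothed IDF formula is an assumption, and the function names (`tokenize`, `normalize`, `build_index`, `retrieve`) and sample questions are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase the text and split on any non-alphanumeric character."""
    cleaned = "".join(c.lower() if c.isalnum() else " " for c in text)
    return cleaned.split()


def normalize(vec):
    """Scale a sparse vector (dict) to unit L2 length; empty if all-zero."""
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def build_index(questions):
    """Return (idf, vectors) for a list of existing questions."""
    docs = [Counter(tokenize(q)) for q in questions]
    n = len(docs)
    df = Counter()
    for d in docs:
        df.update(d.keys())  # each term counted once per document
    # Assumed smoothed IDF; the source does not specify a weighting scheme.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = [normalize({t: tf * idf[t] for t, tf in d.items()}) for d in docs]
    return idf, vectors


def retrieve(query, questions, idf, vectors, k=3):
    """Rank existing questions by cosine similarity to the query."""
    counts = Counter(tokenize(query))
    # Terms unseen in the indexed questions are ignored.
    qvec = normalize({t: tf * idf[t] for t, tf in counts.items() if t in idf})
    scored = [
        (sum(w * v.get(t, 0.0) for t, w in qvec.items()), i)
        for i, v in enumerate(vectors)
    ]
    scored.sort(key=lambda s: (-s[0], s[1]))  # best score first, stable on ties
    return [(questions[i], score) for score, i in scored[:k]]


# Hypothetical sample data, for illustration only.
questions = [
    "What are the symptoms of type 2 diabetes?",
    "How can I lower my blood pressure naturally?",
    "Is ibuprofen safe during pregnancy?",
]
idf, vectors = build_index(questions)
results = retrieve("early symptoms of diabetes", questions, idf, vectors)
```

Because both the query and the indexed questions are unit-normalized, the dot product in `retrieve` is the cosine similarity, so question length does not by itself inflate a score. A purely lexical ranker like this cannot match paraphrases that share no terms, which is the gap a semantic relatedness component would be expected to address.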