Internet users today prefer getting precise answers to their questions rather than sifting through a bunch of relevant documents provided by search engines. This has led to the huge popularity of Community Question Answering cQA services like Yahoo! Answers, Baidu Zhidao, Quora, StackOverflowetc., where forum users respond to questions with precise answers. Over time, such cQA archives become rich repositories of knowledge encoded in the form of questions and user generated answers. In cQA archives, retrieval of similar questions, which have already been answered in some form, is important for improving the effectiveness of such forums. The main challenge while retrieving similar questions is the "lexico-syntactic" gap between the user query and the questions already present in the forum. In this paper, we propose a novel approach called "Deep Structured Topic Model DSTM" to bridge the lexico-syntactic gap between the question posed by the user and forum questions. DSTM employs a two-step process consisting of initially retrieving similar questions that lie in the vicinity of the query and latent topic vector space and then re-ranking them using a deep layered semantic model. Experiments on large scale real-life cQA dataset show that our approach outperforms the state-of-the-art translation and topic based baseline approaches.
[1]
Ben He,et al.
Question-answer topic model for question retrieval in community question answering
,
2012,
CIKM.
[2]
Michael I. Jordan,et al.
Latent Dirichlet Allocation
,
2001,
J. Mach. Learn. Res..
[3]
Zhoujun Li,et al.
Question Retrieval with High Quality Answers in Community Question Answering
,
2014,
CIKM.
[4]
Yelong Shen,et al.
Learning semantic representations using convolutional neural networks for web search
,
2014,
WWW.
[5]
W. Bruce Croft,et al.
Retrieval models for question and answer archives
,
2008,
SIGIR '08.
[6]
Li Cai,et al.
Learning the Latent Topics for Question Retrieval in Community QA
,
2011,
IJCNLP.
[7]
Larry P. Heck,et al.
Learning deep structured semantic models for web search using clickthrough data
,
2013,
CIKM.
[8]
W. Bruce Croft,et al.
Finding similar questions in large question and answer archives
,
2005,
CIKM '05.
[9]
Po Hu,et al.
Learning Continuous Word Embedding with Metadata for Question Retrieval in Community Question Answering
,
2015,
ACL.
[10]
Li Cai,et al.
Phrase-Based Translation Model for Question Retrieval in Community Question Answer Archives
,
2011,
ACL.