论文信息 - LSTM-based Deep Learning Models for non-factoid answer selection - 字舞流文

LSTM-based Deep Learning Models for non-factoid answer selection

In this paper, we apply a general deep learning (DL) framework for the answer selection task, which does not depend on manually defined features or linguistic tools. The basic framework is to build the embeddings of questions and answers based on bidirectional long short-term memory (biLSTM) models, and measure their closeness by cosine similarity. We further extend this basic model in two directions. One direction is to define a more composite representation for questions and answers by combining convolutional neural network with the basic framework. The other direction is to utilize a simple but efficient attention mechanism in order to generate the answer representation according to the question context. Several variations of models are provided. The models are examined by two datasets, including TREC-QA and InsuranceQA. Experimental results demonstrate that the proposed models substantially outperform several strong baselines.

Bowen Zhou | Bing Xiang | Ming Tan | M. Tan | Bing Xiang | Bowen Zhou | Ming Tan

[1] Cícero Nogueira dos Santos,et al. Learning Hybrid Representations to Retrieve Semantically Equivalent Questions , 2015, ACL.

[2] Yelong Shen,et al. Learning semantic representations using convolutional neural networks for web search , 2014, WWW.

[3] Noah A. Smith,et al. What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA , 2007, EMNLP.

[4] Bowen Zhou,et al. Applying deep learning to answer selection: A study and an open task , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).

[5] Ming-Wei Chang,et al. Question Answering Using Enhanced Lexical Semantic Models , 2013, ACL.

[6] Phil Blunsom,et al. Teaching Machines to Read and Comprehend , 2015, NIPS.

[7] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[8] Hang Li,et al. Convolutional Neural Network Architectures for Matching Natural Language Sentences , 2014, NIPS.

[9] Jason Weston,et al. End-To-End Memory Networks , 2015, NIPS.

[10] Quoc V. Le,et al. A Neural Conversational Model , 2015, ArXiv.

[11] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[12] Jason Weston,et al. #TagSpace: Semantic Embeddings from Hashtags , 2014, EMNLP.

[13] Jason Weston,et al. A Neural Attention Model for Sentence Summarization , 2015 .

[14] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15] Ting Liu,et al. Document Modeling with Gated Recurrent Neural Network for Sentiment Classification , 2015, EMNLP.

[16] Alessandro Moschitti,et al. Automatic Feature Engineering for Answer Selection and Extraction , 2013, EMNLP.

[17] Larry P. Heck,et al. Learning deep structured semantic models for web search using clickthrough data , 2013, CIKM.

[18] Xinyun Chen. Under Review as a Conference Paper at Iclr 2017 Delving into Transferable Adversarial Ex- Amples and Black-box Attacks , 2016 .

[19] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[20] Chris Callison-Burch,et al. Answer Extraction as Sequence Tagging with Tree Edit Distance , 2013, NAACL.

[21] Alessandro Moschitti,et al. Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks , 2015, SIGIR.

[22] Di Wang,et al. A Long Short-Term Memory Model for Answer Sentence Selection in Question Answering , 2015, ACL.

[23] Razvan Pascanu,et al. Theano: new features and speed improvements , 2012, ArXiv.

[24] Lei Yu,et al. Deep Learning for Answer Sentence Selection , 2014, ArXiv.

[25] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[26] Jason Weston,et al. Weakly Supervised Memory Networks , 2015, ArXiv.

[27] Christopher D. Manning,et al. Probabilistic Tree-Edit Models with Structured Latent Variables for Textual Entailment and Question Answering , 2010, COLING.

[28] Noah A. Smith,et al. Tree Edit Models for Recognizing Textual Entailments, Paraphrases, and Answers to Questions , 2010, NAACL.