论文信息 - Learning to Transform, Combine, and Reason in Open-Domain Question Answering - 字舞流文

Learning to Transform, Combine, and Reason in Open-Domain Question Answering

Users seek direct answers to complex questions from large open-domain knowledge sources like the Web. Open-domain question answering has become a critical task to be solved for building systems that help address users' complex information needs. Most open-domain question answering systems use a search engine to retrieve a set of candidate documents, select one or a few of them as context, and then apply reading comprehension models to extract answers. Some questions, however, require taking a broader context into account, e.g., by considering low-ranked documents that are not immediately relevant, combining information from multiple documents, and reasoning over multiple facts from these documents to infer the answer. In this paper, we propose a model based on the Transformer architecture that is able to efficiently operate over a larger set of candidate documents by effectively combining the evidence from these documents during multiple steps of reasoning, while it is robust against noise from low-ranked non-relevant documents included in the set. We use our proposed model, called TraCRNet, on two public open-domain question answering datasets, SearchQA and Quasar-T, and achieve results that meet or exceed the state-of-the-art.

M. de Rijke | Maarten de Rijke | Jaap Kamps | Mostafa Dehghani | Hosein Azarbonyad | J. Kamps | M. Dehghani | H. Azarbonyad | Mostafa Dehghani

[1] Jason Weston,et al. Reading Wikipedia to Answer Open-Domain Questions , 2017, ACL.

[2] Ruslan Salakhutdinov,et al. Gated-Attention Readers for Text Comprehension , 2016, ACL.

[3] Enrique Alfonseca,et al. Learning to Attend, Copy, and Generate for Session-Based Query Suggestion , 2017, CIKM.

[4] Richard Socher,et al. Dynamic Coattention Networks For Question Answering , 2016, ICLR.

[5] Jian Zhang,et al. SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.

[6] François Chollet,et al. Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Mirella Lapata,et al. Neural Summarization by Extracting Sentences and Words , 2016, ACL.

[8] Ting Liu,et al. Attention-over-Attention Neural Networks for Reading Comprehension , 2016, ACL.

[9] Yelong Shen,et al. ReasoNet: Learning to Stop Reading in Machine Comprehension , 2016, CoCo@NIPS.

[10] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[11] Zhiyuan Liu,et al. Denoising Distantly Supervised Open-Domain Question Answering , 2018, ACL.

[12] Tom Kenter,et al. Byte-Level Machine Reading Across Morphologically Varied Languages , 2018, AAAI.

[13] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[14] Richard Socher,et al. Ask Me Anything: Dynamic Memory Networks for Natural Language Processing , 2015, ICML.

[15] William W. Cohen,et al. Quasar: Datasets for Question Answering by Search and Reading , 2017, ArXiv.

[16] Kenton Lee,et al. Learning Recurrent Span Representations for Extractive Question Answering , 2016, ArXiv.

[17] Ming Zhou,et al. Gated Self-Matching Networks for Reading Comprehension and Question Answering , 2017, ACL.

[18] Diyi Yang,et al. Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[19] Kyunghyun Cho,et al. SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine , 2017, ArXiv.

[20] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[21] Ellen M. Voorhees,et al. The TREC-8 Question Answering Track Report , 1999, TREC.

[22] Ali Farhadi,et al. Bidirectional Attention Flow for Machine Comprehension , 2016, ICLR.

[23] Bert F. Green,et al. Baseball: an automatic question-answerer , 1899, IRE-AIEE-ACM '61 (Western).

[24] Md. Mustafizur Rahman,et al. Neural information retrieval: at the end of the early years , 2017, Information Retrieval Journal.

[25] Xiaodong Liu,et al. Towards Human-level Machine Reading Comprehension: Reasoning and Inference with Multiple Strategies , 2017, ArXiv.

[26] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[27] Eunsol Choi,et al. Coarse-to-Fine Question Answering for Long Documents , 2016, ACL.

[28] Hwee Tou Ng,et al. A Question-Focused Multi-Factor Attention Network for Question Answering , 2018, AAAI.

[29] M. de Rijke,et al. Attentive Memory Networks: Efficient Machine Reading for Conversational Search , 2017, ArXiv.

[30] Ankur Bapna,et al. The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation , 2018, ACL.

[31] Phil Blunsom,et al. Teaching Machines to Read and Comprehend , 2015, NIPS.

[32] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[33] Wei Zhang,et al. Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering , 2017, ICLR.

[34] Wei Zhang,et al. R3: Reinforced Ranker-Reader for Open-Domain Question Answering , 2018, AAAI.

[35] Bowen Zhou,et al. End-to-End Answer Chunk Extraction and Ranking for Reading Comprehension , 2016, 1610.09996.

[36] Benjamin Van Durme,et al. Discriminative Information Retrieval for Question Answering Sentence Selection , 2017, EACL.

[37] Quoc V. Le,et al. QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension , 2018, ICLR.

[38] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[39] Samuel R. Bowman,et al. Training a Ranking Function for Open-Domain Question Answering , 2018, NAACL.

[40] Lukasz Kaiser,et al. Depthwise Separable Convolutions for Neural Machine Translation , 2017, ICLR.

[41] Wei Zhang,et al. R3: Reinforced Reader-Ranker for Open-Domain Question Answering , 2017, ArXiv.

[42] Jannis Bulian,et al. Ask the Right Questions: Active Question Reformulation with Reinforcement Learning , 2017, ICLR.