Crossing Variational Autoencoders for Answer Retrieval

Answer retrieval is the task of finding the most aligned answer from a large set of candidates given a question. Learning vector representations of questions and answers is the key to this task. Question-answer alignment and question/answer semantics are two important signals for learning these representations. Existing methods learn semantic representations with dual encoders or dual variational auto-encoders, where the semantic information comes from language models or from question-to-question (answer-to-answer) generative processes. However, alignment and semantics are modeled too separately to capture the aligned semantics between a question and its answer. In this work, we propose to cross variational auto-encoders by generating questions from aligned answers and generating answers from aligned questions. Experiments show that our method outperforms the state-of-the-art answer retrieval method on SQuAD.
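
As an illustrative sketch only (not the paper's exact objective), the crossed training criterion can be viewed as two conditional VAE terms in which the latent code inferred from a question is decoded into its aligned answer, and vice versa. Here q_phi and q_psi denote assumed names for the question and answer encoders, p_theta and p_omega for the answer and question decoders, and p(z) for a standard Gaussian prior:

  L(q, a) = E_{z ~ q_phi(z|q)}[ log p_theta(a|z) ] - KL( q_phi(z|q) || p(z) )
          + E_{z ~ q_psi(z|a)}[ log p_omega(q|z) ] - KL( q_psi(z|a) || p(z) )

Under this reading, each direction is a standard evidence lower bound, but the reconstruction target is the aligned counterpart rather than the input itself, which ties question and answer semantics to a shared latent space.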
