Lexicon-Enhanced Transformer with Pointing for Domains Specific Generative Question Answering

Aiming at the problem of inaccurate generation caused by the lack of external knowledge in the generative automatic question answering system, we propose a new answer generation model (LEP-Transformer) that integrates domain lexicon and copy mechanism, which can enable the Transformer to effectively deal with the long-distance dependence of different text granularity and have the ability to reproduce the details of the facts when generating answers. And the experimental results on two different datasets show that the model can alleviate this problem and has ability to model short text and long text sequences simultaneously.

[1]  Xiaoyu Shen,et al.  Improving Variational Encoder-Decoders in Dialogue Generation , 2018, AAAI.

[2]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[3]  Alan Ritter,et al.  Adversarial Learning for Neural Dialogue Generation , 2017, EMNLP.

[4]  Liang He,et al.  Knowledge Memory Based LSTM Model for Answer Selection , 2017, ICONIP.

[5]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[6]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[7]  S. T. Dumais,et al.  Human factors and behavioral science: Statistical semantics: Analysis of the potential performance of key-word information systems , 1983, The Bell System Technical Journal.

[8]  Stephen Clark,et al.  Latent Variable Dialogue Models and their Diversity , 2017, EACL.

[9]  Ashish Vaswani,et al.  Self-Attention with Relative Position Representations , 2018, NAACL.

[10]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[11]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[12]  Jianqiang Li,et al.  Mobile Medical Question and Answer System with Improved Char-level based Convolution Neural Network and Sparse Auto Encoder , 2019, APIT 2019.

[13]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[14]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[15]  Joelle Pineau,et al.  How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation , 2016, EMNLP.

[16]  Jason Weston,et al.  End-To-End Memory Networks , 2015, NIPS.

[17]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[18]  Ali Farhadi,et al.  Bidirectional Attention Flow for Machine Comprehension , 2016, ICLR.