论文信息 - Neural Approaches to Conversational AI

Neural Approaches to Conversational AI

The present paper surveys neural approaches to conversational AI that have been developed in the last few years. We group conversational systems into three categories: (1) question answering agents, (2) task-oriented dialogue agents, and (3) chatbots. For each category, we present a review of state-of-the-art neural approaches, draw the connection between them and traditional approaches, and discuss the progress that has been made and challenges still being faced, using specific systems and models as case studies.

Lihong Li | Michel Galley | Jianfeng Gao

[1] Hermann Ney,et al. The Alignment Template Approach to Statistical Machine Translation , 2004, CL.

[2] David Vandyke,et al. Learning from real users: rating dialogue success with neural networks for reinforcement learning in spoken dialogue systems , 2015, INTERSPEECH.

[3] Dirk Weissenborn,et al. FastQA: A Simple and Efficient Neural Architecture for Question Answering , 2017, ArXiv.

[4] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.

[5] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[6] Yoram Singer,et al. BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[7] Peter L. Bartlett,et al. Infinite-Horizon Policy-Gradient Estimation , 2001, J. Artif. Intell. Res..

[8] Ming-Wei Chang,et al. Traversing Knowledge Graph in Vector Space without Symbolic Space Guidance , 2016 .

[9] Tom M. Mitchell,et al. Incorporating Vector Space Similarity in Random Walk Inference over Knowledge Bases , 2014, EMNLP.

[10] Jianfeng Gao,et al. End-to-End Task-Completion Neural Dialogue Systems , 2017, IJCNLP.

[11] Hervé Frezza-Buet,et al. Sample-efficient batch reinforcement learning for dialogue management optimization , 2011, TSLP.

[12] Jason Weston,et al. Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[13] Ronald A. Cole,et al. TOOLS FOR RESEARCH AND EDUCATION IN SPEECH SCIENCE , 1999 .

[14] Bing Liu,et al. Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling , 2016, INTERSPEECH.

[15] Matthew Richardson,et al. MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text , 2013, EMNLP.

[16] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[17] Lucy Vanderwende,et al. MindNet: Acquiring and Structuring Semantic Information from Text , 1998, COLING-ACL.

[18] Kallirroi Georgila,et al. Hybrid Reinforcement/Supervised Learning of Dialogue Policies from Fixed Data Sets , 2008, CL.

[19] Young-Bum Kim,et al. Task Completion Platform: A self-serve multi-domain goal oriented dialogue platform , 2016, NAACL.

[20] Alexandros Papangelis,et al. Comparison of an End-to-end Trainable Dialogue System with a Modular Statistical Dialogue System , 2018, INTERSPEECH.

[21] Stefan Ultes,et al. Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning , 2017, SIGDIAL Conference.

[22] Dat Quoc Nguyen. An overview of embedding models of entities and relationships for knowledge base completion , 2017, ArXiv.

[23] Shie Mannor,et al. Reinforcement learning with Gaussian processes , 2005, ICML.

[24] Jianfeng Gao,et al. Investigation of Language Understanding Impact for Reinforcement Learning Based Dialogue Systems , 2017, ArXiv.

[25] Roberto Pieraccini,et al. User modeling for spoken dialogue system evaluation , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[26] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[27] K. Colby. Artificial paranoia; a computer simulation of paranoid processes , 1975 .

[28] Kallirroi Georgila,et al. User simulation for spoken dialogue systems: learning and evaluation , 2006, INTERSPEECH.

[29] Percy Liang,et al. Know What You Don’t Know: Unanswerable Questions for SQuAD , 2018, ACL.

[30] Jiliang Tang,et al. A Survey on Dialogue Systems: Recent Advances and New Frontiers , 2017, SKDD.

[31] Peng Xu,et al. Emo2Vec: Learning Generalized Emotion Representation by Multi-task Training , 2018, WASSA@EMNLP.

[32] Xiang Zhou,et al. Agent-Aware Dropout DQN for Safe and Efficient On-line Dialogue Policy Learning , 2017, EMNLP.

[33] Kam-Fai Wong,et al. Integrating planning for task-completion dialogue policy learning , 2018, ACL.

[34] Geoffrey E. Hinton,et al. Keeping the neural networks simple by minimizing the description length of the weights , 1993, COLT '93.

[35] Jianfeng Gao,et al. Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation , 2017, IJCNLP.

[36] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[37] Eric Horvitz,et al. Multiparty Turn Taking in Situated Dialog: Study, Lessons, and Directions , 2011, SIGDIAL Conference.

[38] Joan Bruna,et al. Intriguing properties of neural networks , 2013, ICLR.

[39] Oliver Lemon,et al. Natural Language Generation as Planning Under Uncertainty for Spoken Dialogue Systems , 2009, EACL.

[40] Benjamin Van Roy,et al. A Tutorial on Thompson Sampling , 2017, Found. Trends Mach. Learn..

[41] Harry Shum,et al. From Eliza to XiaoIce: challenges and opportunities with social chatbots , 2018, Frontiers of Information Technology & Electronic Engineering.

[42] Hua Ai,et al. Comparing Spoken Dialog Corpora Collected with Recruited Subjects versus Real Users , 2007, SIGDIAL.

[43] Kevin Knight,et al. Generation that Exploits Corpus-Based Statistical Knowledge , 1998, ACL.

[44] Le Song,et al. Boosting the Actor with Dual Critic , 2017, ICLR.

[45] Benjamin Van Roy,et al. Why is Posterior Sampling Better than Optimism for Reinforcement Learning? , 2016, ICML.

[46] Maxine Eskénazi,et al. Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning , 2016, SIGDIAL Conference.

[47] Matthew R. Walter,et al. Coherent Dialogue with Attention-Based Language Models , 2016, AAAI.

[48] Jens Lehmann,et al. DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[49] Denny Britz,et al. Generating High-Quality and Informative Conversation Responses with Sequence-to-Sequence Models , 2017, EMNLP.

[50] Filip De Turck,et al. VIME: Variational Information Maximizing Exploration , 2016, NIPS.

[51] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[52] Ming-Wei Chang,et al. Search-based Neural Structured Learning for Sequential Question Answering , 2017, ACL.

[53] W. R. Thompson. ON THE LIKELIHOOD THAT ONE UNKNOWN PROBABILITY EXCEEDS ANOTHER IN VIEW OF THE EVIDENCE OF TWO SAMPLES , 1933 .

[54] Chong Wang,et al. Subgoal Discovery for Hierarchical Dialogue Policy Learning , 2018, EMNLP.

[55] Stefan Ultes,et al. Domain-Independent User Satisfaction Reward Estimation for Dialogue Policy Learning , 2017, INTERSPEECH.

[56] Luke S. Zettlemoyer,et al. Deep Contextualized Word Representations , 2018, NAACL.

[57] Jing He,et al. Policy Networks with Two-Stage Training for Dialogue Systems , 2016, SIGDIAL Conference.

[58] Yuxing Peng,et al. Mnemonic Reader for Machine Comprehension , 2017, ArXiv.

[59] Ewan Klein,et al. Natural Language Processing with Python , 2009 .

[60] Jianfeng Gao,et al. Microsoft Dialogue Challenge: Building End-to-End Task-Completion Dialogue Systems , 2018, ArXiv.

[61] Jianfeng Gao,et al. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses , 2015, NAACL.

[62] Jianfeng Gao,et al. BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems , 2016, AAAI.

[63] Jianfeng Gao,et al. deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets , 2015, ACL.

[64] J. Schatztnann,et al. Effects of the user model on simulation-based learning of dialogue strategies , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..

[65] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.

[66] Geoffrey E. Hinton,et al. Application of Deep Belief Networks for Natural Language Understanding , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[67] Oliver Lemon,et al. DIPPER: Description and Formalisation of an Information-State Update Dialogue System Architecture , 2003, SIGDIAL Workshop.

[68] Blake Howald,et al. A Statistical NLG Framework for Aggregated Planning and Realization , 2013, ACL.

[69] Philip Bachman,et al. NewsQA: A Machine Comprehension Dataset , 2016, Rep4NLP@ACL.

[70] Jianfeng Gao,et al. Multi-Task Learning for Speaker-Role Adaptation in Neural Conversation Models , 2017, IJCNLP.

[71] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[72] Hugo Larochelle,et al. GuessWhat?! Visual Object Discovery through Multi-modal Dialogue , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[73] Marvin Minsky,et al. Perceptrons: An Introduction to Computational Geometry , 1969 .

[74] Tim Paek. Empirical Methods for Evaluating Dialog Systems , 2001, SIGDIAL Workshop.

[75] David Berthelot,et al. WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia , 2016, ACL.

[76] Hao Tian,et al. Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System , 2014, EMNLP.

[77] Yelong Shen,et al. FusionNet: Fusing via Fully-Aware Attention with Application to Machine Comprehension , 2017, ICLR.

[78] Xinyan Xiao,et al. DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications , 2017, QA@ACL.

[79] Christopher D. Manning,et al. Key-Value Retrieval Networks for Task-Oriented Dialogue , 2017, SIGDIAL Conference.

[80] Pascale Fung,et al. Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems , 2018, ACL.

[81] Ali Farhadi,et al. Bidirectional Attention Flow for Machine Comprehension , 2016, ICLR.

[82] Jianfeng Gao,et al. Learning Continuous Phrase Representations for Translation Modeling , 2014, ACL.

[83] Dengyong Zhou,et al. Action-depedent Control Variates for Policy Optimization via Stein's Identity , 2017 .

[84] Yang Liu,et al. Visualizing and Understanding Neural Machine Translation , 2017, ACL.

[85] Tiejun Zhao,et al. Knowledge-Based Question Answering as Machine Translation , 2014, ACL.

[86] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[87] Jason Weston,et al. The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations , 2015, ICLR.

[88] Jason D. Williams,et al. Partially Observable Markov Decision Processes for Spoken Dialogue Management , 2006 .

[89] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[90] Matthew Henderson,et al. Deep Neural Network Approach for the Dialog State Tracking Challenge , 2013, SIGDIAL Conference.

[91] Alex Acero,et al. Spoken Language Understanding "” An Introduction to the Statistical Framework , 2005 .

[92] Benjamin Van Roy,et al. Deep Exploration via Bootstrapped DQN , 2016, NIPS.

[93] Dilek Z. Hakkani-Tür,et al. End-to-end joint learning of natural language understanding and dialogue manager , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[94] Leora Morgenstern,et al. The Winograd Schema Challenge: Evaluating Progress in Commonsense Reasoning , 2015, AAAI.

[95] Xiang Zhang,et al. Evaluating Prerequisite Qualities for Learning End-to-End Dialog Systems , 2015, ICLR.

[96] Hua Ai,et al. Assessing Dialog System User Simulation Evaluation Measures Using Human Judges , 2008, ACL.

[97] Geoffrey Zweig,et al. Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning , 2017, ACL.

[98] Jianfeng Gao,et al. Deep Reinforcement Learning with a Natural Language Action Space , 2015, ACL.

[99] Pierre Geurts,et al. Tree-Based Batch Mode Reinforcement Learning , 2005, J. Mach. Learn. Res..

[100] Joelle Pineau,et al. The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems , 2015, SIGDIAL Conference.

[101] Alexander I. Rudnicky,et al. Stochastic natural language generation for spoken dialog systems , 2002, Comput. Speech Lang..

[102] Xiaodong Liu,et al. Stochastic Answer Networks for Machine Reading Comprehension , 2017, ACL.

[103] Andrew McCallum,et al. Compositional Vector Space Models for Knowledge Base Completion , 2015, ACL.

[104] Thierry Dutoit,et al. A probabilistic framework for dialog simulation and optimal strategy learning , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[105] Jianfeng Gao,et al. A Human Generated MAchine Reading COmprehension Dataset , 2018 .

[106] Pascale Fung,et al. Towards Empathetic Human-Robot Interactions , 2016, CICLing.

[107] Jianfeng Gao,et al. Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access , 2016, ACL.

[108] Danqi Chen,et al. A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task , 2016, ACL.

[109] Yelong Shen,et al. ReasoNet: Learning to Stop Reading in Machine Comprehension , 2016, CoCo@NIPS.

[110] Jason D. Williams,et al. Evaluating user simulations with the Cramér-von Mises divergence , 2008, Speech Commun..

[111] Oliver Lemon,et al. Learning what to say and how to say it: Joint optimisation of spoken dialogue management and natural language generation , 2011, Comput. Speech Lang..

[112] Dilek Z. Hakkani-Tür,et al. Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems , 2018, NAACL.

[113] Pei-hao Su,et al. Reward estimation for dialogue policy optimisation , 2018, Comput. Speech Lang..

[114] Yuting Lai,et al. DRCD: a Chinese Machine Reading Comprehension Dataset , 2018, ArXiv.

[115] Antoine Raux,et al. The Dialog State Tracking Challenge Series , 2014, AI Mag..

[116] Gary Geunbae Lee,et al. Data-driven user simulation for automated evaluation of spoken dialog systems , 2009, Comput. Speech Lang..

[117] Sungjin Lee,et al. Zero-Shot Adaptive Transfer for Conversational Language Understanding , 2018, AAAI.

[118] Joelle Pineau,et al. Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses , 2017, ACL.

[119] Qiang Wu,et al. Adapting boosting for information retrieval measures , 2010, Information Retrieval.

[120] Alon Lavie,et al. BLANC: Learning Evaluation Metrics for MT , 2005, HLT.

[121] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[122] Jianfeng Gao,et al. A User Simulator for Task-Completion Dialogues , 2016, ArXiv.

[123] Geoffrey Zweig,et al. Recurrent neural networks for language understanding , 2013, INTERSPEECH.

[124] Wei-Ying Ma,et al. Hierarchical Recurrent Attention Network for Response Generation , 2017, AAAI.

[125] Javier Snaider,et al. Conversational Contextual Cues: The Case of Personalization and History for Response Ranking , 2016, ArXiv.

[126] Marilyn A. Walker,et al. An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email , 2000, J. Artif. Intell. Res..

[127] Gerhard Weikum,et al. WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[128] Franck Dernoncourt,et al. Sequential Short-Text Classification with Recurrent and Convolutional Neural Networks , 2016, NAACL.

[129] Praveen Paritosh,et al. Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[130] Michael Gamon,et al. A Machine Learning Approach to the Automatic Evaluation of Machine Translation , 2001, ACL.

[131] Long Ji Lin,et al. Self-improving reactive agents based on reinforcement learning, planning and teaching , 1992, Machine Learning.

[132] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[133] David Vandyke,et al. Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems , 2015, EMNLP.

[134] John Bell,et al. Pragmatic Reasoning: Inferring Contexts , 1999, CONTEXT.

[135] Matthew Henderson,et al. Machine Learning for Dialog State Tracking: A Review , 2015 .

[136] Yelong Shen,et al. M-Walk: Learning to Walk in Graph with Monte Carlo Tree Search , 2018, NIPS 2018.

[137] Lihong Li,et al. Reinforcement learning for dialog management using least-squares Policy iteration and fast feature selection , 2009, INTERSPEECH.

[138] Geoffrey Zweig,et al. End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning , 2016, ArXiv.

[139] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[140] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..

[141] Jianfeng Gao,et al. A Diversity-Promoting Objective Function for Neural Conversation Models , 2015, NAACL.

[142] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[143] Peter L. Bartlett,et al. Experiments with Infinite-Horizon, Policy-Gradient Estimation , 2001, J. Artif. Intell. Res..

[144] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[145] Sebastian Riedel,et al. Constructing Datasets for Multi-hop Reading Comprehension Across Documents , 2017, TACL.

[146] Oliver Lemon,et al. Evaluation of a hierarchical reinforcement learning spoken dialogue system , 2010, Comput. Speech Lang..

[147] Philipp Koehn,et al. Findings of the 2009 Workshop on Statistical Machine Translation , 2009, WMT@EACL.

[148] Roberto Pieraccini,et al. A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..

[149] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[150] Pascale Fung,et al. End-to-End Dynamic Query Memory Network for Entity-Value Independent Task-Oriented Dialog , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[151] Geoffrey Zweig,et al. Attention with Intention for a Neural Network Conversation Model , 2015, ArXiv.

[152] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[153] Danqi Chen,et al. CoQA: A Conversational Question Answering Challenge , 2018, TACL.

[154] Yi Pan,et al. Conversational AI: The Science Behind the Alexa Prize , 2018, ArXiv.

[155] Eric Horvitz,et al. Models for Multiparty Engagement in Open-World Dialog , 2009, SIGDIAL Conference.

[156] Joelle Pineau,et al. A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues , 2016, AAAI.

[157] Oliver Lemon,et al. Learning and Evaluation of Dialogue Strategies for New Applications: Empirical Methods for Optimization from Small Data Sets , 2011, CL.

[158] Andreas Stolcke,et al. A comparative study of recurrent neural network models for lexical domain classification , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[159] Csaba Szepesvári,et al. Finite time bounds for sampling based fitted value iteration , 2005, ICML.

[160] Jason Weston,et al. Dialogue Learning With Human-In-The-Loop , 2016, ICLR.

[161] Seunghak Yu,et al. Scaling up deep reinforcement learning for multi-domain dialogue systems , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).