Optimization of Information-Seeking Dialogue Strategy for Argumentation-Based Dialogue System

Argumentation-based dialogue systems, which can handle and exchange arguments through dialogue, have been widely researched. It is required that these systems have sufficient supporting information to argue their claims rationally; however, the systems often do not have enough of such information in realistic situations. One way to fill in the gap is acquiring such missing information from dialogue partners (information-seeking dialogue). Existing information-seeking dialogue systems are based on handcrafted dialogue strategies that exhaustively examine missing information. However, the proposed strategies are not specialized in collecting information for constructing rational arguments. Moreover, the number of system's inquiry candidates grows in accordance with the size of the argument set that the system deal with. In this paper, we formalize the process of information-seeking dialogue as Markov decision processes (MDPs) and apply deep reinforcement learning (DRL) for automatically optimizing a dialogue strategy. By utilizing DRL, our dialogue strategy can successfully minimize objective functions, the number of turns it takes for our system to collect necessary information in a dialogue. We conducted dialogue experiments using two datasets from different domains of argumentative dialogue. Experimental results show that the proposed formalization based on MDP works well, and the policy optimized by DRL outperformed existing heuristic dialogue strategies.

[1]  Solomon Eyal Shimony,et al.  Probabilistic Semantics for Cost Based Abduction , 1990, AAAI.

[2]  Kallirroi Georgila,et al.  Reinforcement Learning of Question-Answering Dialogue Policies for Virtual Museum Guides , 2012, SIGDIAL Conference.

[3]  Sultan Alahmari Reinforcement Learning for Abstract Argumentation : A Q-learning approach , 2017 .

[4]  Kallirroi Georgila,et al.  Reinforcement Learning of Argumentation Dialogue Policies in Negotiation , 2011, INTERSPEECH.

[5]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[6]  Simon Parsons,et al.  Modelling dialogues using argumentation , 2000, Proceedings Fourth International Conference on MultiAgent Systems.

[7]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[8]  Tatsuya Kawahara,et al.  Conversational system for information navigation based on POMDP with user focus tracking , 2015, Comput. Speech Lang..

[9]  Sarit Kraus,et al.  Strategical Argumentative Agent for Human Persuasion , 2016, ECAI.

[10]  D. Walton,et al.  Commitment in Dialogue: Basic Concepts of Interpersonal Reasoning , 1995 .

[11]  Kentaro Inui,et al.  ILP-Based Reasoning for Weighted Abduction , 2011, Plan, Activity, and Intent Recognition.

[12]  Phan Minh Dung,et al.  Assumption-Based Argumentation , 2009, Argumentation in Artificial Intelligence.

[13]  Michael Wooldridge,et al.  An analysis of formal inter-agent dialogues , 2002, AAMAS '02.

[14]  Peter McBurney,et al.  Representing Epistemic Uncertainty by Means of Dialectical Argumentation , 2001, Annals of Mathematics and Artificial Intelligence.

[15]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[16]  Jerry R. Hobbs,et al.  Abductive Reasoning with a Large Knowledge Base for Discourse Processing , 2011, IWCS.

[17]  Henry Prakken,et al.  Introduction to structured argumentation , 2014, Argument Comput..

[18]  Kallirroi Georgila,et al.  Reinforcement Learning of Multi-Issue Negotiation Dialogue Policies , 2015, SIGDIAL Conference.

[19]  Nicolas Maudet,et al.  Optimization of Probabilistic Argumentation with Markov Decision Models , 2015, IJCAI.

[20]  Hado van Hasselt,et al.  Double Q-learning , 2010, NIPS.

[21]  Serena Villata,et al.  NoDE: A Benchmark of Natural Language Arguments , 2014, COMMA.

[22]  Roberto Pieraccini,et al.  A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..

[23]  Michael Wooldridge,et al.  On the outcomes of formal inter-agent dialogues , 2003, AAMAS '03.

[24]  Steve J. Young,et al.  Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..

[25]  Francesca Toni,et al.  Mechanism Design for Argumentation-Based Information-Seeking and Inquiry , 2015, PRIMA.

[26]  Anthony Hunter,et al.  Constructing argument graphs with deductive arguments: a tutorial , 2014, Argument Comput..

[27]  Francesca Toni,et al.  Agent Strategies for ABA-based Information-seeking and Inquiry Dialogues , 2012, ECAI.