Learning from Exploration: Towards an Explainable Goal Reasoning Agent

Complex, real-world environments may not be fully modeled for an agent, especially if the agent has never operated in the environment before. The agent’s ability to effectively plan and act in the environment is influenced by its knowledge of when it can perform specific actions and the effects of those actions. We describe progress on an explainable exploratory planning agent that is capable of learning action preconditions using an exploratory planning process. The agent’s architecture allows it to perform both exploratory actions as well as goal-directed actions, which opens up important considerations for how exploratory planning and goal planning should be controlled, as well as how the agent’s behavior should be explained to any teammates it may have. We describe initial approaches for both exploratory planning and action model learning, and evaluate them in the video game Dungeon Crawl Stone Soup.

[1]  David W. Aha,et al.  Human-Agent Teaming as a Common Problem for Goal Reasoning , 2018, AAAI Spring Symposia.

[2]  Larry A. Rendell,et al.  Effective Generalization of Relational Descriptions , 1990, AAAI.

[3]  David W. Aha,et al.  Goal Reasoning for an Autonomous Squad Member , 2015 .

[4]  Xuemei Wang,et al.  Learning by Observation and Practice: An Incremental Approach for Planning Operator Acquisition , 1995, ICML.

[5]  J. Ross Quinlan,et al.  Learning logical definitions from relations , 1990, Machine Learning.

[6]  Matthew Klenk,et al.  DiscoverHistory: understanding the past in planning and execution , 2012, AAMAS.

[7]  Daniel Bryce,et al.  Planning and Acting in Incomplete Domains , 2011, ICAPS.

[8]  Stefan Edelkamp,et al.  Automated Planning: Theory and Practice , 2007, Künstliche Intell..

[9]  Peter Stone,et al.  Intrinsically motivated model learning for developing curious robots , 2017, Artif. Intell..

[10]  Jörg Hoffmann,et al.  The Metric-FF Planning System: Translating ''Ignoring Delete Lists'' to Numeric State Variables , 2003, J. Artif. Intell. Res..

[11]  Agnar Aamodt,et al.  Explanation-Driven Case-Based Reasoning , 1993, EWCBR.

[12]  Steven A. Vere,et al.  Multilevel Counterfactuals for Generalizations of Relational Concepts and Productions , 1980, Artif. Intell..

[13]  Frederick Hayes-Roth,et al.  An interference matching technique for inducing abstractions , 1978, CACM.

[14]  David W. Aha,et al.  Towards Explainable NPCs: A Relational Exploration Learning Agent , 2018, AAAI Workshops.