Interleaving Execution and Planning for Nondeterministic, Partially Observable Domains

Methods that interleave planning and execution are a practical solution to deal with complex planning problems in non-deterministic domains under partial observability. However, most of the existing approaches do not tackle in a principled way the important issue of termination of the planning-execution loop, or only do so considering specific assumptions over the domains. In this paper, we tackle the problem of interleaving planning and execution relying on a general framework, which is able to deal with nondeterministic, partially observable planning domains. We propose a new, general planning algorithm that guarantees the termination of the interleaving of planning and execution: either the goal is achieved, or the system detects that there is no longer a guarantee to progress toward it. Our experimental analysis shows that our algorithm can efficiently solve planning problems that cannot be tackled with a state of the art off-line planner for nondeterministic domains under partial observability, MBP. Moreover, we show that our algorithm can efficiently detect situations where progress toward the goal can be no longer guaranteed.

[1]  Sven Koenig,et al.  Minimax real-time heuristic search , 2001, Artif. Intell..

[2]  Illah R. Nourbakhsh,et al.  Time-Saving Tips for Problem Solving with Incomplete Information , 1993, AAAI.

[3]  Piergiorgio Bertoli,et al.  Planning in Nondeterministic Domains under Partial Observability via Symbolic Model Checking , 2001, IJCAI.

[4]  Michael J. S. Pelican,et al.  Hard Real-time Mode Logic Synthesis for Hybrid Control A CIRCA-based approach , 1999 .

[5]  David E. Smith,et al.  Extending Graphplan to handle uncertainty and sensing actions , 1998, AAAI 1998.

[6]  Leslie Pack Kaelbling,et al.  Acting Optimally in Partially Observable Stochastic Domains , 1994, AAAI.

[7]  Reid G. Simmons,et al.  Real-Time Search in Non-Deterministic Domains , 1995, IJCAI.

[8]  Robert P. Goldman,et al.  Using Model Checking to Plan Hard Real-Time Controllers , 2000 .

[9]  M. Veloso,et al.  OBDD-Based Optimistic and Strong Cyclic Adversarial Planning , 2014 .

[10]  Reid G. Simmons,et al.  Solving Robot Navigation Problems with Initial Pose Uncertainty Using Real-Time Heuristic Search , 1998, AIPS.

[11]  Joachim Hertzberg,et al.  A semi-reactive planner based on a possible models action formalization , 1992 .

[12]  Leslie Pack Kaelbling,et al.  Planning under Time Constraints in Stochastic Domains , 1993, Artif. Intell..

[13]  Karen L. Myers Towards a Framework for Continuous Planning and Execution , 2000 .

[14]  Gregg Collins,et al.  Planning for Contingencies: A Decision-based Approach , 1996, J. Artif. Intell. Res..

[15]  Blai Bonet,et al.  Planning with Incomplete Information as Heuristic Search in Belief Space , 2000, AIPS.

[16]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..