Heuristic Search in Cyclic AND/OR Graphs

Classical heuristic search algorithms find solutions that take the form of a simple path (A*) or of a tree or acyclic graph (AO*). We present a novel generalization of heuristic search, called LAO*, that can find solutions with loops, that is, solutions that take the form of a cyclic graph. We show that LAO* can solve Markov decision problems without evaluating the entire state space, which gives it an advantage over dynamic-programming algorithms such as policy iteration and value iteration as an approach to stochastic planning.
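
The idea summarized above can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes a stochastic shortest-path problem with positive action costs, a goal reachable from every state, and an admissible heuristic, and it simplifies the update step by running value iteration over all expanded states rather than only over the ancestors of the newly expanded state. The function `lao_star`, its parameters, and the toy problem (`demo_actions`, `demo_transition`, `demo_cost`) are hypothetical names introduced for illustration.

```python
def lao_star(start, is_goal, actions, transition, cost, heuristic, eps=1e-6):
    """Simplified LAO*-style search (illustrative sketch, not the paper's code).

    transition(s, a) returns a list of (probability, successor) pairs.
    Returns (value, policy, expanded).
    """
    value = {start: 0.0 if is_goal(start) else heuristic(start)}
    policy = {}
    expanded = set()

    def q(s, a):
        # Expected cost of taking action a in s under current value estimates.
        return cost(s, a) + sum(p * value[t] for p, t in transition(s, a))

    def backup(s):
        # Bellman backup: pick the cheapest action, return the change in value.
        best = min(actions(s), key=lambda a: q(s, a))
        new_value = q(s, best)
        residual = abs(new_value - value[s])
        value[s], policy[s] = new_value, best
        return residual

    def greedy_reachable():
        # States reachable from start by following the current best actions.
        seen, stack = {start}, [start]
        while stack:
            s = stack.pop()
            if is_goal(s) or s not in expanded:
                continue
            for _, t in transition(s, policy[s]):
                if t not in seen:
                    seen.add(t)
                    stack.append(t)
        return seen

    while True:
        fringe = [s for s in greedy_reachable()
                  if not is_goal(s) and s not in expanded]
        if not fringe:
            # The greedy solution graph is closed: no unexpanded states remain.
            return value, policy, expanded
        s = fringe[0]
        expanded.add(s)
        for a in actions(s):
            for _, t in transition(s, a):
                if t not in value:
                    value[t] = 0.0 if is_goal(t) else heuristic(t)
        # Simplification: value iteration over every expanded state.
        # Unexpanded states keep their heuristic values; goals stay at 0.
        while max(backup(x) for x in expanded) > eps:
            pass


# Hypothetical toy problem: integer states from LOWEST to GOAL.
# "go" costs 1 and advances with probability 0.5, otherwise stays (a loop).
# "back" costs 3 and moves one step away from the goal.
GOAL, LOWEST = 3, -50


def demo_actions(s):
    return ["go", "back"] if s > LOWEST else ["go"]


def demo_transition(s, a):
    if a == "go":
        return [(0.5, s + 1), (0.5, s)]
    return [(1.0, s - 1)]


def demo_cost(s, a):
    return 1.0 if a == "go" else 3.0


value, policy, expanded = lao_star(
    start=0,
    is_goal=lambda s: s == GOAL,
    actions=demo_actions,
    transition=demo_transition,
    cost=demo_cost,
    heuristic=lambda s: float(max(0, GOAL - s)),
)
print(sorted(expanded), round(value[0], 3))
```

In this toy problem the solution contains loops, because "go" can leave the state unchanged, yet the search expands only states 0, 1, and 2 out of 54; the heuristic keeps the states below the start from ever being evaluated.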
