论文信息 - Bounded-cost Search Using Estimates of Uncertainty - 字舞流文

Bounded-cost Search Using Estimates of Uncertainty

Many planning problems are too hard to solve optimally. In bounded-cost search, one attempts to find, as quickly as possible, a plan that costs no more than a user-provided absolute cost bound. Several algorithms have been previously proposed for this setting, including Potential Search (PTS) and Bounded-cost Explicit Estimation Search (BEES). BEES attempts to improve on PTS by predicting whether nodes will lead to plans within the cost bound or not. This paper introduces a relatively simple algorithm, Expected Effort Search (XES), which uses not just point estimates but belief distributions in order to estimate the probability that a node will lead to a plan within the bound. XES’s expansion order minimizes expected search time in a simplified formal model. Experimental results on standard planning and search benchmarks show that it consistently exhibits strong performance, outperforming both PTS and BEES. We also derive improved variants of BEES that can exploit belief distributions. These new methods advance the recent trend of taking advantage of uncertainty estimates in deterministic single-agent search.

Wheeler Ruml | Tianyi Gu | Maximilian Fickert | Maximilian Fickert | Tianyi Gu | Wheeler Ruml

[1] Nils J. Nilsson,et al. Artificial Intelligence , 1974, IFIP Congress.

[2] Wheeler Ruml,et al. Faster Bounded-Cost Search Using Inadmissible Estimates , 2012, ICAPS.

[3] Geoffrey J. Gordon,et al. Bounded real-time dynamic programming: RTDP with monotone upper bounds and performance guarantees , 2005, ICML.

[4] Ivan Hal Sudborough,et al. On the Diameter of the Pancake Network , 1997, J. Algorithms.

[5] Rami Puzis,et al. Potential Search: A Bounded-Cost Search Algorithm , 2011, ICAPS.

[6] Wheeler Ruml,et al. Metareasoning in Real-Time Heuristic Search , 2015, SOCS.

[7] Marek Petrik,et al. Beliefs We Can Believe in: Replacing Assumptions with Data in Real-Time Search , 2020, AAAI.

[8] Shlomo Zilberstein,et al. LAO*: A heuristic search algorithm that finds solutions with loops , 2001, Artif. Intell..

[9] Bernhard Nebel,et al. The FF Planning System: Fast Plan Generation Through Heuristic Search , 2011, J. Artif. Intell. Res..

[10] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[11] Christos H. Papadimitriou,et al. Bounds for sorting by prefix reversal , 1979, Discret. Math..

[12] Ira Pohl,et al. Heuristic Search Viewed as Path Finding in a Graph , 1970, Artif. Intell..

[13] Marc G. Bellemare,et al. A Distributional Perspective on Reinforcement Learning , 2017, ICML.

[14] Richard E. Korf,et al. Iterative-Deepening-A*: An Optimal Admissible Tree Search , 1985, IJCAI.

[15] Anders Hald,et al. Statistical Theory with Engineering Applications , 1952 .

[16] G. Winskel. What Is Discrete Mathematics , 2007 .

[17] Patrik Haslum. Heuristics for Bounded-Cost Search , 2013, ICAPS.

[18] Malte Helmert,et al. Landmark Heuristics for the Pancake Problem , 2010, SOCS.

[19] Wheeler Ruml,et al. Learning Inadmissible Heuristics During Search , 2011, ICAPS.

[20] P. Haslum,et al. Cost-Length Tradeoff Heuristics for Bounded-Cost Search , 2017 .

[21] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .

[22] Wheeler Ruml,et al. Bounded Suboptimal Search: A Direct Approach Using Inadmissible Estimates , 2011, IJCAI.