Paper information - Action-Model Acquisition from Noisy Plan Traces

Action-Model Acquisition from Noisy Plan Traces

There is increasing awareness in the planning community that the burden of specifying complete domain models is too high, which impedes the applicability of planning technology in many real-world domains. Although many learning approaches help create domain models automatically, they all assume that the plan traces (training data) are correct. In this paper, we aim to remove this assumption and allow plan traces to contain noise. Compared to collecting a large number of correct plan traces, it is much easier to collect noisy ones; for example, we can directly use sensors to collect them. We propose a novel approach that learns action models from noisy plan traces. We create a set of random variables to capture the possible correct plan traces behind the observed noisy ones, and build a graphical model to describe the physics of the domain. We then learn the parameters of the graphical model and acquire the domain model from the learnt parameters. In our experiments, we show empirically that our approach is effective in several planning domains.
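To make the task concrete, the following is a minimal sketch of acquiring a STRIPS-style action model from noisy traces. It is not the graphical-model method summarized above: it substitutes a much simpler frequency-thresholding heuristic, with no latent variables for the correct traces. The trace format, the proposition names, the function `learn_action_model`, and the `threshold` parameter are all assumptions made for illustration.

```python
from collections import defaultdict

def learn_action_model(traces, threshold=0.7):
    """Estimate preconditions and effects by frequency thresholding.

    traces: list of (state_before, action, state_after) triples, where each
    state is a set of proposition names (a hypothetical trace format).
    A proposition is kept if it is observed in at least `threshold` of the
    occurrences of the action, so occasional noisy observations are ignored.
    """
    counts = defaultdict(int)
    pre = defaultdict(lambda: defaultdict(int))
    add = defaultdict(lambda: defaultdict(int))
    dele = defaultdict(lambda: defaultdict(int))

    for before, action, after in traces:
        counts[action] += 1
        for p in before:            # true before the action: candidate precondition
            pre[action][p] += 1
        for p in after - before:    # became true: candidate add effect
            add[action][p] += 1
        for p in before - after:    # became false: candidate delete effect
            dele[action][p] += 1

    def keep(table, n):
        return {p for p, c in table.items() if c / n >= threshold}

    return {
        action: {
            "pre": keep(pre[action], n),
            "add": keep(add[action], n),
            "del": keep(dele[action], n),
        }
        for action, n in counts.items()
    }

# Hypothetical blocks-world traces for a single action.
CLEAN = ({"clear_a", "handempty", "ontable_a"}, "pickup_a", {"holding_a"})
# Noisy observation: "handempty" is missed before and wrongly seen after.
NOISY = ({"clear_a", "ontable_a"}, "pickup_a", {"holding_a", "handempty"})
TRACES = [CLEAN, CLEAN, CLEAN, NOISY]

MODEL = learn_action_model(TRACES)
print(MODEL["pickup_a"])
```

With three clean traces and one noisy one, `handempty` is still recovered as a precondition and delete effect (seen in 3 of 4 traces), while its spurious appearance as an add effect (1 of 4) falls below the threshold. A fixed threshold is a crude stand-in for the learned parameters of a probabilistic model and would not cope with systematic noise.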

Subbarao Kambhampati | Hankui Zhuo
