论文信息 - COMPOSER: A Probabilistic Solution to the Utility Problem in Speed-Up Learning

COMPOSER: A Probabilistic Solution to the Utility Problem in Speed-Up Learning

In machine learning there is considerable interest in techniques which improve planning ability. Initial investigations have identified a wide variety of techniques to address this issue. Progress has been hampered by the utility problem, a basic tradeoff between the benefit of learned knowledge and the cost to locate and apply relevant knowledge. In this paper we describe the COMPOSER system which embodies a probabilistic solution to the utility problem. We outline the statistical foundations of our approach and compare it against four other approaches which appear in the literature.

Gerald DeJong | Jonathan Gratch | G. DeJong | J. Gratch

[1] A. Nádas. An Extension of a Theorem of Chow and Robbins on Sequential Confidence Intervals for the Mean , 1969 .

[2] Richard Fikes,et al. Learning and Executing Generalized Robot Plans , 1993, Artif. Intell..

[3] Tom M. Mitchell,et al. Models of Learning Systems. , 1979 .

[4] Steven Minton,et al. Selectively Generalizing Plans for Problem-Solving , 1985, IJCAI.

[5] M. Woodroofe. Nonlinear Renewal Theory in Sequential Analysis , 1987 .

[6] Mark S. Boddy,et al. An Analysis of Time-Dependent Planning , 1988, AAAI.

[7] Jaime G. Carbonell,et al. Learning effective search control knowledge: an explanation-based approach , 1988 .

[8] Mark Drummond,et al. Situated Control Rules , 1989, KR.

[9] Shaul Markovitch,et al. Utilization Filtering: A Method for Reducing the Inherent Harmfulness of Deductively Learned Knowledge , 1989, IJCAI.

[10] Oren Etzioni,et al. Why PRODIGY/EBL Works , 1990, AAAI.

[11] Devika Subramanian,et al. The Utility of EBL in Recursive Domain Theories , 1990, AAAI.

[12] Stanley Letovsky,et al. Operationality Criteria for Recursive Predicates , 1990, AAAI.

[13] Gerald DeJong,et al. A Hybrid Approach to Guaranteed Effective Control Strategies , 1991, ML.

[14] Gerald DeJong,et al. Utility Generalization and Composability Problems in Explanation-Based Learning. , 1991 .

[15] Paul E. Utgoff,et al. Two Kinds of Training Information For Evaluation Function Learning , 1991, AAAI.

[16] Gerald DeJong,et al. A Framework of Simplifications in Learning to Plan , 1992 .

[17] Tom M. Mitchell,et al. Learning by experimentation: acquiring and refining problem-solving heuristics , 1993 .

[18] Dale Schuurmans,et al. Probabilistic , 2019, 99 Variations on a Proof.