Advice Generation from Observed Execution: Abstract Markov Decision Process Learning
暂无分享,去创建一个
[1] Manuela Veloso,et al. An Empirical Study of Coaching , 2002, DARS.
[2] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..
[3] Daniel D. Suthers,et al. Automated Advice-Giving Strategies for Scientific Inquiry , 1996, Intelligent Tutoring Systems.
[4] Gregory Kuhlmann and Peter Stone and Justin Lallinger. The Champion UT Austin Villa 2003 Simulator Online Coach Team , 2004 .
[5] Manuela M. Veloso,et al. Fault Tolerant Planning: Toward Probabilistic Uncertainty Models in Symbolic Non-Deterministic Planning , 2004, ICAPS.
[6] Ubbo Visser,et al. Using Online Learning to Analyze the Opponent's Behavior , 2002, RoboCup.
[7] BoutilierCraig,et al. Abstraction and approximate decision-theoretic planning , 1997 .
[8] Paul J. Schweitzer,et al. Iterative Aggregation-Disaggregation Procedures for Discounted Semi-Markov Reward Processes , 1985, Oper. Res..
[9] Hiroaki Kitano,et al. The RoboCup Synthetic Agent Challenge 97 , 1997, IJCAI.
[10] Tamio Arai,et al. Distributed Autonomous Robotic Systems 3 , 1998 .
[11] Jude W. Shavlik,et al. Creating Advice-Taking Reinforcement Learners , 1998, Machine Learning.
[12] Manuela M. Veloso,et al. TTree: Tree-Based State Generalization with Temporally Abstract Actions , 2002, SARA.
[13] Craig Boutilier,et al. Abstraction and Approximate Decision-Theoretic Planning , 1997, Artif. Intell..
[14] Guy Shani,et al. An MDP-Based Recommender System , 2002, J. Mach. Learn. Res..
[15] Henry A. Kautz. A formal theory of plan recognition , 1987 .
[16] Henry A. Kautz,et al. Reasoning about plans , 1991, Morgan Kaufmann series in representation and reasoning.
[17] Ian Frank,et al. Soccer Server: A Tool for Research on Multiagent Systems , 1998, Appl. Artif. Intell..
[18] Jude W. Shavlik,et al. Creating advice-taking reinforcement learners , 1998 .
[19] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[20] Robert P. Goldman,et al. A Bayesian Model of Plan Recognition , 1993, Artif. Intell..
[21] Milind Tambe,et al. Automated assistants to aid humans in understanding team behaviors , 2000, AGENTS '00.