Targeting Specific Distributions of Trajectories in MDPs
暂无分享,去创建一个
David L. Roberts | Michael Mateas | Michael L. Littman | Mark J. Nelson | Charles Lee Isbell | M. Littman | C. Isbell | M. Mateas | D. Roberts | M. Nelson
[1] Joseph Bates,et al. Virtual Reality, Art, and Entertainment , 1992, Presence: Teleoperators & Virtual Environments.
[2] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[3] Joseph Bates,et al. Guiding interactive drama , 1997 .
[4] Yishay Mansour,et al. Approximate Planning in Large POMDPs via Reusable Trajectories , 1999, NIPS.
[5] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[6] Peter Stone,et al. A social reinforcement learning agent , 2001, AGENTS '01.
[7] Amos Storkey,et al. Advances in Neural Information Processing Systems 20 , 2007 .
[8] Michael Mateas,et al. Search-Based Drama Management in the Interactive Fiction Anchorhead , 2005, AIIDE.
[9] David L. Roberts,et al. Reinforcement learning for declarative optimization-based drama management , 2006, AAMAS '06.