Bayesian Policy Search with Policy Priors
暂无分享,去创建一个
Leslie Pack Kaelbling | Joshua B. Tenenbaum | Noah D. Goodman | Daniel M. Roy | David Wingate | J. Tenenbaum | L. Kaelbling | D. Wingate
[1] Ronald A. Howard,et al. Dynamic Programming and Markov Processes , 1960 .
[2] Verzekeren Naar Sparen,et al. Cambridge , 1969, Humphrey Burton: In My Own Time.
[3] Nils J. Nilsson,et al. Artificial Intelligence , 1974, IFIP Congress.
[4] R. Fildes. Journal of the American Statistical Association : William S. Cleveland, Marylyn E. McGill and Robert McGill, The shape parameter for a two variable graph 83 (1988) 289-300 , 1989 .
[5] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[6] Doina Precup,et al. Theoretical Results on Reinforcement Learning with Temporally Abstract Options , 1998, ECML.
[7] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[8] Jim Pitman,et al. Poisson–Dirichlet and GEM Invariant Distributions for Split-and-Merge Transformations of an Interval Partition , 2002, Combinatorics, Probability and Computing.
[9] M. Mitzenmacher,et al. Probability and Computing: Events and Probability , 2005 .
[10] Michael Mitzenmacher,et al. Probability And Computing , 2005 .
[11] Michael I. Jordan,et al. Hierarchical Dirichlet Processes , 2006 .
[12] Marc Toussaint,et al. Probabilistic inference for solving (PO) MDPs , 2006 .
[13] Andrea Bonarini,et al. Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods , 2007, NIPS.
[14] Matthew Botvinick,et al. Goal-directed decision making in prefrontal cortex: a computational framework , 2008, NIPS.
[15] David Silver,et al. Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Achieving Master Level Play in 9 × 9 Computer Go , 2022 .
[16] Marc Toussaint,et al. Hierarchical POMDP Controller Optimization by Likelihood Maximization , 2008, UAI.
[17] Noah D. Goodman,et al. Learning a theory of causality. , 2011, Psychological review.
[18] N. Jimpitma. Poisson – Dirichlet and GEM Invariant Distributions for Split-and-Merge Transformations of an Interval Partition , 2022 .