Efficient Reinforcement Learning in Factored MDPs
暂无分享,去创建一个
[1] R. Dobrushin. Central Limit Theorem for Nonstationary Markov Chains. II , 1956 .
[2] T. Lindvall. Lectures on the Coupling Method , 1992 .
[3] Mark Jerrum,et al. Polynomial-Time Approximation Algorithms for the Ising Model , 1990, SIAM J. Comput..
[4] Stuart J. Russell,et al. The BATmobile: Towards a Bayesian Automated Taxi , 1995, IJCAI.
[5] Kee-Eung Kim,et al. Solving Very Large Weakly Coupled Markov Decision Processes , 1998, AAAI/IAAI.
[6] Xavier Boyen,et al. Tractable Inference for Complex Stochastic Processes , 1998, UAI.
[7] Daphne Koller,et al. Computing Factored Value Functions for Policies in Structured MDPs , 1999, IJCAI.
[8] Craig Boutilier,et al. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage , 1999, J. Artif. Intell. Res..