Feature-based methods for large scale dynamic programming
[1] R. Bellman, et al. Functional Approximations and Dynamic Programming, 1959.
[2] Ward Whitt, et al. Approximations of Dynamic Programs, I, 1978, Math. Oper. Res.
[3] Thomas L. Morin, et al. Computational Advances in Dynamic Programming, 1978.
[4] P. Schweitzer, et al. Generalized polynomial approximations in Markovian decision processes, 1985.
[5] Richard E. Korf, et al. Planning as Search: A Quantitative Approach, 1987, Artif. Intell.
[6] John N. Tsitsiklis, et al. Parallel and distributed computation, 1989.
[7] D. Bertsekas, et al. Adaptive aggregation methods for infinite horizon dynamic programming, 1989.
[8] F. Girosi, et al. Networks for approximation and learning, 1990, Proc. IEEE.
[9] Gerald Tesauro, et al. Practical Issues in Temporal Difference Learning, 1992, Mach. Learn.
[10] Richard P. Lippmann, et al. LNKnet: Neural Network, Machine-Learning, and Statistical Software for Pattern Classification, 1993.
[11] Bhavik R. Bakshi, et al. Wave-net: a multiresolution, hierarchical neural network with localized learning, 1993.
[12] Michael I. Jordan, et al. MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES, 1996.
[13] Michael I. Jordan, et al. Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems, 1994, NIPS.
[14] Geoffrey J. Gordon. Stable Function Approximation in Dynamic Programming, 1995, ICML.
[15] Dimitri P. Bertsekas, et al. A Counterexample to Temporal Differences Learning, 1995, Neural Computation.
[16] Dimitri P. Bertsekas, et al. Dynamic Programming and Optimal Control, Two Volume Set, 1995.
[17] Andrew G. Barto, et al. Learning to Act Using Real-Time Dynamic Programming, 1995, Artif. Intell.