Online Resource Allocation Using Decompositional Reinforcement Learning
暂无分享,去创建一个
[1] Satinder P. Singh,et al. How to Dynamically Merge Markov Decision Processes , 1997, NIPS.
[2] Kee-Eung Kim,et al. Solving Very Large Weakly Coupled Markov Decision Processes , 1998, AAAI/IAAI.
[3] Michael Kearns,et al. Efficient Reinforcement Learning in Factored MDPs , 1999, IJCAI.
[4] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[5] Erol Gelenbe. System Performance Evaluation: Methodologies and Applications , 2000 .
[6] Mark S. Squillante,et al. Internet traffic: periodicity, tail behavior, and performance implications , 2000 .
[7] Michail G. Lagoudakis,et al. Coordinated Reinforcement Learning , 2002, ICML.
[8] Jeffrey O. Kephart,et al. The Vision of Autonomic Computing , 2003, Computer.
[9] Stuart J. Russell,et al. Q-Decomposition for Reinforcement Learning Agents , 2003, ICML.
[10] Rajarshi Das,et al. Utility functions in autonomic systems , 2004 .
[11] Rajarshi Das,et al. Utility functions in autonomic systems , 2004, International Conference on Autonomic Computing, 2004. Proceedings..