DQL: A New Updating Strategy for Reinforcement Learning Based on Q-Learning
暂无分享,去创建一个
[1] D. J. Smith,et al. A Study of Permutation Crossover Operators on the Traveling Salesman Problem , 1987, ICGA.
[2] C. Watkins. Learning from delayed rewards , 1989 .
[3] Marco Dorigo,et al. Optimization, Learning and Natural Algorithms , 1992 .
[4] Michael L. Littman,et al. A Distributed Reinforcement Learning Scheme for Network Routing , 1993 .
[5] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.
[6] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[7] G. Reinelt. The traveling salesman: computational solutions for TSP applications , 1994 .
[8] Dan Boneh,et al. On genetic algorithms , 1995, COLT '95.
[9] Luca Maria Gambardella,et al. Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem , 1995, ICML.
[10] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.
[11] Michael P. Wellman,et al. Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.
[12] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[13] Craig Boutilier,et al. Sequential Optimality and Coordination in Multiagent Systems , 1999, IJCAI.
[14] Craig Boutilier,et al. Implicit Imitation in Multiagent Reinforcement Learning , 1999, ICML.
[15] Eduardo F. Morales,et al. A New Distributed Reinforcement Learning Algorithm for Multiple Objective Optimization Problems , 2000, IBERAMIA-SBIA.
[16] Eduardo F. Morales,et al. A New Approach for the Solution of Multiple Objective Optimization Problems Based on Reinforcement Learning , 2000, MICAI.