A Cooperation Online Reinforcement Learning Approach in Ant-Q
暂无分享,去创建一个
[1] Luca Maria Gambardella,et al. Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem , 1995, ICML.
[2] Marco Dorigo,et al. An Investigation of some Properties of an "Ant Algorithm" , 1992, PPSN.
[3] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[4] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[5] Luca Maria Gambardella,et al. Solving symmetric and asymmetric TSPs by ant colonies , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.
[6] T. Stützle,et al. MAX-MIN Ant System and local search for the traveling salesman problem , 1997, Proceedings of 1997 IEEE International Conference on Evolutionary Computation (ICEC '97).
[7] SeungGwan Lee. Multiagent Reinforcement Learning Algorithm Using Temporal Difference Error , 2005, ISNN.
[8] Claude-Nicolas Fiechter,et al. Efficient reinforcement learning , 1994, COLT '94.
[9] Etienne Barnard,et al. Temporal-difference methods and Markov models , 1993, IEEE Trans. Syst. Man Cybern..
[10] Luca Maria Gambardella,et al. Ant colony system: a cooperative learning approach to the traveling salesman problem , 1997, IEEE Trans. Evol. Comput..
[11] Marco Dorigo,et al. Distributed Optimization by Ant Colonies , 1992 .
[12] Luca Maria Gambardella,et al. A Study of Some Properties of Ant-Q , 1996, PPSN.
[13] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[14] Marco Dorigo,et al. Ant system: optimization by a colony of cooperating agents , 1996, IEEE Trans. Syst. Man Cybern. Part B.
[15] TaeChoong Chung,et al. A Reinforcement Learning Algorithm Using Temporal Difference Error in Ant Model , 2005, IWANN.