Analysis about Efficiency of Indirect Media Communication on Multi-agent Cooperation Learning
暂无分享,去创建一个
Ruoying Sun | Gang Zhao | R. Sun | Gang Zhao
[1] Ruoying Sun,et al. An Accelerated k-Certainty Exploration Method , 1999 .
[2] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[3] Reda Alhajj,et al. Multiagent reinforcement learning using function approximation , 2000, IEEE Trans. Syst. Man Cybern. Part C.
[4] Thomas Stützle,et al. MAX-MIN Ant System , 2000, Future Gener. Comput. Syst..
[5] Marco Dorigo,et al. The hyper-cube framework for ant colony optimization , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[6] Gang Zhao,et al. Convergence of the Q-ae Learning on Deterministic MDPs and Its Efficiency on the Stochastic Environmnet , 2000 .
[7] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.
[8] Alex Alves Freitas,et al. Data mining with an ant colony optimization algorithm , 2002, IEEE Trans. Evol. Comput..
[9] Andrew G. Barto,et al. Improving Elevator Performance Using Reinforcement Learning , 1995, NIPS.
[10] Shigenobu Kobayashi,et al. k-Certainty Exploration Method: An Action Selector to Identify the Environment in Reinforcement Learning , 1997, Artif. Intell..
[11] Thomas Stützle,et al. A short convergence proof for a class of ant colony optimization algorithms , 2002, IEEE Trans. Evol. Comput..
[12] Agostino Poggi,et al. Multiagent Systems , 2006, Intelligenza Artificiale.
[13] Michael Sampels,et al. A MAX-MIN Ant System for the University Course Timetabling Problem , 2002, Ant Algorithms.
[14] C.C. White,et al. Dynamic programming and stochastic control , 1978, Proceedings of the IEEE.
[15] Dimitri P. Bertsekas,et al. Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems , 1996, NIPS.
[16] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[17] Gang Zhao,et al. Convergence of the Q-ae learning under deterministic MDPs and its efficiency under the stochastic environment , 2000, Smc 2000 conference proceedings. 2000 ieee international conference on systems, man and cybernetics. 'cybernetics evolving to systems, humans, organizations, and their complex interactions' (cat. no.0.
[18] Luca Maria Gambardella,et al. Ant colony system: a cooperative learning approach to the traveling salesman problem , 1997, IEEE Trans. Evol. Comput..
[19] Sati S. Sian,et al. Extending Learning to Multiple Agents: Issues and a Model for Multi-Agent Machine Learning (MA-ML) , 1991, EWSL.
[20] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[21] Vittorio Maniezzo,et al. The Ant System Applied to the Quadratic Assignment Problem , 1999, IEEE Trans. Knowl. Data Eng..