A Distributed Q-Learning Approach for Variable Attention to Multiple Critics
暂无分享,去创建一个
Majid Nili Ahmadabadi | Masoud Asadpour | Maryam S. Mirian | Maryam Tavakol | M. N. Ahmadabadi | M. Asadpour | M. Mirian | M. Tavakol
[1] Y. Niv. Reinforcement learning in the brain , 2009 .
[2] Thomas T. Hills,et al. Model-Based Reinforcement Learning as Cognitive Search: Neurocomputational Theories , 2012 .
[3] Jong-Hwan Kim,et al. Modular Q-learning based multi-agent cooperation for robot soccer , 2001, Robotics Auton. Syst..
[4] Nathaniel D. Daw,et al. Environmental statistics and the trade-off between model-based and TD learning in humans , 2011, NIPS.
[5] Christian R. Shelton,et al. Balancing Multiple Sources of Reward in Reinforcement Learning , 2000, NIPS.
[6] Peter Raicevic. Parallel reinforcement learning using multiple reward signals , 2006, Neurocomputing.
[7] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[8] Dana H. Ballard,et al. Multiple-Goal Reinforcement Learning with Modular Sarsa(0) , 2003, IJCAI.
[9] N. Daw. Model-based reinforcement learning as cognitive search : Neurocomputational theories , 2012 .
[10] Michael Mateas,et al. On the Difficulty of Modular Reinforcement Learning for Real-World Partial Programming , 2006, AAAI.
[11] Mitsuo Kawato,et al. Inter-module credit assignment in modular reinforcement learning , 2003, Neural Networks.
[12] P. Dayan,et al. States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning , 2010, Neuron.
[13] Csaba Szepesvári,et al. Algorithms for Reinforcement Learning , 2010, Synthesis Lectures on Artificial Intelligence and Machine Learning.
[14] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[15] P. Glimcher,et al. Midbrain Dopamine Neurons Encode a Quantitative Reward Prediction Error Signal , 2005, Neuron.
[16] P. Dayan,et al. Decision theory, reinforcement learning, and the brain , 2008, Cognitive, affective & behavioral neuroscience.