Combined model-free and model-sensitive reinforcement learning in non-human primates
暂无分享,去创建一个
Peter Dayan | Bruno Miranda | Timothy E Behrens | Steven W. Kennerley | W. M. Nishantha Malalasekera | P. Dayan | S. Kennerley | B. Miranda | W. Malalasekera | T. Behrens | Bruno Miranda | W. M. N. Malalasekera
[1] Todd A. Hare,et al. Model-free or muddled models in the two-stage task? , 2019 .
[2] Richard S. Sutton,et al. Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.
[3] N. Daw,et al. Dopamine selectively remediates 'model-based' reward learning: a computational approach. , 2016, Brain : a journal of neurology.
[4] Peter Dayan,et al. Simple Plans or Sophisticated Habits? State, Transition and Learning Interactions in the Two-Step Task , 2015, bioRxiv.
[5] Mahesan Niranjan,et al. On-line Q-learning using connectionist systems , 1994 .
[6] A. Dickinson. Actions and habits: the development of behavioural autonomy , 1985 .
[7] M. Roesch,et al. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards , 2007, Nature Neuroscience.
[8] Kevin McCabe,et al. Neural signature of fictive learning signals in a sequential investment task , 2007, Proceedings of the National Academy of Sciences.
[9] P. Dayan. How to set the switches on this thing , 2012, Current Opinion in Neurobiology.
[10] A. Phillips. The macmillan company. , 1970, Analytical chemistry.
[11] Thomas L. Griffiths,et al. Rational metareasoning and the plasticity of cognitive control , 2018, PLoS Comput. Biol..
[12] N. Daw,et al. The ubiquity of model-based reinforcement learning , 2012, Current Opinion in Neurobiology.
[13] B. Balleine,et al. The Role of Learning in the Operation of Motivational Systems , 2002 .
[14] F. Cushman,et al. Habitual control of goal selection in humans , 2015, Proceedings of the National Academy of Sciences.
[15] J. Stevens,et al. Animal Intelligence , 1883, Nature.
[16] P. Dayan,et al. Goals and Habits in the Brain , 2013, Neuron.
[17] O. Woolpert. Biological Sciences , 1980, Nature.
[18] M. Botvinick,et al. Reduced model-based decision-making in schizophrenia. , 2016, Journal of abnormal psychology.
[19] N. Daw,et al. Characterizing a psychiatric symptom dimension related to deficits in goal-directed control , 2016, eLife.
[20] Dylan A. Simon,et al. Model-based choices involve prospective neural activity , 2015, Nature Neuroscience.
[21] Karl J. Friston,et al. Bayesian model selection for group studies , 2009, NeuroImage.
[22] P. Dayan,et al. Model-based influences on humans’ choices and striatal prediction errors , 2011, Neuron.
[23] A. Gelman. Scaling regression inputs by dividing by two standard deviations , 2008, Statistics in medicine.
[24] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[25] A. Villringer,et al. Lateral prefrontal model-based signatures are reduced in healthy individuals with high trait impulsivity , 2015, Translational Psychiatry.
[26] B. Balleine,et al. Motivational control of goal-directed action , 1994 .
[27] P. Dayan,et al. States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning , 2010, Neuron.
[28] Thomas H. B. FitzGerald,et al. Transcranial Direct Current Stimulation of Right Dorsolateral Prefrontal Cortex Does Not Affect Model-Based or Model-Free Reinforcement Learning in Humans , 2014, PloS one.
[29] Amir Dezfouli,et al. Speed/Accuracy Trade-Off between the Habitual and the Goal-Directed Processes , 2011, PLoS Comput. Biol..
[30] N. Daw,et al. Valence-dependent influence of serotonin depletion on model-based choice strategy , 2015, Molecular Psychiatry.
[31] Adam Johnson,et al. Neural Ensembles in CA3 Transiently Encode Paths Forward of the Animal at a Decision Point , 2007, The Journal of Neuroscience.
[32] Zeb Kurth-Nelson,et al. Model-Based Reasoning in Humans Becomes Automatic with Training , 2015, PLoS Comput. Biol..
[33] Giovanni Pezzulo,et al. The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation , 2013, Front. Psychol..
[34] B. Balleine,et al. Human and Rodent Homologies in Action Control: Corticostriatal Determinants of Goal-Directed and Habitual Action , 2010, Neuropsychopharmacology.
[35] P. Dayan,et al. Adaptive integration of habits into depth-limited planning defines a habitual-goal–directed spectrum , 2016, Proceedings of the National Academy of Sciences.
[36] Ulrik R Beierholm,et al. Dopamine Modulates Reward-Related Vigor , 2013, Neuropsychopharmacology.
[37] Shinsuke Shimojo,et al. Neural Computations Underlying Arbitration between Model-Based and Model-free Learning , 2013, Neuron.
[38] Wouter Kool,et al. When Does Model-Based Control Pay Off? , 2016, PLoS Comput. Biol..
[39] L. Deserno,et al. Model-Based and Model-Free Decisions in Alcohol Dependence , 2014, Neuropsychobiology.
[40] Kevin J. Miller,et al. Dorsal hippocampus contributes to model-based planning , 2017, Nature Neuroscience.
[41] N. Daw,et al. Variability in Dopamine Genes Dissociates Model-Based and Model-Free Reinforcement Learning , 2016, The Journal of Neuroscience.
[42] P. Dayan,et al. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.
[43] E. Vaadia,et al. Midbrain dopamine neurons encode decisions for future action , 2006, Nature Neuroscience.
[44] Wouter Kool,et al. Cost-Benefit Arbitration Between Multiple Reinforcement-Learning Systems , 2017, Psychological science.
[45] Andrew W. Moore,et al. Prioritized sweeping: Reinforcement learning with less data and less time , 2004, Machine Learning.
[46] G. Schwarz. Estimating the Dimension of a Model , 1978 .
[47] A. Markman,et al. Journal of Experimental Psychology : General Retrospective Revaluation in Sequential Decision Making : A Tale of Two Systems , 2012 .
[48] K. Katahira. The relation between reinforcement learning parameters and the influence of reinforcement history on choice behavior , 2015 .
[49] Peter Dayan,et al. Vigor in the Face of Fluctuating Rates of Reward: An Experimental Examination , 2011, Journal of Cognitive Neuroscience.
[50] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011 .
[51] Jonathan B. Dingwell,et al. Error Correction and the Structure of Inter-Trial Fluctuations in a Redundant Movement Task , 2016, PLoS Comput. Biol..
[52] Karl J. Friston,et al. Bayesian model selection for group studies — Revisited , 2014, NeuroImage.
[53] E. Tolman. Cognitive maps in rats and men. , 1948, Psychological review.
[54] Raymond J. Dolan,et al. Disentangling the Roles of Approach, Activation and Valence in Instrumental and Pavlovian Responding , 2011, PLoS Comput. Biol..
[55] P. Dayan,et al. Tonic dopamine: opportunity costs and the control of response vigor , 2007, Psychopharmacology.
[56] P. Glimcher,et al. JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR 2005, 84, 555–579 NUMBER 3(NOVEMBER) DYNAMIC RESPONSE-BY-RESPONSE MODELS OF MATCHING BEHAVIOR IN RHESUS MONKEYS , 2022 .
[57] Bernard W. Balleine,et al. Actions, Action Sequences and Habits: Evidence That Goal-Directed and Habitual Action Control Are Hierarchically Organized , 2013, PLoS Comput. Biol..
[58] K. Doya,et al. Validation of Decision-Making Models and Analysis of Decision Variables in the Rat Basal Ganglia , 2009, The Journal of Neuroscience.
[59] R. Dolan,et al. Dopamine Enhances Model-Based over Model-Free Choice Behavior , 2012, Neuron.
[60] P. Dayan,et al. The algorithmic anatomy of model-based evaluation , 2014, Philosophical Transactions of the Royal Society B: Biological Sciences.