Instructional control of reinforcement learning: A behavioral and neurocomputational investigation

[1]  W. Brown Animal Intelligence: Experimental Studies , 1912, Nature.

[2]  R. M. Elliott,et al.  Behavior of Organisms , 1991 .

[3]  W. Estes Effects of competing reactions on the conditioning curve for bar pressing. , 1950, Journal of experimental psychology.

[4]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[5]  J. E. Mazur,et al.  Learning and Behavior , 1966 .

[6]  Arnold Kaufman,et al.  SOME EFFECTS OF INSTRUCTIONS ON HUMAN OPERANT BEHAVIOR. , 1966 .

[7]  H. Akaike A new look at the statistical model identification , 1974 .

[8]  M Galizio,et al.  Contingency-shaped and rule-governed behavior: instructional control of human loss avoidance. , 1979, Journal of the experimental analysis of behavior.

[9]  A. Brownstein,et al.  Rule-governed behavior and sensitivity to changing consequences of responding. , 1986, Journal of the experimental analysis of behavior.

[10]  S. Hayes Rule-Governed Behavior , 1989 .

[11]  S. Hayes Rule-governed behavior : cognition, contingencies, and instructional control , 1989 .

[12]  R. Nosofsky,et al.  Rules and exemplars in categorization, identification, and recognition. , 1989, Journal of experimental psychology. Learning, memory, and cognition.

[13]  Steven C. Hayes,et al.  Rule Governance: Basic Behavioral Research and Applied Implications , 1993 .

[14]  A. Neal,et al.  Instance-based categorization: Automatic versus intentional forms of retrieval , 1995, Memory & cognition.

[15]  Garrison W. Cottrell,et al.  A Connectionist Model Of Instruction Following , 1995 .

[16]  P. Dayan,et al.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[17]  S. Sloman The empirical case for two systems of reasoning. , 1996 .

[18]  Padraic Monaghan,et al.  Proceedings of the 23rd annual conference of the cognitive science society , 2001 .

[19]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[20]  P. Greengard,et al.  Bidirectional Regulation of DARPP-32 Phosphorylation by Dopamine , 1997, The Journal of Neuroscience.

[21]  Gregory Ashby,et al.  A neuropsychological theory of multiple systems in category learning. , 1998, Psychological review.

[22]  A. Graybiel The Basal Ganglia and Chunking of Action Repertoires , 1998, Neurobiology of Learning and Memory.

[23]  Stuart J. Russell,et al.  Bayesian Q-Learning , 1998, AAAI/IAAI.

[24]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[25]  Individual Differences in Exemplar-Based Interference During Instructed Category Learning , 2000 .

[26]  R. O’Reilly,et al.  Computational Explorations in Cognitive Neuroscience: Understanding the Mind by Simulating the Brain , 2000 .

[27]  T. Sejnowski,et al.  Neurocomputational models of working memory , 2000, Nature Neuroscience.

[28]  L. R Gleitman,et al.  Proceedings of the twenty-second annual conference of the cognitive science society , 2000 .

[29]  J. Driver,et al.  Control of Cognitive Processes: Attention and Performance XVIII , 2000 .

[30]  Michael J. Frank,et al.  Interactions between frontal cortex and basal ganglia in working memory: A computational model , 2001, Cognitive, affective & behavioral neuroscience.

[31]  E. Miller,et al.  An integrative theory of prefrontal cortex function. , 2001, Annual review of neuroscience.

[32]  L. Brooks,et al.  Specializing the operation of an explicit rule , 1991 .

[33]  R. O’Reilly,et al.  Conjunctive representations in learning and memory: principles of cortical and hippocampal function. , 2001, Psychological review.

[34]  William M. Baum,et al.  Understanding Behaviorism: Behavior, Culture, and Evolution , 2003 .

[35]  R. Poldrack,et al.  Competition among multiple memory systems: converging evidence from animal and human brain studies , 2003, Neuropsychologia.

[36]  E. Miller,et al.  Neural circuits subserving the retrieval and maintenance of abstract rules. , 2003, Journal of neurophysiology.

[37]  D. Kahneman A perspective on judgment and choice: mapping bounded rationality. , 2003, The American psychologist.

[38]  Samuel M. McClure,et al.  Temporal Prediction Errors in a Passive Learning Task Activate Human Striatum , 2003, Neuron.

[39]  Karl J. Friston,et al.  Temporal Difference Models and Reward-Related Learning in the Human Brain , 2003, Neuron.

[40]  M. Delgado,et al.  Dorsal striatum responses to reward and punishment: Effects of valence and magnitude manipulations , 2003, Cognitive, affective & behavioral neuroscience.

[41]  E. Miller,et al.  From rule to response: neuronal processes in the premotor and prefrontal cortex. , 2003, Journal of neurophysiology.

[42]  Jonathan D. Cohen,et al.  The Neural Basis of Economic Decision-Making in the Ultimatum Game , 2003, Science.

[43]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[44]  Karl J. Friston,et al.  Dissociable Roles of Ventral and Dorsal Striatum in Instrumental Conditioning , 2004, Science.

[45]  Samuel M. McClure,et al.  Separate Neural Systems Value Immediate and Delayed Monetary Rewards , 2004, Science.

[46]  Michael J. Frank,et al.  Hippocampus, cortex, and basal ganglia: Insights from computational models of complementary learning systems , 2004, Neurobiology of Learning and Memory.

[47]  Michael J. Frank,et al.  By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism , 2004, Science.

[48]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[49]  R. Hertwig,et al.  Decisions from Experience and the Effect of Rare Events in Risky Choice , 2004, Psychological science.

[50]  Peter Praamstra,et al.  The basal ganglia and inhibitory mechanisms in response selection: evidence from subliminal priming of motor responses in Parkinson's disease. , 2004, Brain : a journal of neurology.

[51]  Jonathan D. Cohen,et al.  Prefrontal cortex and flexible cognitive control: rules without symbols. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Michael J. Frank,et al.  Error-Related Negativity Predicts Reinforcement Learning and Conflict Biases , 2005, Neuron.

[53]  W. T. Maddox,et al.  Cortical and subcortical brain regions involved in rule-based category learning , 2005, Neuroreport.

[54]  K. Doya,et al.  Representation of Action-Specific Reward Values in the Striatum , 2005, Science.

[55]  E. Miller,et al.  Different time courses of learning-related activity in the prefrontal cortex and striatum , 2005, Nature.

[56]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[57]  Michael J. Frank,et al.  Dynamic Dopamine Modulation in the Basal Ganglia: A Neurocomputational Account of Cognitive Deficits in Medicated and Nonmedicated Parkinsonism , 2005, Journal of Cognitive Neuroscience.

[58]  P. Glimcher,et al.  JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR 2005, 84, 555–579 NUMBER 3(NOVEMBER) DYNAMIC RESPONSE-BY-RESPONSE MODELS OF MATCHING BEHAVIOR IN RHESUS MONKEYS , 2022 .

[59]  Jonathan D. Cohen,et al.  Neuroeconomics: cross-currents in research on decision-making , 2006, Trends in Cognitive Sciences.

[60]  H. Yin,et al.  The role of the basal ganglia in habit formation , 2006, Nature Reviews Neuroscience.

[61]  P. Dayan,et al.  Cortical substrates for exploratory decisions in humans , 2006, Nature.

[62]  Michael J. Frank,et al.  Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia , 2006, Neural Computation.

[63]  Michael J. Frank,et al.  A mechanistic account of striatal dopamine function in human cognition: psychopharmacological studies with cabergoline and haloperidol. , 2006, Behavioral neuroscience.

[64]  J. O'Doherty,et al.  The Role of the Ventromedial Prefrontal Cortex in Abstract State-Based Inference during Decision Making in Humans , 2006, The Journal of Neuroscience.

[65]  Michael J. Frank,et al.  Hold your horses: A dynamic computational role for the subthalamic nucleus in decision making , 2006, Neural Networks.

[66]  M. D’Esposito,et al.  Reversal learning in Parkinson's disease depends on medication status and outcome valence , 2006, Neuropsychologia.

[67]  M. Frank,et al.  Anatomy of a decision: striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal. , 2006, Psychological review.

[68]  Garrison W. Cottrell,et al.  In Search Of Articulated Attractors , 2007 .

[69]  J. O'Doherty,et al.  Model‐Based fMRI and Its Application to Reward Learning and Decision Making , 2007, Annals of the New York Academy of Sciences.

[70]  N. Daw,et al.  Reinforcement Learning Signals in the Human Striatum Distinguish Learners from Nonlearners during Reward-Based Decision Making , 2007, The Journal of Neuroscience.

[71]  M. Reuter,et al.  Genetically Determined Differences in Learning from Errors , 2007, Science.

[72]  Michael X. Cohen,et al.  Behavioral / Systems / Cognitive Reinforcement Learning Signals Predict Future Decisions , 2007 .

[73]  Michael J. Frank,et al.  Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning , 2007, Proceedings of the National Academy of Sciences.

[74]  W. Schultz Multiple dopamine functions at different time courses. , 2007, Annual review of neuroscience.

[75]  W. T. Maddox,et al.  Neural correlates of rule-based and information-integration visual category learning. , 2006, Cerebral cortex.

[76]  T. Braver,et al.  Explaining the many varieties of working memory variation: Dual mechanisms of cognitive control. , 2007 .

[77]  J. Wallis,et al.  Neuroscience of Rule-Guided Behavior , 2007 .

[78]  Michael J. Frank,et al.  Testing Computational Models of Dopamine and Noradrenaline Dysfunction in Attention Deficit/Hyperactivity Disorder , 2007, Neuropsychopharmacology.

[79]  Colin Camerer,et al.  Experienced-Weighted Attraction Learning in Normal Form Games , 2007 .

[80]  Kevin McCabe,et al.  Neural signature of fictive learning signals in a sequential investment task , 2007, Proceedings of the National Academy of Sciences.

[81]  Michael J. Frank,et al.  Hold Your Horses: Impulsivity, Deep Brain Stimulation, and Medication in Parkinsonism , 2007, Science.

[82]  Jonathan D. Cohen,et al.  On the Control of Control: The Role of Dopamine in Regulating Prefrontal Function and Working Memory , 2007 .

[83]  A. Grace,et al.  The dopamine system and the pathophysiology of schizophrenia: a basic science perspective. , 2007, International review of neurobiology.

[84]  Silvia A. Bunge,et al.  Neural representations used to specify action , 2008 .

[85]  A. Grace,et al.  Dopamine modulation of hippocampal-prefrontal cortical interaction drives memory-guided behavior. , 2008, Cerebral cortex.

[86]  J. Kruschke Bayesian approaches to associative learning: From passive to active learning , 2008, Learning & behavior.

[87]  Andrew R. A. Conway,et al.  Variation in working memory , 2008 .

[88]  Richard Gonzalez,et al.  Computational Models for the Combination of Advice and Individual Learning , 2009, Cogn. Sci..

[89]  Jonathan D. Cohen,et al.  Prefrontal Cortex and the Flexibility of Cognitive Control : Rules Without Symbols , 2022 .