From Creatures of Habit to Goal-Directed Learners

Theoretical models distinguish two decision-making strategies that have been formalized in reinforcement-learning theory. A model-based strategy leverages a cognitive model of potential actions and their consequences to make goal-directed choices, whereas a model-free strategy evaluates actions based solely on their reward history. Research in adults has begun to elucidate the psychological mechanisms and neural substrates underlying these learning processes and factors that influence their relative recruitment. However, the developmental trajectory of these evaluative strategies has not been well characterized. In this study, children, adolescents, and adults performed a sequential reinforcement-learning task that enabled estimation of model-based and model-free contributions to choice. Whereas a model-free strategy was apparent in choice behavior across all age groups, a model-based strategy was absent in children, became evident in adolescents, and strengthened in adults. These results suggest that recruitment of model-based valuation systems represents a critical cognitive component underlying the gradual maturation of goal-directed behavior.

[1]  J. Piaget The construction of reality in the child , 1954 .

[2]  P. Udani,et al.  Morbidity and mortality. , 1962, The Indian journal of child health.

[3]  M. Nisan Delay of Gratification in Children: Personal versus Group Choices , 1976 .

[4]  B. Haynes,et al.  of Human and Rodent , 1983 .

[5]  A. Dickinson Actions and habits: the development of behavioural autonomy , 1985 .

[6]  P. Zelazo,et al.  An age-related dissociation between knowing rules and using them ☆ , 1996 .

[7]  J L Collins,et al.  Youth Risk Behavior Surveillance--United States, 1997. State and Local YRBSS Coordinators. , 1998, The Journal of school health.

[8]  M. Posner,et al.  Developing mechanisms of self-regulation , 2000, Development and Psychopathology.

[9]  E. Miller,et al.  An integrative theory of prefrontal cortex function. , 2001, Annual review of neuroscience.

[10]  Natasha Z. Kirkham,et al.  ARTICLE WITH PEER COMMENTARIES AND RESPONSE Helping children apply their knowledge to their behavior on a dimension-switching task , 2003 .

[11]  T. Klingberg,et al.  Increased prefrontal and parietal activity after training of working memory , 2004, Nature Neuroscience.

[12]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  Silvia A. Bunge,et al.  A Brain-Based Account of the Development of Rule Use in Childhood , 2006 .

[15]  A. Diamond The Early Development of Executive Functions. , 2006 .

[16]  Connie Lim,et al.  Youth risk behavior surveillance--United States, 2005. , 2006, The Journal of school health.

[17]  J. Russell,et al.  The control of instrumental action following outcome devaluation in young children aged between 1 and 4 years. , 2008, Journal of experimental psychology. General.

[18]  Christopher H. Chatham,et al.  Pupillometric and behavioral markers of a developmental shift in the temporal dynamics of cognitive control , 2009, Proceedings of the National Academy of Sciences.

[19]  L. Somerville,et al.  Developmental neurobiology of cognitive control and motivational systems , 2010, Current Opinion in Neurobiology.

[20]  B. Balleine,et al.  Human and Rodent Homologies in Action Control: Corticostriatal Determinants of Goal-Directed and Habitual Action , 2010, Neuropsychopharmacology.

[21]  Russell A. Poldrack,et al.  A unique adolescent response to reward prediction errors , 2010, Nature Neuroscience.

[22]  P. Dayan,et al.  Model-based influences on humans’ choices and striatal prediction errors , 2011, Neuron.

[23]  T. Robbins,et al.  The hippocampal–striatal axis in learning, prediction and goal-directed behavior , 2011, Trends in Neurosciences.

[24]  N. Daw,et al.  The ubiquity of model-based reinforcement learning , 2012, Current Opinion in Neurobiology.

[25]  Z. Kurth-Nelson,et al.  A theoretical account of cognitive effects in delay discounting , 2012, The European journal of neuroscience.

[26]  Michael X. Cohen,et al.  Striatum-medial prefrontal cortex connectivity predicts developmental changes in reinforcement learning. , 2012, Cerebral cortex.

[27]  Stephanie M. Carlson,et al.  Hot and Cool Executive Function in Childhood and Adolescence: Development and Plasticity , 2012 .

[28]  T. Braver The variable nature of cognitive control: a dual mechanisms framework , 2012, Trends in Cognitive Sciences.

[29]  Yuko Munakata,et al.  Developing Cognitive Control , 2012, Current directions in psychological science.

[30]  D. Shohamy,et al.  Preference by Association: How Memory Mechanisms in the Hippocampus Bias Decisions , 2012, Science.

[31]  Shu-Chen Li,et al.  Of goals and habits: age-related and individual differences in goal-directed decision-making , 2013, Front. Neurosci..

[32]  N. Turk-Browne,et al.  Mechanisms for widespread hippocampal involvement in cognition. , 2013, Journal of experimental psychology. General.

[33]  Michael J. Brammer,et al.  Neural and Psychological Maturation of Decision-making in Adolescence and Young Adulthood , 2013, Journal of Cognitive Neuroscience.

[34]  Alice Y. Chiang,et al.  Working-memory capacity protects model-based learning from stress , 2013, Proceedings of the National Academy of Sciences.

[35]  A. Markman,et al.  The Curse of Planning: Dissecting Multiple Reinforcement-Learning Systems by Taxing the Central Executive , 2013 .

[36]  Thomas H. B. FitzGerald,et al.  Disruption of Dorsolateral Prefrontal Cortex Decreases Model-Based in Favor of Model-free Control in Humans , 2013, Neuron.

[37]  F. Verbruggen,et al.  Banishing the Control Homunculi in Studies of Action Control and Behavior Change , 2014, Perspectives on psychological science : a journal of the Association for Psychological Science.

[38]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[39]  B. B. Doll,et al.  Experiential reward learning outweighs instruction prior to adulthood , 2015, Cognitive, affective & behavioral neuroscience.

[40]  Catherine A. Hartley,et al.  The neuroscience of adolescent decision-making , 2015, Current Opinion in Behavioral Sciences.

[41]  R. Dolan,et al.  Ventral striatal dopamine reflects behavioral and neural signatures of model-based control during sequential decision making , 2015, Proceedings of the National Academy of Sciences.

[42]  Christopher G. Lucas,et al.  When Younger Learners Can Be Better (or at Least More Open-Minded) Than Older Ones , 2015 .

[43]  N. Daw,et al.  Cognitive Control Predicts Use of Model-based Reinforcement Learning , 2014, Journal of Cognitive Neuroscience.