Imbalanced Decision Hierarchy in Addicts Emerging from Drug-Hijacked Dopamine Spiraling Circuit

Despite explicitly wanting to quit, long-term addicts find themselves powerless to resist drugs, despite knowing that drug-taking may be a harmful course of action. Such inconsistency between the explicit knowledge of negative consequences and the compulsive behavioral patterns represents a cognitive/behavioral conflict that is a central characteristic of addiction. Neurobiologically, differential cue-induced activity in distinct striatal subregions, as well as the dopamine connectivity spiraling from ventral striatal regions to the dorsal regions, play critical roles in compulsive drug seeking. However, the functional mechanism that integrates these neuropharmacological observations with the above-mentioned cognitive/behavioral conflict is unknown. Here we provide a formal computational explanation for the drug-induced cognitive inconsistency that is apparent in the addicts' “self-described mistake”. We show that addictive drugs gradually produce a motivational bias toward drug-seeking at low-level habitual decision processes, despite the low abstract cognitive valuation of this behavior. This pathology emerges within the hierarchical reinforcement learning framework when chronic exposure to the drug pharmacologically produces pathologicaly persistent phasic dopamine signals. Thereby the drug hijacks the dopaminergic spirals that cascade the reinforcement signals down the ventro-dorsal cortico-striatal hierarchy. Neurobiologically, our theory accounts for rapid development of drug cue-elicited dopamine efflux in the ventral striatum and a delayed response in the dorsal striatum. Our theory also shows how this response pattern depends critically on the dopamine spiraling circuitry. Behaviorally, our framework explains gradual insensitivity of drug-seeking to drug-associated punishments, the blocking phenomenon for drug outcomes, and the persistent preference for drugs over natural rewards by addicts. The model suggests testable predictions and beyond that, sets the stage for a view of addiction as a pathology of hierarchical decision-making processes. This view is complementary to the traditional interpretation of addiction as interaction between habitual and goal-directed decision systems.

[1]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[2]  L. Kamin Predictability, surprise, attention, and conditioning , 1967 .

[3]  B. Campbell,et al.  Punishment and aversive behavior , 1969 .

[4]  Narcotics Anonymous Narcotics Anonymous. , 1985, JAMA.

[5]  G. E. Alexander,et al.  Parallel organization of functionally segregated circuits linking basal ganglia and cortex. , 1986, Annual review of neuroscience.

[6]  G. Di Chiara,et al.  Drugs abused by humans preferentially increase synaptic dopamine concentrations in the mesolimbic system of freely moving rats. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[7]  G. E. Alexander,et al.  Basal ganglia-thalamocortical circuits: parallel substrates for motor, oculomotor, "prefrontal" and "limbic" functions. , 1990, Progress in brain research.

[8]  K. Preston,et al.  The reinforcing and subjective effects of morphine in post-addicts: a dose-response study. , 1991, The Journal of pharmacology and experimental therapeutics.

[9]  S. Heilig Addiction: From Biology to Drug Policy , 1994 .

[10]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[11]  G. Koob,et al.  The Neurobiology of Addiction , 1997, Alcohol health and research world.

[12]  Edward K. Vogel,et al.  The capacity of visual working memory for features and conjunctions , 1997, Nature.

[13]  Nikolaus R. McFarland,et al.  Striatonigrostriatal Pathways in Primates Form an Ascending Spiral from the Shell to the Dorsolateral Striatum , 2000, The Journal of Neuroscience.

[14]  Kenji Doya,et al.  Metalearning and neuromodulation , 2002, Neural Networks.

[15]  E. Koechlin,et al.  The Architecture of Cognitive Control in the Human Prefrontal Cortex , 2003, Science.

[16]  R. Hu Diagnostic and Statistical Manual of Mental Disorders (DSM-IV) , 2003 .

[17]  S. Haber The primate basal ganglia: parallel and integrative networks , 2003, Journal of Chemical Neuroanatomy.

[18]  B. Everitt,et al.  Drug Seeking Becomes Compulsive After Prolonged Cocaine Self-Administration , 2004, Science.

[19]  J. Swanson,et al.  Dopamine in drug abuse and addiction: results from imaging studies and treatment implications , 2004, Molecular Psychiatry.

[20]  A. Redish,et al.  Addiction as a Computational Process Gone Awry , 2004, Science.

[21]  David Belin,et al.  Evidence for Addiction-like Behavior in the Rat , 2004, Science.

[22]  T. Robbins,et al.  Neural systems of reinforcement for drug addiction: from actions to habits to compulsion , 2005, Nature Neuroscience.

[23]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[24]  B. Everitt,et al.  Involvement of the Dorsal Striatum in Cue-Controlled Cocaine Seeking , 2005, The Journal of Neuroscience.

[25]  N. Volkow,et al.  The neural basis of addiction: a pathology of motivation and choice. , 2005, The American journal of psychiatry.

[26]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[27]  N. Volkow,et al.  Cocaine Cues and Dopamine in Dorsal Striatum: Mechanism of Craving in Cocaine Addiction , 2006, The Journal of Neuroscience.

[28]  C. Lüscher,et al.  The Mechanistic Classification of Addictive Drugs , 2006, PLoS medicine.

[29]  Mitsuo Kawato,et al.  Heterarchical reinforcement-learning model for integration of multiple cortico-striatal loops: fMRI examination in stimulus-action-reward association learning , 2006, Neural Networks.

[30]  Robert H Mach,et al.  PET imaging of dopamine D2 receptors during chronic cocaine self-administration in monkeys , 2006, Nature Neuroscience.

[31]  L. Panlilio,et al.  Blocking of conditioning to a cocaine-paired stimulus: Testing the hypothesis that cocaine perpetually produces a signal of larger-than-expected reward , 2007, Pharmacology Biochemistry and Behavior.

[32]  B. Everitt,et al.  Cocaine Seeking Habits Depend upon Dopamine-Dependent Serial Connectivity Linking the Ventral with the Dorsal Striatum , 2008, Neuron.

[33]  Y. Niv,et al.  Silencing the Critics: Understanding the Effects of Cocaine Sensitization on Dorsolateral and Ventral Striatum in the Context of an Actor/Critic Model , 2008, Front. Neurosci..

[34]  M. Botvinick Hierarchical models of behavior and prefrontal function , 2008, Trends in Cognitive Sciences.

[35]  Adam Johnson,et al.  Computing motivation: Incentive salience boosts of drug or appetite states , 2008, Behavioral and Brain Sciences.

[36]  M. Frank,et al.  Do substantia nigra dopaminergic neurons differentiate between reward and punishment? , 2009, Journal of molecular cell biology.

[37]  M. Botvinick,et al.  Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective , 2009, Cognition.

[38]  Caro Lucas,et al.  A Neurocomputational Model for Cocaine Addiction , 2009, Neural Computation.

[39]  O. Hikosaka,et al.  Two types of dopamine neuron distinctly convey positive and negative motivational signals , 2009, Nature.

[40]  P. Dayan Dopamine, reinforcement learning, and addiction. , 2009, Pharmacopsychiatry.

[41]  M. D’Esposito,et al.  Is the rostro-caudal axis of the frontal lobe hierarchical? , 2009, Nature Reviews Neuroscience.

[42]  Jeffrey W. Cooney,et al.  Hierarchical cognitive control deficits following damage to the human frontal lobe , 2009, Nature Neuroscience.

[43]  A. Dickinson,et al.  Parallel and interactive learning processes within the basal ganglia: Relevance for the understanding of addiction , 2009, Behavioural Brain Research.

[44]  Reinout W Wiers,et al.  Implicit cognition and addiction: a tool for explaining paradoxical behavior. , 2010, Annual review of clinical psychology.

[45]  F. Telang,et al.  Liking and wanting of drug and non-drug rewards in active cocaine users: the STRAP-R questionnaire , 2010, Journal of psychopharmacology.

[46]  Caro Lucas,et al.  Individual Differences in Nucleus Accumbens Dopamine Receptors Predict Development of Addiction-Like Behavior: A Computational Approach , 2010, Neural Computation.

[47]  E. Walker,et al.  Diagnostic and Statistical Manual of Mental Disorders , 2013 .

[48]  Anne E Carpenter,et al.  Neuron-type specific signals for reward and punishment in the ventral tegmental area , 2011, Nature.

[49]  B. Everitt,et al.  Hierarchical recruitment of phasic dopamine signaling in the striatum during the progression of cocaine use , 2012, Proceedings of the National Academy of Sciences.

[50]  B. Balleine,et al.  Habits, action sequences and reinforcement learning , 2012, The European journal of neuroscience.

[51]  Janet B W Williams,et al.  Diagnostic and Statistical Manual of Mental Disorders , 2013 .