Instrumental Uncertainty as a Determinant of Behavior Under Interval Schedules of Reinforcement

Interval schedules of reinforcement are known to generate habitual behavior, the performance of which is less sensitive to revaluation of the earned reward and to alterations in the action-outcome contingency. Here we report results from experiments using different types of interval schedules of reinforcement in mice to assess the effect of uncertainty, in the time of reward availability, on habit formation. After limited training, lever pressing under fixed interval (FI, low interval uncertainty) or random interval schedules (RI, higher interval uncertainty) was sensitive to devaluation, but with more extended training, performance of animals trained under RI schedules became more habitual, i.e. no longer sensitive to devaluation, whereas performance of those trained under FI schedules remained goal-directed. When the press-reward contingency was reversed by omitting reward after pressing but presenting reward in the absence of pressing, lever pressing in mice previously trained under FI decreased more rapidly than that of mice trained under RI schedules. Further analysis revealed that action-reward contiguity is significantly reduced in lever pressing under RI schedules, whereas action-reward correlation is similar for the different schedules. Thus the extent of goal-directedness could vary as a function of uncertainty about the time of reward availability. We hypothesize that the reduced action-reward contiguity found in behavior generated under high uncertainty is responsible for habit formation.

[1]  H. Yin,et al.  Genetic Deletion of A2A Adenosine Receptors in the Striatum Selectively Impairs Habit Formation , 2009, The Journal of Neuroscience.

[2]  B. Balleine,et al.  Reward‐guided learning beyond dopamine in the nucleus accumbens: the integrative functions of cortico‐basal ganglia networks , 2008, The European journal of neuroscience.

[3]  B. Balleine,et al.  Calculating Consequences: Brain Systems That Encode the Causal Effects of Actions , 2008, The Journal of Neuroscience.

[4]  R. Costa,et al.  Endocannabinoid Signaling is Critical for Habit Formation , 2007, Frontiers in integrative neuroscience.

[5]  J. Horvitz,et al.  Dopaminergic Mechanisms in Actions and Habits , 2007, The Journal of Neuroscience.

[6]  B. Balleine,et al.  Inactivation of dorsolateral striatum enhances sensitivity to changes in the action–outcome contingency in instrumental conditioning , 2006, Behavioural Brain Research.

[7]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[8]  B. Balleine,et al.  Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning , 2004, The European journal of neuroscience.

[9]  W. Schultz,et al.  Discrete Coding of Reward Probability and Uncertainty by Dopamine Neurons , 2003, Science.

[10]  J. Staddon Adaptive Dynamics: The Theoretical Analysis of Behavior , 2001 .

[11]  B. Balleine,et al.  Goal-directed instrumental action: contingency and incentive learning and their cortical substrates , 1998, Neuropharmacology.

[12]  A. Dickinson,et al.  Instrumental judgment and performance under variations in action-outcome contingency and contiguity , 1991, Memory & cognition.

[13]  A. Dickinson Actions and habits: the development of behavioural autonomy , 1985 .

[14]  R M Church,et al.  Scalar Timing in Memory , 1984, Annals of the New York Academy of Sciences.

[15]  Christopher D. Adams,et al.  The Effect of the Instrumental Training Contingency on Susceptibility to Reinforcer Devaluation , 1983 .

[16]  Christopher D. Adams Variations in the Sensitivity of Instrumental Responding to Reinforcer Devaluation , 1982 .

[17]  Christopher D. Adams,et al.  Instrumental Responding following Reinforcer Devaluation , 1981 .

[18]  W M Baum,et al.  The correlation-based law of effect. , 1973, Journal of the experimental analysis of behavior.

[19]  G. S. Reynolds,et al.  A quantitative analysis of the responding maintained by interval schedules of reinforcement. , 1968, Journal of the experimental analysis of behavior.

[20]  J FARMER,et al.  PROPERTIES OF BEHAVIOR UNDER RANDOM INTERVAL REINFORCEMENT SCHEDULES. , 1963, Journal of the experimental analysis of behavior.

[21]  Floyd C. Mace,et al.  Schedules of reinforcement , 2011 .

[22]  A. Dickinson Expectancy theory in animal conditioning. , 1989 .

[23]  R. Rescorla,et al.  Associative Structures In Instrumental Learning , 1986 .

[24]  Robert Miller Meaning and Purpose in the Intact Brain , 1981 .