Short-term gains, long-term pains: How cues about state aid learning in dynamic environments

[1]  R. Luce,et al.  Individual Choice Behavior: A Theoretical Analysis. , 1960 .

[2]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[3]  R. Rescorla,et al.  A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .

[4]  W. F. Prokasy,et al.  Classical conditioning II: Current research and theory. , 1972 .

[5]  P. W. Frey,et al.  Inhibition and learning , 1973 .

[6]  H. Akaike A new look at the statistical model identification , 1974 .

[7]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[8]  R. Duncan Luce,et al.  Individual Choice Behavior: A Theoretical Analysis , 1979 .

[9]  W. Hamilton,et al.  The evolution of cooperation. , 1984, Science.

[10]  D. Broadbent,et al.  Interactive tasks and the implicit‐explicit distinction , 1988 .

[11]  Stephen Grossberg,et al.  The ART of adaptive pattern recognition by a self-organizing neural network , 1988, Computer.

[12]  R. Mathews,et al.  Insight without Awareness: On the Interaction of Verbalization, Instruction and Practice in a Simulated Process Control Task , 1989 .

[13]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[14]  Leslie Pack Kaelbling,et al.  Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.

[15]  R. Herrnstein,et al.  Melioration: A Theory of Distributed Choice , 1991 .

[16]  R. Herrnstein Experiments on Stable Suboptimality in Individual Behavior , 1991 .

[17]  R. Herrnstein,et al.  Utility maximization and melioration: Internalities in individual choice , 1993 .

[18]  S. Dehaene,et al.  The mental representation of parity and number magnitude. , 1993 .

[19]  Andrew McCallum,et al.  Overcoming Incomplete Perception with Utile Distinction Memory , 1993, ICML.

[20]  Gerald Tesauro,et al.  TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play , 1994, Neural Computation.

[21]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[22]  Richard S. Sutton,et al.  Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse , 1996 .

[23]  Peter Dayan,et al.  Bee foraging in uncertain environments using predictive hebbian learning , 1995, Nature.

[24]  P. Dayan,et al.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[25]  Andrew McCallum,et al.  Reinforcement learning with selective perception and hidden state , 1996 .

[26]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[27]  P. Montague,et al.  A Computational Role for Dopamine Delivery in Human Decision-Making , 1998, Journal of Cognitive Neuroscience.

[28]  Carlo Contoreggi,et al.  Drug abusers show impaired performance in a laboratory test of decision making , 2000, Neuropsychologia.

[29]  Steven W Anderson,et al.  Decision-making deficits, linked to a dysfunctional ventromedial prefrontal cortex, revealed in alcohol and stimulant abusers , 2001, Neuropsychologia.

[30]  Richard S. Sutton,et al.  Predictive Representations of State , 2001, NIPS.

[31]  Jeff G. Schneider,et al.  Autonomous helicopter control using reinforcement learning policy search methods , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[32]  M. Arbib,et al.  Modeling functions of striatal dopamine modulation in learning and planning , 2001, Neuroscience.

[33]  Hanna Damasio,et al.  Decision-making and addiction (part I): impaired activation of somatic states in substance dependent individuals when pondering decisions with negative future consequences , 2002, Neuropsychologia.

[34]  D. Shanks,et al.  A re‐examination of melioration and rational choice , 2002 .

[35]  J. Busemeyer,et al.  A contribution of cognitive decision models to clinical assessment: decomposing performance on the Bechara gambling task. , 2002, Psychological assessment.

[36]  P. Montague,et al.  Neural Economics and the Biological Substrates of Valuation , 2002, Neuron.

[37]  David S. Touretzky,et al.  Long-Term Reward Prediction in TD Models of the Dopamine System , 2002, Neural Computation.

[38]  D. Ballard,et al.  What you see is what you need. , 2003, Journal of vision.

[39]  Howard Rachlin,et al.  Learning by pigeons playing against tit-for-tat in an operant prisoner’s dilemma , 2003, Learning & behavior.

[40]  D. Medin,et al.  SUSTAIN: a network model of category learning. , 2004, Psychological review.

[41]  Eldad Yechiam,et al.  Comparison of basic assumptions embedded in learning models for experience-based decision making , 2005, Psychonomic bulletin & review.

[42]  Dana H. Ballard,et al.  Learning to perceive and act by trial and error , 1991, Machine Learning.

[43]  R. Sun,et al.  The interaction of the explicit and the implicit in skill learning: a dual-process approach. , 2005, Psychological review.

[44]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[45]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[46]  John R. Anderson,et al.  From recurrent choice to skill learning: a reinforcement-learning model. , 2006, Journal of experimental psychology. General.

[47]  Wayne D. Gray,et al.  Melioration Dominates Maximization: Stable Suboptimal Performance Despite Global Feedback , 2006 .

[48]  P. Dayan,et al.  Cortical substrates for exploratory decisions in humans , 2006, Nature.

[49]  Robert A Jacobs,et al.  Near-Optimal Human Adaptive Control across Different Noise Environments , 2006, The Journal of Neuroscience.

[50]  Jadin C. Jackson,et al.  Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. , 2007, Psychological review.

[51]  Samuel M. McClure,et al.  Short-term memory traces for action bias in human reinforcement learning , 2007, Brain Research.

[52]  Arthur B Markman,et al.  Regulatory fit effects in a choice task , 2007, Psychonomic bulletin & review.