论文信息 - Short-term gains, long-term pains: How cues about state aid learning in dynamic environments - 字舞流文

Short-term gains, long-term pains: How cues about state aid learning in dynamic environments

Bradley C. Love | B. Love | T. Gureckis

[1] R. Luce,et al. Individual Choice Behavior: A Theoretical Analysis. , 1960 .

[2] John A. Nelder,et al. A Simplex Method for Function Minimization , 1965, Comput. J..

[3] R. Rescorla,et al. A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .

[4] W. F. Prokasy,et al. Classical conditioning II: Current research and theory. , 1972 .

[5] P. W. Frey,et al. Inhibition and learning , 1973 .

[6] H. Akaike. A new look at the statistical model identification , 1974 .

[7] G. Schwarz. Estimating the Dimension of a Model , 1978 .

[8] R. Duncan Luce,et al. Individual Choice Behavior: A Theoretical Analysis , 1979 .

[9] W. Hamilton,et al. The evolution of cooperation. , 1984, Science.

[10] D. Broadbent,et al. Interactive tasks and the implicit‐explicit distinction , 1988 .

[11] Stephen Grossberg,et al. The ART of adaptive pattern recognition by a self-organizing neural network , 1988, Computer.

[12] R. Mathews,et al. Insight without Awareness: On the Interaction of Verbalization, Instruction and Practice in a Simulated Process Control Task , 1989 .

[13] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .

[14] Leslie Pack Kaelbling,et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.

[15] R. Herrnstein,et al. Melioration: A Theory of Distributed Choice , 1991 .

[16] R. Herrnstein. Experiments on Stable Suboptimality in Individual Behavior , 1991 .

[17] R. Herrnstein,et al. Utility maximization and melioration: Internalities in individual choice , 1993 .

[18] S. Dehaene,et al. The mental representation of parity and number magnitude. , 1993 .

[19] Andrew McCallum,et al. Overcoming Incomplete Perception with Utile Distinction Memory , 1993, ICML.

[20] Gerald Tesauro,et al. TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play , 1994, Neural Computation.

[21] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[22] Richard S. Sutton,et al. Generalization in ReinforcementLearning : Successful Examples UsingSparse Coarse , 1996 .

[23] Peter Dayan,et al. Bee foraging in uncertain environments using predictive hebbian learning , 1995, Nature.

[24] P. Dayan,et al. A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[25] Andrew McCallum,et al. Reinforcement learning with selective perception and hidden state , 1996 .

[26] Peter Dayan,et al. A Neural Substrate of Prediction and Reward , 1997, Science.

[27] P. Montague,et al. A Computational Role for Dopamine Delivery in Human Decision-Making , 1998, Journal of Cognitive Neuroscience.

[28] Carlo Contoreggi,et al. Drug abusers show impaired performance in a laboratory test of decision making , 2000, Neuropsychologia.

[29] Steven W Anderson,et al. Decision-making deficits, linked to a dysfunctional ventromedial prefrontal cortex, revealed in alcohol and stimulant abusers , 2001, Neuropsychologia.

[30] Richard S. Sutton,et al. Predictive Representations of State , 2001, NIPS.

[31] Jeff G. Schneider,et al. Autonomous helicopter control using reinforcement learning policy search methods , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[32] M. Arbib,et al. Modeling functions of striatal dopamine modulation in learning and planning , 2001, Neuroscience.

[33] Hanna Damasio,et al. Decision-making and addiction (part I): impaired activation of somatic states in substance dependent individuals when pondering decisions with negative future consequences , 2002, Neuropsychologia.

[34] D. Shanks,et al. A re‐examination of melioration and rational choice , 2002 .

[35] J. Busemeyer,et al. A contribution of cognitive decision models to clinical assessment: decomposing performance on the Bechara gambling task. , 2002, Psychological assessment.

[36] P. Montague,et al. Neural Economics and the Biological Substrates of Valuation , 2002, Neuron.

[37] David S. Touretzky,et al. Long-Term Reward Prediction in TD Models of the Dopamine System , 2002, Neural Computation.

[38] D. Ballard,et al. What you see is what you need. , 2003, Journal of vision.

[39] Howard Rachlin,et al. Learning by pigeons playing against tit-for-tat in an operant prisoner’s dilemma , 2003, Learning & behavior.

[40] D. Medin,et al. SUSTAIN: a network model of category learning. , 2004, Psychological review.

[41] Eldad Yechiam,et al. Comparison of basic assumptions embedded in learning models for experience-based decision making , 2005, Psychonomic bulletin & review.

[42] Dana H. Ballard,et al. Learning to perceive and act by trial and error , 1991, Machine Learning.

[43] R. Sun,et al. The interaction of the explicit and the implicit in skill learning: a dual-process approach. , 2005, Psychological review.

[44] P. Dayan,et al. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[45] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[46] John R. Anderson,et al. From recurrent choice to skill learning: a reinforcement-learning model. , 2006, Journal of experimental psychology. General.

[47] Wayne D. Gray,et al. Melioration Dominates Maximization: Stable Suboptimal Performance Despite Global Feedback , 2006 .

[48] P. Dayan,et al. Cortical substrates for exploratory decisions in humans , 2006, Nature.

[49] Robert A Jacobs,et al. Near-Optimal Human Adaptive Control across Different Noise Environments , 2006, The Journal of Neuroscience.

[50] Jadin C. Jackson,et al. Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. , 2007, Psychological review.

[51] Samuel M. McClure,et al. Short-term memory traces for action bias in human reinforcement learning , 2007, Brain Research.

[52] Arthur B Markman,et al. Regulatory fit effects in a choice task , 2007, Psychonomic bulletin & review.