Short-term gains, long-term pains: How cues about state aid learning in dynamic environments
[1] R. Luce,et al. Individual Choice Behavior: A Theoretical Analysis. , 1960 .
[2] John A. Nelder,et al. A Simplex Method for Function Minimization , 1965, Comput. J..
[3] R. Rescorla,et al. A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .
[4] W. F. Prokasy,et al. Classical conditioning II: Current research and theory. , 1972 .
[5] P. W. Frey,et al. Inhibition and learning , 1973 .
[6] H. Akaike. A new look at the statistical model identification , 1974 .
[7] G. Schwarz. Estimating the Dimension of a Model , 1978 .
[8] R. Duncan Luce,et al. Individual Choice Behavior: A Theoretical Analysis , 1979 .
[9] W. Hamilton,et al. The evolution of cooperation. , 1984, Science.
[10] D. Broadbent,et al. Interactive tasks and the implicit‐explicit distinction , 1988 .
[11] Stephen Grossberg,et al. The ART of adaptive pattern recognition by a self-organizing neural network , 1988, Computer.
[12] R. Mathews,et al. Insight without Awareness: On the Interaction of Verbalization, Instruction and Practice in a Simulated Process Control Task , 1989 .
[13] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .
[14] Leslie Pack Kaelbling,et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.
[15] R. Herrnstein,et al. Melioration: A Theory of Distributed Choice , 1991 .
[16] R. Herrnstein. Experiments on Stable Suboptimality in Individual Behavior , 1991 .
[17] R. Herrnstein,et al. Utility maximization and melioration: Internalities in individual choice , 1993 .
[18] S. Dehaene,et al. The mental representation of parity and number magnitude. , 1993 .
[19] Andrew McCallum,et al. Overcoming Incomplete Perception with Utile Distinction Memory , 1993, ICML.
[20] Gerald Tesauro,et al. TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play , 1994, Neural Computation.
[21] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[22] Richard S. Sutton,et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse , 1996 .
[23] Peter Dayan,et al. Bee foraging in uncertain environments using predictive hebbian learning , 1995, Nature.
[24] P. Dayan,et al. A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.
[25] Andrew McCallum,et al. Reinforcement learning with selective perception and hidden state , 1996 .
[26] Peter Dayan,et al. A Neural Substrate of Prediction and Reward , 1997, Science.
[27] P. Montague,et al. A Computational Role for Dopamine Delivery in Human Decision-Making , 1998, Journal of Cognitive Neuroscience.
[28] Carlo Contoreggi,et al. Drug abusers show impaired performance in a laboratory test of decision making , 2000, Neuropsychologia.
[29] Steven W Anderson,et al. Decision-making deficits, linked to a dysfunctional ventromedial prefrontal cortex, revealed in alcohol and stimulant abusers , 2001, Neuropsychologia.
[30] Richard S. Sutton,et al. Predictive Representations of State , 2001, NIPS.
[31] Jeff G. Schneider,et al. Autonomous helicopter control using reinforcement learning policy search methods , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).
[32] M. Arbib,et al. Modeling functions of striatal dopamine modulation in learning and planning , 2001, Neuroscience.
[33] Hanna Damasio,et al. Decision-making and addiction (part I): impaired activation of somatic states in substance dependent individuals when pondering decisions with negative future consequences , 2002, Neuropsychologia.
[34] D. Shanks,et al. A re‐examination of melioration and rational choice , 2002 .
[35] J. Busemeyer,et al. A contribution of cognitive decision models to clinical assessment: decomposing performance on the Bechara gambling task. , 2002, Psychological assessment.
[36] P. Montague,et al. Neural Economics and the Biological Substrates of Valuation , 2002, Neuron.
[37] David S. Touretzky,et al. Long-Term Reward Prediction in TD Models of the Dopamine System , 2002, Neural Computation.
[38] D. Ballard,et al. What you see is what you need. , 2003, Journal of vision.
[39] Howard Rachlin,et al. Learning by pigeons playing against tit-for-tat in an operant prisoner’s dilemma , 2003, Learning & behavior.
[40] D. Medin,et al. SUSTAIN: a network model of category learning. , 2004, Psychological review.
[41] Eldad Yechiam,et al. Comparison of basic assumptions embedded in learning models for experience-based decision making , 2005, Psychonomic bulletin & review.
[42] Dana H. Ballard,et al. Learning to perceive and act by trial and error , 1991, Machine Learning.
[43] R. Sun,et al. The interaction of the explicit and the implicit in skill learning: a dual-process approach. , 2005, Psychological review.
[44] P. Dayan,et al. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.
[45] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[46] John R. Anderson,et al. From recurrent choice to skill learning: a reinforcement-learning model. , 2006, Journal of experimental psychology. General.
[47] Wayne D. Gray,et al. Melioration Dominates Maximization: Stable Suboptimal Performance Despite Global Feedback , 2006 .
[48] P. Dayan,et al. Cortical substrates for exploratory decisions in humans , 2006, Nature.
[49] Robert A Jacobs,et al. Near-Optimal Human Adaptive Control across Different Noise Environments , 2006, The Journal of Neuroscience.
[50] Jadin C. Jackson,et al. Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. , 2007, Psychological review.
[51] Samuel M. McClure,et al. Short-term memory traces for action bias in human reinforcement learning , 2007, Brain Research.
[52] Arthur B Markman,et al. Regulatory fit effects in a choice task , 2007, Psychonomic bulletin & review.