论文信息 - Shaping robot behavior using principles from instrumental conditioning - 字舞流文

Shaping robot behavior using principles from instrumental conditioning

David S. Touretzky | Lisa M. Saksida | Scott M. Raymond | D. Touretzky | L. Saksida

[1] Marco Colombetti,et al. Behavior analysis and training-a methodology for behavior engineering , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[2] José del R. Millán,et al. Rapid, safe, and incremental learning of navigation strategies , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[3] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[4] M. Conway. Handbook of perception and cognition , 1996 .

[5] Pattie Maes,et al. No Bad Dogs: Ethological Lessons for Learning in Hamsterdam , 1996 .

[6] Gillian M. Hayes,et al. Robot Shaping --- Principles, Methods and Architectures , 1996 .

[7] Marco Colombetti,et al. Robot Shaping: Developing Autonomous Agents Through Learning , 1994, Artif. Intell..

[8] José del R. Millán,et al. Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot , 1994 .

[9] Bruce Blumberg,et al. Action-selection in hamsterdam: lessons from ethology , 1994 .

[10] Reid G. Simmons,et al. Structured control for autonomous robots , 1994, IEEE Trans. Robotics Autom..

[11] T. Bussey,et al. A novel automated touchscreen procedure for assessing learning in the rat using computer graphic stimuli , 1994 .

[12] D. Cliff. From animals to animats 3 : proceedings of the Third International Conference on Simulation of Adaptive Behavior , 1994 .

[13] R. Hampson,et al. Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat. , 1993, Behavioral neuroscience.

[14] Devika Subramanian,et al. A Multistrategy Learning Scheme for Agent Knowledge Acquisition , 1993, Informatica.

[15] Douglas A. Baxter,et al. A learning rule based on empirically-derived activity-dependent neuromodulation supports operant conditioning in a small network , 1992, Neural Networks.

[16] Paul E. Utgoff,et al. A Teaching Method for Reinforcement Learning , 1992, ML.

[17] Paul E. Utgoff,et al. Two Kinds of Training Information For Evaluation Function Learning , 1991, AAAI.

[18] Douglas A. Baxter,et al. Empirically derived adaptive elements and networks simulate associative learning , 1991 .

[19] C. Watkins. Learning from delayed rewards , 1989 .

[20] S. Pellis,et al. Escalation of feline predation along a gradient from avoidance through "play" to killing. , 1988, Behavioral neuroscience.

[21] A. Dickinson. Actions and habits: the development of behavioural autonomy , 1985 .

[22] A G Barto,et al. Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[23] J. Pearce,et al. A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli. , 1980, Psychological review.

[24] C. Gallistel. The organization of action , 1980 .

[25] H. M. Jenkins,et al. Sign-tracking : the stimulus-reinforcer relation and directed action , 1974 .

[26] H. M. Jenkins,et al. The form of the auto-shaped response with food or water reinforcers. , 1973, Journal of the experimental analysis of behavior.

[27] R. Rescorla. A theory of pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement , 1972 .

[28] G. S. Reynolds. A Primer of Operant Conditioning , 1968 .

[29] P. L. Brown,et al. Auto-shaping of the pigeon's key-peck. , 1968, Journal of the experimental analysis of behavior.

[30] A. A. Mullin,et al. Principles of neurodynamics , 1962 .

[31] K. Breland,et al. The misbehavior of organisms. , 1961 .