Shaping robot behavior using principles from instrumental conditioning
暂无分享,去创建一个
[1] Marco Colombetti,et al. Behavior analysis and training-a methodology for behavior engineering , 1996, IEEE Trans. Syst. Man Cybern. Part B.
[2] José del R. Millán,et al. Rapid, safe, and incremental learning of navigation strategies , 1996, IEEE Trans. Syst. Man Cybern. Part B.
[3] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[4] M. Conway. Handbook of perception and cognition , 1996 .
[5] Pattie Maes,et al. No Bad Dogs: Ethological Lessons for Learning in Hamsterdam , 1996 .
[6] Gillian M. Hayes,et al. Robot Shaping --- Principles, Methods and Architectures , 1996 .
[7] Marco Colombetti,et al. Robot Shaping: Developing Autonomous Agents Through Learning , 1994, Artif. Intell..
[8] José del R. Millán,et al. Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot , 1994 .
[9] Bruce Blumberg,et al. Action-selection in hamsterdam: lessons from ethology , 1994 .
[10] Reid G. Simmons,et al. Structured control for autonomous robots , 1994, IEEE Trans. Robotics Autom..
[11] T. Bussey,et al. A novel automated touchscreen procedure for assessing learning in the rat using computer graphic stimuli , 1994 .
[12] D. Cliff. From animals to animats 3 : proceedings of the Third International Conference on Simulation of Adaptive Behavior , 1994 .
[13] R. Hampson,et al. Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat. , 1993, Behavioral neuroscience.
[14] Devika Subramanian,et al. A Multistrategy Learning Scheme for Agent Knowledge Acquisition , 1993, Informatica.
[15] Douglas A. Baxter,et al. A learning rule based on empirically-derived activity-dependent neuromodulation supports operant conditioning in a small network , 1992, Neural Networks.
[16] Paul E. Utgoff,et al. A Teaching Method for Reinforcement Learning , 1992, ML.
[17] Paul E. Utgoff,et al. Two Kinds of Training Information For Evaluation Function Learning , 1991, AAAI.
[18] Douglas A. Baxter,et al. Empirically derived adaptive elements and networks simulate associative learning , 1991 .
[19] C. Watkins. Learning from delayed rewards , 1989 .
[20] S. Pellis,et al. Escalation of feline predation along a gradient from avoidance through "play" to killing. , 1988, Behavioral neuroscience.
[21] A. Dickinson. Actions and habits: the development of behavioural autonomy , 1985 .
[22] A G Barto,et al. Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.
[23] J. Pearce,et al. A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli. , 1980, Psychological review.
[24] C. Gallistel. The organization of action , 1980 .
[25] H. M. Jenkins,et al. Sign-tracking : the stimulus-reinforcer relation and directed action , 1974 .
[26] H. M. Jenkins,et al. The form of the auto-shaped response with food or water reinforcers. , 1973, Journal of the experimental analysis of behavior.
[27] R. Rescorla. A theory of pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement , 1972 .
[28] G. S. Reynolds. A Primer of Operant Conditioning , 1968 .
[29] P. L. Brown,et al. Auto-shaping of the pigeon's key-peck. , 1968, Journal of the experimental analysis of behavior.
[30] A. A. Mullin,et al. Principles of neurodynamics , 1962 .
[31] K. Breland,et al. The misbehavior of organisms. , 1961 .