论文信息 - Skinner-Pigeon Experiment Simulated Based on Probabilistic Automata

Skinner-Pigeon Experiment Simulated Based on Probabilistic Automata

This paper constructs a learning probabilistic automata (PA) model with response of operant conditioning (OC) behavior, which used for simulating skinner-pigeon experiment. The PA model with OC is a form of animal learning in that it allows an agent to adapt its actions to gain maximally from the environment while only being rewarded for correct performance. The learning mechanism achieved by design probability of action selection, which is updated by the information of reward and punishment form the environment, and then the agent select an action random according to the probability of action selection. We apply our model to skinner-pigeon experiment, the peck button task. The pigeon learn this task in stages. In simulation, our model also acquires the task in a similar manner.

Xiaogang Ruan | Jianxian Cai

[1] B. Roche,et al. The Behavior of Organisms? , 1997 .

[2] Manuela M. Veloso,et al. CMRoboBits: Creating an Intelligent AIBO Robot , 2006, AI Mag..

[3] David S. Touretzky,et al. Tekkotsu: A Framework for AIBO Cognitive Robotics , 2005, AAAI.

[4] David S. Touretzky,et al. Operant Conditioning in Skinnerbots , 1997, Adapt. Behav..

[5] W. Holcombe. Algebraic automata theory: Contents , 1982 .

[6] David S. Touretzky,et al. Cognitive Primitives for Mobile Robots , 2004, AAAI Technical Report.

[7] E. Tira-Thompson. Combining Configural and TD Learning on a Robot , 2002, ICDL 2002.

[8] David S. Touretzky,et al. Shaping robot behavior using principles from instrumental conditioning , 1997, Robotics Auton. Syst..

[9] B. Skinner. Two Types of Conditioned Reflex and a Pseudo Type , 1935 .

[10] Matthew Simon,et al. Automata Theory , 1999 .

[11] David S. Touretzky,et al. Application of a model of instrumental conditioning to mobile robot control , 1997, Other Conferences.

[12] W. N. Schoenfeld,et al. Essentials of behavior. , 1952 .

[13] Paolo Dario,et al. Behavior model of humanoid robots based on operant conditioning , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..