Bionic experiments based on autonomous operant conditioning automata

This paper proposes an autonomous operant conditioning (AOC) automaton that, designs a bionic autonomous learning control method, and constructs an operant conditioning learning system. AOC provides a recursive operation programme. AOC simulates biological operant conditioning mechanism, which has a bionic self-organising feature, including self-learning and adaptive capabilities, and can be used for description, simulation, and design of a variety of self-organising systems. The bionic autonomous learning control method can be used to describe and simulate a bionic autonomous learning process. The learning system can learn on line by interaction with environment, and achieve the best consequences. Based on the above, we apply the model to simulate Skinner-rat experiment to prove that AOC can simulate the learning mechanism of operant conditioning, and apply it to achieving the balancing control of two-wheeled self-balancing robots which shows that AOC can be used to design a variety of intelligent behaviour for robotic systems.

[1]  Wei Sun,et al.  The design and control on intelligent underwater cleaning robot for power plant condenser , 2009, Int. J. Model. Identif. Control..

[2]  E. Tira-Thompson Combining Configural and TD Learning on a Robot , 2002, ICDL 2002.

[3]  M. A. L. Thathachar,et al.  Learning Automata for Pattern Classification , 2004 .

[4]  David S. Touretzky,et al.  Shaping robot behavior using principles from instrumental conditioning , 1997, Robotics Auton. Syst..

[5]  David S. Touretzky,et al.  Operant Conditioning in Skinnerbots , 1997, Adapt. Behav..

[6]  B. John Oommen,et al.  Dynamic algorithms for the shortest path routing problem: learning automata-based solutions , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[7]  B. Skinner Two Types of Conditioned Reflex and a Pseudo Type , 1935 .

[8]  Kumpati S. Narendra,et al.  The use of learning algorithms in telephone traffic routing - A methodology , 1983, Autom..

[9]  Erfu Yang,et al.  Multi-robot systems with agent-based reinforcement learning: evolution, opportunities and challenges , 2009, Int. J. Model. Identif. Control..

[10]  Cyrille Bertelle,et al.  Shift operators and complex systems , 2008, Int. J. Model. Identif. Control..

[11]  Marco Dorigo,et al.  Towards group transport by swarms of robots , 2009, Int. J. Bio Inspired Comput..

[12]  Gianluca Antonelli,et al.  Coordinated Control of Mobile Antennas for Ad-Hoc Networks in Cluttered Environments , 2006, IAS.

[13]  Paolo Dario,et al.  Behavior model of humanoid robots based on operant conditioning , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[14]  King-Sun Fu,et al.  Learning Control Systems-Review and Outlook , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  K. Najim,et al.  Optimization technique based on learning automata , 1990 .

[16]  Norio Baba,et al.  An application of the stochastic automaton to the investment game , 1980 .

[17]  David S. Touretzky,et al.  Application of a model of instrumental conditioning to mobile robot control , 1997, Other Conferences.

[18]  R. M. Elliott,et al.  Behavior of Organisms , 1991 .

[19]  Zhenbang Gong,et al.  Small UAV controlled by an online adaptive fuzzy control system , 2010, Int. J. Model. Identif. Control..

[20]  Marcílio Carlos Pereira de Souto,et al.  On a hybrid weightless neural system , 2009, Int. J. Bio Inspired Comput..