Automatic control based on wasp behavioral model and stochastic learning automata

A stochastic automaton can perform a finite number of actions in a random environment. When a specific action is performed, the environment responds by producing an environment output that is stochastically related to the action. The aim is to design an automaton, using a reinforcement scheme based on the computational model of wasp behaviour that can determine the best action guided by past actions and environment responses. Using Stochastic Learning Automata techniques, we introduce a decision/control method for intelligent vehicles receiving data from on-board sensors or from the localization system of highway infrastructure.

[1]  Sridhar Mahadevan,et al.  Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..

[2]  S. Lakshmivarahan,et al.  Absolutely Expedient Learning Algorithms For Stochastic Automata , 1973 .

[3]  Guy Theraulaz,et al.  Task differentiation in Polistes wasp colonies: a model for self-organizing groups of robots , 1991 .

[4]  A. Iazzetta The Eeectiveness of Co{mutation in Evolutionary Algorithms: the M Ijn Operator , 2022 .

[5]  Ernesto Tarantino,et al.  A Comparative Analysis of Evolutionary Algorithms for Function Optimisation , 1996 .

[6]  Guy Theraulaz,et al.  Response Threshold Reinforcement and Division of Labor in Insect Societies , 1998 .

[7]  Hajime Kita,et al.  A Comparison Study of Self-Adaptation in Evolution Strategies and Real-Coded Genetic Algorithms , 2001, Evolutionary Computation.

[8]  Olivier Buffet,et al.  Incremental reinforcement learning for designing multi-agent systems , 2001, AGENTS '01.

[9]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[10]  Kumpati S. Narendra,et al.  Learning automata - an introduction , 1989 .

[11]  E. Bonabeau,et al.  Fixed response thresholds and the regulation of division of labor in insect societies , 1998 .

[12]  Dana Simian,et al.  Models for a multi-agent system based on wasp-like behaviour for distributed patients repartition , 2008 .

[13]  Pushkin Kachroo,et al.  Simulation study of multiple intelligent vehicle control using stochastic learning automata , 1997 .

[14]  Pushkin Kachroo,et al.  Multiple stochastic learning automata for vehicle path control in an automated highway system , 1999, IEEE Trans. Syst. Man Cybern. Part A.

[15]  Yufeng Liu,et al.  Stochastic Direct Reinforcement: Application to Simple Games with Recurrence , 2004, AAAI Technical Report.

[16]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[18]  G. Theraulaz,et al.  Response threshold reinforcements and division of labour in insect societies , 1998, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[19]  E. Tarantino,et al.  The effectiveness of co-mutation in evolutionary algorithms: the /spl Mscr//sub ijn/ operator , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[20]  Stephen F. Smith,et al.  Wasp-like Agents for Distributed Factory Coordination , 2004, Autonomous Agents and Multi-Agent Systems.