Mimicking the Behaviour of Idiotypic AIS Robot Controllers Using Probabilistic Systems

Previous work has shown that robot navigation systems that employ an architecture based upon the idiotypic network theory of the immune system have an advantage over control techniques that rely on reinforcement learning only. This is thought to be a result of intelligent behaviour selection on the part of the idiotypic robot. In this paper an attempt is made to imitate idiotypic dynamics by creating controllers that use reinforcement with a number of different probabilistic schemes to select robot behaviour. The aims are to show that the idiotypic system is not merely performing some kind of periodic random behaviour selection, and to try to gain further insight into the processes that govern the idiotypic mechanism. Trials are carried out using simulated Pioneer robots that undertake navigation exercises. Results show that a scheme that boosts the probability of selecting highly-ranked alternative behaviours to 50% during stall conditions comes closest to achieving the properties of the idiotypic system, but remains unable to match it in terms of all round performance.

[1]  Fernando José Von Zuben,et al.  An Immune Learning Classifier Network for Autonomous Navigation , 2003, ICARIS.

[2]  Yoshiki Uchikawa,et al.  Evolutionary construction of an immune network-based behavior arbitration mechanism for autonomous mobile robots , 1998 .

[3]  Alan S. Perelson,et al.  The immune system, adaptation, and machine learning , 1986 .

[4]  Guan-Chun Luh,et al.  Reactive Immune Network Based Mobile Robot Navigation , 2004, ICARIS.

[5]  Yoshiki Uchikawa,et al.  Emergent construction of a behavior arbitration mechanism based on the immune system , 1997, Adv. Robotics.

[6]  Zhou Ji,et al.  Revisiting Negative Selection Algorithms , 2007, Evolutionary Computation.

[7]  Yoshiki Uchikawa,et al.  A Reinforcement Learning Method for Dynamic Behavior Arbitration of Autonomous Mobile Robots Based on the Immunological Information Processing Mechanisms , 1997 .

[8]  Jerne Nk Towards a network theory of the immune system. , 1974 .

[9]  Richard T. Vaughan,et al.  The Player/Stage Project: Tools for Multi-Robot and Distributed Sensor Systems , 2003 .

[10]  F.J. Von Zuben,et al.  Decentralized control system for autonomous navigation based on an evolved artificial immune network , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[11]  Uwe Aickelin,et al.  Idiotypic Immune Networks in Mobile-Robot Control , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12]  N K Jerne,et al.  Towards a network theory of the immune system. , 1973, Annales d'immunologie.

[13]  Uwe Aickelin,et al.  An Idiotypic Immune Network as a Short-Term Learning Architecture for Mobile Robots , 2008, ICARIS.

[14]  Hugues Bersini,et al.  Revisiting Idiotypic Immune Networks , 2003, ECAL.

[15]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[16]  Fernando José Von Zuben,et al.  Learning and optimization using the clonal selection principle , 2002, IEEE Trans. Evol. Comput..

[17]  Werner Dilger,et al.  AIS Based Robot Navigation in a Rescue Scenario , 2004, ICARIS.

[18]  P. Delves,et al.  The Immune System , 2000 .