A Neuro-Evolution Approach to Shepherding Swarm Guidance in the Face of Uncertainty

Controlling a large swarm of agents is a challenging task. Shepherding refers to an active field of research that seeks to address this challenge by using a control agent (sheepdog), which guides a swarm (sheep) towards a goal. Traditional shepherding involves switching between two main behaviours: driving the swarm towards the goal, and collecting stray sheep back to the flock. Evidently, the movement of the agents are dependent on their sensed information. Therefore, effectively controlling a swarm is even more challenging when sensor information or communication channels are unreliable. In this paper, we propose a shepherding methodology to achieve efficient swarm control in the presence of noise in the sensed information. The proposed approach consists of a new resting behaviour and a neural network-based reinforcement learning model. The neural network is used to learn shepherding policies using the new resting behaviour, where the objective is to optimise the frequency of sheep-to-dog interactions with varying levels of noise. The proposed approach is validated through simulations. Numerical experiments show that the proposed approach results in a more effective and stable performance compared to some conventional shepherding models from the literature.

[1]  Robert Hunjet,et al.  Path Planning for Shepherding a Swarm in a Cluttered Environment using Differential Evolution , 2020, 2020 IEEE Symposium Series on Computational Intelligence (SSCI).

[2]  H. Abbass,et al.  Disturbances in Influence of a Shepherding Agent is More Impactful than Sensorial Noise During Swarm Guidance , 2020, 2020 IEEE Symposium Series on Computational Intelligence (SSCI).

[3]  Kenneth O. Stanley,et al.  Designing neural networks through neuroevolution , 2019, Nat. Mach. Intell..

[4]  Shanhui Fan,et al.  Training of Photonic Neural Networks through In Situ Backpropagation , 2018, 2019 Conference on Lasers and Electro-Optics (CLEO).

[5]  Lihua Xie,et al.  A survey on recent progress in control of swarm systems , 2017, Science China Information Sciences.

[6]  Katia P. Sycara,et al.  Human Interaction With Robot Swarms: A Survey , 2016, IEEE Transactions on Human-Machine Systems.

[7]  Andrew J. King,et al.  Solving the shepherding problem: heuristics for herding autonomous, interacting agents , 2014, Journal of The Royal Society Interface.

[8]  Leslie Pack Kaelbling,et al.  Acting Optimally in Partially Observable Stochastic Domains , 1994, AAAI.

[9]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[10]  Craig W. Reynolds Flocks, herds, and schools: a distributed behavioral model , 1987, SIGGRAPH.

[11]  R. Howard Dynamic Programming and Markov Processes , 1960 .

[12]  Nancy M. Amato,et al.  Shepherding behaviors , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.