论文信息 - A Neuro-Evolution Approach to Shepherding Swarm Guidance in the Face of Uncertainty

A Neuro-Evolution Approach to Shepherding Swarm Guidance in the Face of Uncertainty

Controlling a large swarm of agents is a challenging task. Shepherding refers to an active field of research that seeks to address this challenge by using a control agent (sheepdog), which guides a swarm (sheep) towards a goal. Traditional shepherding involves switching between two main behaviours: driving the swarm towards the goal, and collecting stray sheep back to the flock. Evidently, the movement of the agents are dependent on their sensed information. Therefore, effectively controlling a swarm is even more challenging when sensor information or communication channels are unreliable. In this paper, we propose a shepherding methodology to achieve efficient swarm control in the presence of noise in the sensed information. The proposed approach consists of a new resting behaviour and a neural network-based reinforcement learning model. The neural network is used to learn shepherding policies using the new resting behaviour, where the objective is to optimise the frequency of sheep-to-dog interactions with varying levels of noise. The proposed approach is validated through simulations. Numerical experiments show that the proposed approach results in a more effective and stable performance compared to some conventional shepherding models from the literature.

[1] Robert Hunjet,et al. Path Planning for Shepherding a Swarm in a Cluttered Environment using Differential Evolution , 2020, 2020 IEEE Symposium Series on Computational Intelligence (SSCI).

[2] H. Abbass,et al. Disturbances in Influence of a Shepherding Agent is More Impactful than Sensorial Noise During Swarm Guidance , 2020, 2020 IEEE Symposium Series on Computational Intelligence (SSCI).

[3] Kenneth O. Stanley,et al. Designing neural networks through neuroevolution , 2019, Nat. Mach. Intell..

[4] Shanhui Fan,et al. Training of Photonic Neural Networks through In Situ Backpropagation , 2018, 2019 Conference on Lasers and Electro-Optics (CLEO).

[5] Lihua Xie,et al. A survey on recent progress in control of swarm systems , 2017, Science China Information Sciences.

[6] Katia P. Sycara,et al. Human Interaction With Robot Swarms: A Survey , 2016, IEEE Transactions on Human-Machine Systems.

[7] Andrew J. King,et al. Solving the shepherding problem: heuristics for herding autonomous, interacting agents , 2014, Journal of The Royal Society Interface.

[8] Leslie Pack Kaelbling,et al. Acting Optimally in Partially Observable Stochastic Domains , 1994, AAAI.

[9] Peter Dayan,et al. Q-learning , 1992, Machine Learning.

[10] Craig W. Reynolds. Flocks, herds, and schools: a distributed behavioral model , 1987, SIGGRAPH.

[11] R. Howard. Dynamic Programming and Markov Processes , 1960 .

[12] Nancy M. Amato,et al. Shepherding behaviors , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.