Research on decision-making strategy of soccer robot based on multi-agent reinforcement learning

This article studies a multi-agent reinforcement learning algorithm based on agent action prediction. In multi-agent system, the action of learning agent selection is inevitably affected by the action of other agents, so the reinforcement learning system needs to consider the joint state and joint action of multi-agent based on this. In addition, the application of this method in the cooperative strategy learning of soccer robot is studied, so that the multi-agent system can pass through the environment. To realize the division of labour and cooperation of multi-robots, the interactive learning is used to master the behaviour strategy. Combined with the characteristics of decision-making of soccer robot, this article analyses the role transformation and experience sharing of multi-agent reinforcement learning, and applies it to the local attack strategy of soccer robot, uses this algorithm to learn the action selection strategy of the main robot in the team, and uses Matlab platform for simulation verification. The experimental results prove the effectiveness of the research method, and the superiority of the proposed method is validated compared with some simple methods.

[1]  Victor C. M. Leung,et al.  Deep-Reinforcement-Learning-Based Optimization for Cache-Enabled Opportunistic Interference Alignment Wireless Networks , 2017, IEEE Transactions on Vehicular Technology.

[2]  Maozhu Jin,et al.  Uniform $L^1$ stability of the inelastic Boltzmann equation with large external force for hard potentials , 2019, Discrete & Continuous Dynamical Systems - S.

[3]  Shaofei Wu,et al.  Sewage information monitoring system based on wireless sensor , 2018 .

[4]  Jian Xu,et al.  A comprehensive study on the locomotion characteristics of a metameric earthworm-like robot , 2015 .

[5]  Jianfeng Chen,et al.  Smart generation control based on multi-agent reinforcement learning with the idea of the time tunnel , 2017 .

[6]  Shaofei Wu,et al.  Evaluation of Developer Efficiency Based on Improved DEA Model , 2018, Wirel. Pers. Commun..

[7]  Shaofei Wu,et al.  Nonlinear information data mining based on time series for fractional differential operators. , 2019, Chaos.

[8]  Xiao Wang,et al.  Multi-agent reinforcement learning based maintenance policy for a resource constrained flow line system , 2016, J. Intell. Manuf..

[9]  Feng Zhao,et al.  A Reinforcement Learning-Based Sleep Scheduling Algorithm for Desired Area Coverage in Solar-Powered Wireless Sensor Networks , 2016, IEEE Sensors Journal.

[10]  Lizhi Liu,et al.  Modeling method of internet public information data mining based on probabilistic topic model , 2019, The Journal of Supercomputing.

[11]  Zibin Zheng,et al.  Integrating Reinforcement Learning with Multi-Agent Techniques for Adaptive Service Composition , 2017, ACM Trans. Auton. Adapt. Syst..

[12]  Yigang He,et al.  Analysis of Mutual Couple Effect of UHF RFID Antenna for the Internet of Things Environment , 2019, IEEE Access.

[13]  Glen Berseth,et al.  DeepLoco , 2017, ACM Trans. Graph..

[14]  Shaofei Wu,et al.  Research on internet information mining based on agent algorithm , 2018, Future Gener. Comput. Syst..

[15]  Loredana Zollo,et al.  Interplay of Rhythmic and Discrete Manipulation Movements During Development: A Policy-Search Reinforcement-Learning Robot Model , 2016, IEEE Transactions on Cognitive and Developmental Systems.

[16]  Jie Wu,et al.  e-Sampling , 2017, ACM Trans. Auton. Adapt. Syst..

[17]  Hongbing Wang,et al.  Effective service composition using multi-agent reinforcement learning , 2016, Knowl. Based Syst..

[18]  Xiaoliang Ma,et al.  A Multi-Objective Agent-Based Control Approach With Application in Intelligent Traffic Signal System , 2019, IEEE Transactions on Intelligent Transportation Systems.

[19]  Shaofei Wu,et al.  A Traffic Motion Object Extraction Algorithm , 2015, Int. J. Bifurc. Chaos.

[20]  Robert Babuska,et al.  Decentralized Reinforcement Learning of Robot Behaviors , 2018, Artif. Intell..

[21]  Shaofei Wu,et al.  Bidirectional cognitive computing method supported by cloud technology , 2018, Cognitive Systems Research.