Operational Index Evaluation Based on Greedy Strategy in a Combat of Multi-Arms

The operational index evaluation needs to be combined with the static capability and dynamic tactics of arms of both sides. According to the Lanchester equation of multi-arms engagement, we propose an operational index evaluation method based on the greedy strategy. The proposed method combines the normal engagement model and the reinforcement learning theory, and circularly updates the operational index and fire assignment of arms, which gradually converges to a stable value. The method has the advantages of intuitive principle, bootstrap and convenient for computer processing.

[1]  Risa Nye,et al.  Numbers , 2008, A Grammar of the Hittite Language.

[2]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[3]  Thomas L. Saaty,et al.  DECISION MAKING WITH THE ANALYTIC HIERARCHY PROCESS , 2008 .

[4]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[5]  Hans-Jürgen Zimmermann,et al.  Fuzzy Set Theory - and Its Applications , 1985 .

[6]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[7]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[8]  H. Zimmermann,et al.  Fuzzy Set Theory and Its Applications , 1993 .

[9]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[10]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[11]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.