A Behavior Optimization Method for Unmanned Combat Aerial Vehicles Using Matrix Factorization

One of the fundamental technologies for unmanned combat aerial vehicles and combat simulators is behavior optimization, which finds a behavior that maximizes the probability of winning a battle. With the advent of military science, combat logs became available, allowing machine learning algorithms to be used for the behavior optimization. Due to implicit attributes such as the experience of an operator that are not explicitly presented in log data, existing methods for behavior optimization have limitations in performance improvement. Furthermore, specific behaviors occur with low frequency, resulting in a dataset with imbalanced and empty values. Therefore, we apply a matrix factorization (MF) method, which is one of latent factor models and known for sophisticated imputation of empty values, to the behavior optimization problem of unmanned combat aerial vehicles. A situation-behavior matrix, whose elements are ratings indicating the optimality of behaviors in situations, is defined to implement the MF based method. Experiments for performance comparison were conducted on combat logs, in which the proposed method yielded satisfactory results.

[1]  Jinyoung Suk,et al.  Collision Avoidance Maneuver Planning Using GA for LEO and GEO Satellite Maintained in Keeping Area , 2012 .

[2]  Bo Wang,et al.  HI2Rec: Exploring Knowledge in Heterogeneous Information for Movie Recommendation , 2019, IEEE Access.

[3]  Gerardo G. Acosta,et al.  Trajectory tracking algorithm for autonomous vehicles using adaptive reinforcement learning , 2015, OCEANS 2015 - MTS/IEEE Washington.

[4]  Michael Lewis,et al.  Automated maneuvering decisions for air-to-air combat , 1987 .

[5]  Yubo Jiang,et al.  Improved Metric-Based Recommender by Historical Interactions , 2019, IEEE Access.

[6]  Hichem Snoussi,et al.  A reinforcement learning approach for UAV target searching and tracking , 2018, Multimedia Tools and Applications.

[7]  Bilal H. Abed-alguni,et al.  A multi-agent cooperative reinforcement learning model using a hierarchy of consultants, tutors and workers , 2015, Vietnam Journal of Computer Science.

[8]  Yun Liu,et al.  Overlapping Community Detection Using Non-Negative Matrix Factorization With Orthogonal and Sparseness Constraints , 2018, IEEE Access.

[9]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[10]  Öztürk Özdemir Kanat,et al.  Combined active flow and flight control systems design for morphing unmanned aerial vehicles , 2019 .

[11]  Tong Ming-an Modeling and Solution of Scheduling Problem and Fire Allocations for Multi-targets BVR Attacking , 2004 .

[12]  Zhou Zhong-liang Decision-Making for Cooperative Multiple Target Attack Based on Adaptive Pseudo-Parallel Genetic Algorithm , 2013 .

[13]  Zhenyu Shi,et al.  Deep reinforcement learning based optimal trajectory tracking control of autonomous underwater vehicle , 2017, 2017 36th Chinese Control Conference (CCC).

[14]  Zohreh Azimifar,et al.  Human Action Recognition: Learning Sparse Basis Units from Trajectory Subspace , 2016, Appl. Artif. Intell..

[15]  Cédric Hartland,et al.  Evolutionary Robotics, Anticipation and the Reality Gap , 2006, 2006 IEEE International Conference on Robotics and Biomimetics.

[16]  Gail Gong Cross-Validation, the Jackknife, and the Bootstrap: Excess Error Estimation in Forward Logistic Regression , 1986 .

[17]  Carlo Kopp,et al.  RANGE-LIMITED UAV TRAJECTORY USING TERRAIN MASKING UNDER RADAR DETECTION RISK , 2012, Appl. Artif. Intell..

[18]  Honggang Zhang,et al.  Variational Bayesian Matrix Factorization for Bounded Support Data , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Michael B. C. Khoo,et al.  Dichotomous Logistic Regression with Leave-One-Out Validation , 2010 .

[20]  Li Fu,et al.  The overview for UAV Air-Combat Decision method , 2014, The 26th Chinese Control and Decision Conference (2014 CCDC).

[21]  Guanghong Gong,et al.  Human performance modeling for manufacturing based on an improved KNN algorithm , 2016 .

[22]  Mitsuo Gen,et al.  Unusual human behavior recognition using evolutionary technique , 2009, Comput. Ind. Eng..

[23]  Ping Li,et al.  Current trends in the development of intelligent unmanned autonomous systems , 2017, Frontiers of Information Technology & Electronic Engineering.

[24]  Pietro Liò,et al.  Collective Human Mobility Pattern from Taxi Trips in Urban Area , 2012, PloS one.

[25]  Andrea d'Avella,et al.  Matrix factorization algorithms for the identification of muscle synergies: evaluation on simulated and experimental data sets. , 2006, Journal of neurophysiology.

[26]  Tugrul Oktay,et al.  Simultaneous Longitudinal and Lateral Flight Control Systems Design for Both Passive and Active Morphing TUAVs , 2017 .

[27]  Tal Shima,et al.  Integrated task assignment and path optimization for cooperating uninhabited aerial vehicles using genetic algorithms , 2011, Comput. Oper. Res..

[28]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[29]  Huijun Jiang,et al.  Human action tracking design of neural network algorithm based on GA-PSO in physical training , 2018, Cluster Computing.

[30]  Yu Zhang,et al.  Cooperative Trajectory Planning for Multiple UAVs Using Distributed Receding Horizon Control and Inverse Dynamics Optimization Method , 2015, ITITS.

[31]  W. Marsden I and J , 2012 .

[32]  Markus Flierl,et al.  Graph-Preserving Sparse Nonnegative Matrix Factorization With Application to Facial Expression Recognition , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[33]  Ling Luo,et al.  Personalized recommendation by matrix co-factorization with tags and time information , 2019, Expert Syst. Appl..

[34]  Animesh Chakravarthy,et al.  Collision avoidance laws for objects with arbitrary shapes , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[35]  Naixue Xiong,et al.  Deep Matrix Factorization With Implicit Feedback Embedding for Recommendation System , 2019, IEEE Transactions on Industrial Informatics.

[36]  R. J. Kuo,et al.  Taiwanese export trade forecasting using firefly algorithm based K-means algorithm and SVR with wavelet transform , 2016, Comput. Ind. Eng..