Numerical solution methods for differential game problems

Differential game theory provides a potential means for the parametric analysis of combat engagement scenarios. To determine its viability for this type of analysis, three frameworks for solving differential game problems are evaluated. Each method solves zero-sum, pursuit-evasion games in which two players have opposing goals. A solution to the saddle-point equilibrium problem is sought in which one player minimizes the value of the game while the other player maximizes it. The boundary value method is an indirect method that makes use of the analytical necessary conditions of optimality and is solved using a conventional optimal control framework. This method provides a high accuracy solution but has a limited convergence space that requires a good initial guess for both the state and less intuitive costate. The decomposition method in which optimal trajectories for each player are iteratively calculated is a direct method that bypasses the need for costate information. Because a linearized cost gradient is used to update the evader's strategy the initial conditions can heavily influence the convergence of the problem. The new method of neural networks involves the use of neural networks to govern the control policy for each player. An optimization tool adjusts the weights and biases of the network to form the control policy that results in the best final value of the game. An automatic differentiation engine provides gradient information for the sensitivity of each weight to the final cost. The final weights define the control policy's response to a range of initial conditions dependent upon the breadth of the state-space used to train each neural network. The neural nets are initialized with a normal distribution of weights so that no information regarding the state, costate, or switching structure of the controller is required. In its current form this method often converges to a suboptimal solution. Also, creative techniques are required when dealing with boundary conditions and free end-time problems. Thesis Supervisor: Steven R. Hall, Ph.D. Title: Professor of Aeronautics and Astronautics Thesis Supervisor: Russell Smith, Ph.D. Title: Principal Member of Technical Staff, C. S. Draper Laboratory, Inc.

[1]  Bruce A. Conway,et al.  Genetic Algorithm Preprocessing for Numerical Solution of Differential Games Problems , 2004 .

[2]  J. Shinar,et al.  Time-Varying Linear Pursuit-Evasion Game Models with Bounded Controls , 2002 .

[3]  J. Shinar,et al.  An approximate feedback solution of a variable speed non-linear pursuit-evasion game between two airplanes in a horizontal plane , 1980 .

[4]  Antony W Merz,et al.  The Homicidal Chauffeur - A Differential Game , 1971 .

[5]  N. J. Rao,et al.  Pursuit-Evasion of Two Aircraft in a Horizontal Plane , 1980 .

[6]  Kazuhiro Horie Collocation With Nonlinear Programming for Two-Sided Flight Path Optimization , 2002 .

[7]  H. J. Pesch,et al.  Complex differential games of pursuit-evasion type with state constraints, part 1: Necessary conditions for optimal open-loop strategies , 1993 .

[8]  A. Merz The game of two identical cars , 1972 .

[9]  J. Shinar,et al.  Qualitative study of a planar pursuit evasion game in the atmosphere , 1990 .

[10]  Y. Ho,et al.  Differential games and optimal pursuit-evasion strategies , 1965 .

[11]  A. W. Merz,et al.  Minimum Required Capture Radius in a Coplanar Model of the Aerial Combat Problem , 1977 .

[12]  Harri Ehtamo,et al.  Visual Aircraft Identification as a Pursuit-Evasion Game , 2000 .

[13]  Bruce A. Conway,et al.  Optimal Fighter Pursuit-Evasion Maneuvers Found via Two-Sided Optimization , 2006 .

[14]  A. W. Merz,et al.  The Homicidal Chauffeur , 1974 .

[15]  Bruce A. Conway,et al.  Optimal Interception of Evasive Missile Warheads: Numerical Solution of the Differential Game , 2008 .

[16]  A. W. Merz,et al.  Football as a differential game , 1992 .

[17]  V. Turova,et al.  Numerical Study of the Homicidal Chauueur Game , 2022 .

[18]  Laurent Hascoët Automatic Differentiation by Program Transformation , 2007 .

[19]  George M. Siouris,et al.  Applied Optimal Control: Optimization, Estimation, and Control , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[20]  A. W. Merz,et al.  The variable-speed tail-chase aerial combat problem , 1981 .

[21]  Rufus Isaacs,et al.  DIFFERENTIAL GAMES II: THE DEFINITION AND FORMULATION , 1954 .

[22]  P. Varaiya,et al.  Differential games , 1971 .

[23]  Thomas Kaminski,et al.  Recipes for adjoint code construction , 1998, TOMS.

[24]  J. Betts Survey of Numerical Methods for Trajectory Optimization , 1998 .

[25]  Richard Bellman,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[26]  Bernt Jarmark,et al.  Near-optimal flight trajectories generated by neural networks , 1994 .

[27]  T. Raivio Capture Set Computation of an Optimally Guided Missile , 2001 .

[28]  Y. Yavin,et al.  Game of two cars: Case of variable speed , 1989 .

[29]  H. J. Pesch,et al.  Complex differential games of pursuit-evasion type with state constraints, part 2: Numerical computation of optimal open-loop strategies , 1993 .

[30]  Rufus Isaacs,et al.  DIFFERENTIAL GAMES III: THE BASIC PRINCIPLES OF THE SOLUTION PROCESS , 1954 .

[31]  D. S. Hague,et al.  Coplanar tail-chase aerial combat as a differential game , 1977 .

[32]  T. Başar,et al.  Dynamic Noncooperative Game Theory , 1982 .

[34]  J. V. Breakwell,et al.  Role determination in an aerial dogfight , 1974 .

[35]  Lawrence F. Shampine,et al.  Solving Boundary Value Problems for Ordinary Differential Equations in M atlab with bvp 4 c , 2022 .

[36]  J. S. Przemieniecki Mathematical methods in defense analyses , 1994 .

[37]  Laurent Hascoët,et al.  TAPENADE 2.1 user's guide , 2004 .

[38]  B. Jarmark,et al.  Pursuit-evasion between two realistic aircraft , 1984 .

[39]  Eduardo D. Sontag,et al.  Neural Networks for Control , 1993 .

[40]  T. Raivio,et al.  On the Numerical Solution of a Class of Pursuit-Evasion Games , 2000 .