Integrated Hybrid Planning and Programmed Control for Real Time UAV Maneuvering

The automatic generation of realistic behaviour such as tactical intercepts for Unmanned Aerial Vehicles (UAV) in air combat is a challenging problem. State-of-the-art solutions propose hand--crafted algorithms and heuristics whose performance depends heavily on the initial conditions and aerodynamic properties of the UAVs involved. This paper shows how to employ domain--independent planners, embedded into professional multi--agent simulations, to implement two--level Model Predictive Control (MPC) hybrid control systems for simulated UAVs. We compare the performance of controllers using planners with others based on behaviour trees that implement real world tactics. Our results indicate that hybrid planners derive novel and effective tactics from first principles inherent to the dynamical constraints UAVs are subject to.

[1]  Alberto Bemporad,et al.  The explicit linear quadratic regulator for constrained systems , 2003, Autom..

[2]  Andrew G. Barto,et al.  Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[3]  John Thangarajah,et al.  A framework for modelling tactical decision-making in autonomous systems , 2015, J. Syst. Softw..

[4]  Hector Geffner,et al.  Purely Declarative Action Descriptions are Overrated: Classical Planning with Simulators , 2017, IJCAI.

[5]  Edward A. Lee,et al.  Introduction to Embedded Systems - A Cyber-Physical Systems Approach , 2013 .

[6]  Gil Tidhar,et al.  Flying Together: Modelling Air Mission Teams , 1998, Applied Intelligence.

[7]  Maria Fox,et al.  Modelling Mixed Discrete-Continuous Domains for Planning , 2006, J. Artif. Intell. Res..

[8]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[9]  Kevin McDonald,et al.  Augmented Reality as an Interface to Air Combat Multi-Agent Simulation , 2015 .

[10]  Tim Miller,et al.  Real-Time UAV Maneuvering via Automated Planning in Simulations , 2017, IJCAI.

[11]  Csaba Szepesvári,et al.  Bandit Based Monte-Carlo Planning , 2006, ECML.

[12]  Paolo Traverso,et al.  Automated Planning: Theory & Practice , 2004 .

[13]  Hector Geffner,et al.  Classical Planning with Simulators: Results on the Atari Video Games , 2015, IJCAI.

[14]  Alberto Bemporad,et al.  Predictive Control for Linear and Hybrid Systems , 2017 .

[15]  Michael W Byrnes Nightfall: Machine Autonomy in Air-to-Air Combat , 2014 .

[16]  Pieter Abbeel,et al.  Combining model-based policy search with online model learning for control of physical humanoids , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[17]  Robert L. Shaw,et al.  Fighter Combat: Tactics and Maneuvering , 1985 .

[18]  Hector Geffner,et al.  Width and Serialization of Classical Planning Problems , 2012, ECAI.

[19]  T. L. McCluskey,et al.  Towards The Integration of Model Predictive Control into an AI Planning Framework , 2016, PlanSIG.

[20]  Stefano Di Cairano,et al.  An Industry Perspective on MPC in Large Volumes Applications: Potential Benefits and Open Challenges , 2012 .

[21]  Maria Fox,et al.  Heuristic Planning for PDDL+ Domains , 2016, AAAI Workshop: Planning for Hybrid Systems.

[22]  Patrik Haslum,et al.  Interval-Based Relaxation for General Numeric Planning , 2016, ECAI.

[23]  Hector Geffner Functional strips: a more flexible language for planning and problem solving , 2000 .

[24]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[25]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[26]  Jonathan P. How,et al.  Aircraft trajectory planning with collision avoidance using mixed integer linear programming , 2002, Proceedings of the 2002 American Control Conference (IEEE Cat. No.CH37301).

[27]  Nicholas Roy,et al.  Air-Combat Strategy Using Approximate Dynamic Programming , 2008 .

[28]  Daniel Liberzon,et al.  Switching in Systems and Control , 2003, Systems & Control: Foundations & Applications.

[29]  Hector Geffner,et al.  Width-Based Planning for General Video-Game Playing , 2015, AIIDE.

[30]  Maria Fox,et al.  PDDL2.1: An Extension to PDDL for Expressing Temporal Planning Domains , 2003, J. Artif. Intell. Res..

[31]  Benedetto Intrigila,et al.  UPMurphi: A Tool for Universal Planning on PDDL+ Problems , 2009, ICAPS.

[32]  Alex M. Andrew,et al.  Knowledge in Action: Logical Foundations for Specifying and Implementing Dynamical Systems , 2002 .

[33]  Clinton Heinze,et al.  Thinking Quickly: Agents for Modeling Air Warfare , 1998, Australian Joint Conference on Artificial Intelligence.

[34]  R. Sanfelice,et al.  Hybrid dynamical systems , 2009, IEEE Control Systems.

[35]  Petter Ögren,et al.  How Behavior Trees Modularize Hybrid Control Systems and Generalize Sequential Behavior Compositions, the Subsumption Architecture, and Decision Trees , 2017, IEEE Transactions on Robotics.

[36]  Hector Geffner,et al.  ∃-STRIPS: Existential Quantification in Planning and Constraint Satisfaction , 2016, IJCAI.

[37]  Michael Lewis,et al.  Game theory for automated maneuvering during air-to-air combat , 1990 .

[38]  Paolo Traverso,et al.  Automated planning - theory and practice , 2004 .

[39]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[40]  Enrico Scala,et al.  Numeric Kernel for Reasoning about Plans Involving Numeric Fluents , 2013, AI*IA.

[41]  Michael Papasimeon,et al.  Simulating Fighter Pilots , 2007 .

[42]  Michael Papasimeon,et al.  Specifying Agent Behaviour with Use Cases , 2000, PRIMA.

[43]  Bernhard Nebel,et al.  A Planning Based Framework for Controlling Hybrid Systems , 2012, ICAPS.

[44]  D. Hull Fundamentals of Airplane Flight Mechanics , 2007 .

[45]  Min-Jea Tahk,et al.  Differential Game Based Air Combat Maneuver Generation Using Scoring Function Matrix , 2016 .