Adaptive stress testing of airborne collision avoidance systems

This paper presents a scalable method to efficiently search for the most likely state trajectory leading to an event given only a simulator of a system. Our approach uses a reinforcement learning formulation and solves it using Monte Carlo Tree Search (MCTS). The approach places very few requirements on the underlying system, requiring only that the simulator provide some basic controls, the ability to evaluate certain conditions, and a mechanism to control the stochasticity in the system. Access to the system state is not required, allowing the method to support systems with hidden state. The method is applied to stress test a prototype aircraft collision avoidance system to identify trajectories that are likely to lead to near mid-air collisions. We present results for both single and multi-threat encounters and discuss their relevance. Compared with direct Monte Carlo search, this MCTS method performs significantly better both in finding events and in maximizing their likelihood.

[1]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[2]  Mykel J. Kochenderfer,et al.  Next-Generation Airborne Collision Avoidance System , 2012 .

[3]  Jean-Baptiste Jeannin,et al.  A Formally Verified Hybrid System for the Next-Generation Airborne Collision Avoidance System , 2015, TACAS.

[4]  James K. Kuchar,et al.  The Traffic Alert and Collision Avoidance System , 2007 .

[5]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[6]  J. Charles,et al.  A Sino-German λ 6 cm polarization survey of the Galactic plane I . Survey strategy and results for the first survey region , 2006 .

[7]  Mykel J. Kochenderfer,et al.  A decision-theoretic approach to developing robust collision avoidance logic , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[8]  Christian von Essen,et al.  Probabilistic verification and synthesis of the next generation airborne collision avoidance system , 2015, International Journal on Software Tools for Technology Transfer.

[9]  Ole J. Mengshoel,et al.  Adaptive stress testing of airborne collision avoidance systems , 2015 .

[10]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[11]  Christian von Essen,et al.  Analyzing the Next Generation Airborne Collision Avoidance System , 2014, TACAS.

[12]  B J Chludzinski Evaluation of TCAS II Version 7.1 Using the FAA Fast-Time Encounter Generator Model, Volume 1 , 2009 .

[13]  Nataliya Sokolovska,et al.  Continuous Upper Confidence Trees , 2011, LION.

[14]  Ben Tse,et al.  Autonomous Inverted Helicopter Flight via Reinforcement Learning , 2004, ISER.

[15]  Long Ji Lin,et al.  Reinforcement Learning of Non-Markov Decision Processes , 1995, Artif. Intell..

[16]  Jean-Baptiste Jeannin,et al.  Formally Verified Safe Vertical Maneuvers for Non-deterministic, Accelerating Aircraft Dynamics , 2017, ITP.

[17]  Arnold Barnett Cross-National Differences in Aviation Safety Records , 2010, Transp. Sci..

[18]  Simon M. Lucas,et al.  A Survey of Monte Carlo Tree Search Methods , 2012, IEEE Transactions on Computational Intelligence and AI in Games.

[19]  Mykel J. Kochenderfer,et al.  Airspace Encounter Models for Estimating Collision Risk , 2010 .

[20]  Mykel J. Kochenderfer,et al.  Optimizing the Next Generation Collision Avoidance System for Safe, Suitable, and Acceptable Operational Performance , 2013 .

[21]  Anshu Saksena,et al.  Probabilistic model checking of the next-generation airborne collision avoidance system , 2016, 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC).