Policy compression for aircraft collision avoidance systems

One approach to designing the decision making logic for an aircraft collision avoidance system is to frame the problem as Markov decision process and optimize the system using dynamic programming. The resulting strategy can be represented as a numeric table. This methodology has been used in the development of the ACAS X family of collision avoidance systems for manned and unmanned aircraft. However, due to the high dimensionality of the state space, discretizing the state variables can lead to very large tables. To improve storage efficiency, we propose two approaches for compressing the lookup table. The first approach exploits redundancy in the table. The table is decomposed into a set of lower-dimensional tables, some of which can be represented by single tables in areas where the lower-dimensional tables are identical or nearly identical with respect to a similarity metric. The second approach uses a deep neural network to learn a complex non-linear function approximation of the table. With the use of an asymmetric loss function and a gradient descent algorithm, the parameters for this network can be trained to provide very accurate estimates of values while preserving the relative preferences of the possible advisories for each state. As a result, the table can be approximately represented by only the parameters of the network, which reduces the required storage space by a factor of 1000. Simulation studies show that system performance is very similar using either compressed table representation in place of the original table. Even though the neural network was trained directly on the original table, the network surpasses the original table on the performance metrics and encounter sets evaluated here.

[1]  Mykel J. Kochenderfer,et al.  Compression of Optimal Value Functions for Markov Decision Processes , 2013, 2013 Data Compression Conference.

[2]  Shie Mannor,et al.  Reinforcement learning with Gaussian processes , 2005, ICML.

[3]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[4]  Mykel J. Kochenderfer,et al.  Airspace Encounter Models for Estimating Collision Risk , 2010 .

[5]  Klaus-Robert Müller,et al.  Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.

[6]  Ryan W. Gardner,et al.  Enabling real-time execution: Runtime-efficiency improvements for the next-generation airborne collision avoidance algorithms , 2016, 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC).

[7]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[8]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[9]  Mykel J. Kochenderfer,et al.  Robust Airborne Collision Avoidance through Dynamic Programming , 2011 .

[10]  Sven F. Crone,et al.  Training artificial neural networks for time series prediction using asymmetric cost functions , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[11]  Andrew W. Moore,et al.  Variable Resolution Discretization in Optimal Control , 2002, Machine Learning.

[12]  Mykel J. Kochenderfer,et al.  Next-Generation Airborne Collision Avoidance System , 2012 .

[13]  Tara N. Sainath,et al.  Improving deep neural networks for LVCSR using rectified linear units and dropout , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Mike Marston,et al.  ACAS-Xu Initial Self-Separation Flight Tests , 2015 .

[15]  John Salvatier,et al.  Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.

[16]  Mykel J. Kochenderfer,et al.  Short-term conflict resolution for unmanned aircraft traffic management , 2015, 2015 IEEE/AIAA 34th Digital Avionics Systems Conference (DASC).

[17]  Jonathan P. How,et al.  Optimized Airborne Collision Avoidance , 2015 .

[18]  Mykel J. Kochenderfer,et al.  Accounting for State Uncertainty in Collision Avoidance , 2011 .

[19]  James K. Kuchar,et al.  A review of conflict detection and resolution modeling methods , 2000, IEEE Trans. Intell. Transp. Syst..

[20]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[21]  Jeffrey K. Uhlmann,et al.  Unscented filtering and nonlinear estimation , 2004, Proceedings of the IEEE.

[22]  Guoqiang Peter Zhang,et al.  Neural networks for classification: a survey , 2000, IEEE Trans. Syst. Man Cybern. Part C.