On the stability analysis of optimal state feedbacks as represented by deep neural models

Research has shown that the optimal feedback control of several nonlinear systems of interest in aerospace applications can be represented by deep neural architectures and trained using techniques including imitation learning, reinforcement learning and evolutionary algorithms. Such deep architectures are here also referred to as Guidance and Control Networks, or G&CNETs. In general, it is difficult to provide theoretical proofs of the stability of such neural control architectures, and of G&CNETs in particular, with respect to perturbations, time delays or model uncertainties, or to compute stability margins and trace them back to the network training process or architecture. In most cases the analysis of the trained network is performed via Monte Carlo experiments, and practitioners forgo any formal guarantee. This lack of validation naturally leads to scepticism, especially where safety and validation are of paramount importance, as is the case, for example, in the automotive and space industries. In an attempt to narrow the gap between deep learning research and control theory, we propose a new methodology based on differential algebra and automated differentiation to obtain formal guarantees on the behaviour of neural-based control systems.
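To give a concrete flavour of the kind of analysis involved (this is a minimal sketch, not the methodology proposed here): one of the simplest formal statements one can extract from a trained G&CNET is local asymptotic stability of the closed loop, obtained by differentiating the composition of the dynamics and the neural feedback at an equilibrium and inspecting the eigenvalues of the resulting Jacobian. The snippet below uses forward-mode automatic differentiation via dual numbers on a toy double-integrator plant with an illustrative 2-4-1 tanh network; the dynamics, architecture and weights are all assumptions made for the example.

```python
import numpy as np

class Dual:
    """Dual number: a value plus a vector of partial derivatives."""
    def __init__(self, val, grad):
        self.val = float(val)
        self.grad = np.asarray(grad, dtype=float)
    def __add__(self, o):
        o = o if isinstance(o, Dual) else Dual(o, np.zeros_like(self.grad))
        return Dual(self.val + o.val, self.grad + o.grad)
    __radd__ = __add__
    def __mul__(self, o):
        o = o if isinstance(o, Dual) else Dual(o, np.zeros_like(self.grad))
        # Product rule propagates the derivative vector.
        return Dual(self.val * o.val, self.val * o.grad + o.val * self.grad)
    __rmul__ = __mul__

def tanh_d(d):
    t = np.tanh(d.val)
    return Dual(t, (1.0 - t * t) * d.grad)

# Illustrative (random but fixed) weights for a 2-4-1 feedback network.
rng = np.random.default_rng(0)
W1 = rng.normal(size=(4, 2)) * 0.5
W2 = rng.normal(size=(1, 4)) * 0.5

def controller(x):
    """u = W2 @ tanh(W1 @ x); zero biases so the origin is an equilibrium."""
    h = []
    for i in range(4):
        s = Dual(0.0, np.zeros(2))
        for j in range(2):
            s = s + x[j] * float(W1[i, j])
        h.append(tanh_d(s))
    u = Dual(0.0, np.zeros(2))
    for i in range(4):
        u = u + h[i] * float(W2[0, i])
    return u

def dynamics(x):
    # Toy double integrator under neural feedback: x1' = x2, x2' = u(x).
    return [x[1], controller(x)]

# Seed each state with its unit partial-derivative vector and evaluate at 0.
x0 = [Dual(0.0, np.eye(2)[i]) for i in range(2)]
f = dynamics(x0)
A = np.vstack([fi.grad for fi in f])  # closed-loop Jacobian at the origin
eigs = np.linalg.eigvals(A)
print("closed-loop eigenvalues:", eigs)
# Local asymptotic stability holds iff max(eigs.real) < 0 for this toy model.
```

The paper's methodology goes beyond this first-order picture: differential algebra carries high-order Taylor expansions of the neural feedback, allowing bounds valid over a whole neighbourhood rather than only at the linearisation point.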