Optimal real-time landing using deep networks

Optimal trajectories for spacecraft guidance, be it during orbital transfers or landing sequences are often pre-computed on ground and used as nominal desired solutions later tracked by a secondary control system. Linearization of the dynamics around such nominal profiles allows to cancel the error during the actual navigation phase when the trajectory is executed. In this study, instead, we assess the possibility of having the optimal guidance profile be represented on-board by a deep artificial neural network trained, using supervised learning, to represent the optimal control structure. We show how the deep network is able to learn the structure of the optimal state-feedback outside of the training data and with great precision. We apply our method to different interesting optimal control problems, a multicopter time and power optimal pinpoint landing control problem and two different mass optimal spacecraft landing problems. In all cases, the deep network is able to safely learn the optimal state-feedback, also outside of the training data, making it a viable candidate for the implementation of a reactive real-time optimal control architecture.

[1]  Randal W. Beard,et al.  Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation , 1997, Autom..

[2]  M. Bardi,et al.  Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations , 1997 .

[3]  Michael A. Saunders,et al.  SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization , 2002, SIAM J. Optim..

[4]  J. van Leeuwen,et al.  Neural Networks: Tricks of the Trade , 2002, Lecture Notes in Computer Science.

[5]  F. Lewis,et al.  A Hamilton-Jacobi setup for constrained neural network control , 2003, Proceedings of the 2003 IEEE International Symposium on Intelligent Control.

[6]  E. Todorov Optimality principles in sensorimotor control , 2004, Nature Neuroscience.

[7]  Behcet Acikmese,et al.  Convex programming approach to powered descent guidance for mars landing , 2007 .

[8]  Yuval Tassa,et al.  Least Squares Solutions of the HJB Equation With Neural Network Value-Function Approximators , 2007, IEEE Transactions on Neural Networks.

[9]  John T. Betts,et al.  Practical Methods for Optimal Control and Estimation Using Nonlinear Programming , 2009 .

[10]  Yann LeCun,et al.  What is the best multi-stage architecture for object recognition? , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11]  Farzad Pourboghrat,et al.  Optimal control of nonlinear systems using RBF neural network and adaptive extended Kalman filter , 2009, 2009 American Control Conference.

[12]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[13]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[14]  Sohrab Effati,et al.  Optimal control problem via neural networks , 2013, Neural Computing and Applications.

[15]  Raffaello D'Andrea,et al.  Performance benchmarking of quadrotor systems using time-optimal control , 2012, Auton. Robots.

[16]  Sergey Levine,et al.  Exploring Deep and Recurrent Architectures for Optimal Control , 2013, ArXiv.

[17]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.

[18]  D. Izzo,et al.  Nonlinear model predictive control applied to vision-based spacecraft landing , 2013 .

[19]  Wang Ding,et al.  Constrained online optimal control for continuous-time nonlinear systems using neuro-dynamic programming , 2014, Proceedings of the 33rd Chinese Control Conference.

[20]  Konrad P. Körding,et al.  Deep networks for motor control functions , 2015, Front. Comput. Neurosci..

[21]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[22]  Sergey Levine,et al.  Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[23]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..