Multiple sliding surface guidance for planetary landing : Tuning and optimization via reinforcement learning

The problem of achieving pinpoint landing accuracy in future space missions to extra-terrestrial bodies such as the Moon or Mars presents many challenges, including the requirements of higher accuracy and more flexibility. These new challenges may require the development of novel and more advanced guidance algorithms. Conventional guidance schemes, which generally require a combination of off-line trajectory generation and real-time, trajectory tracking algorithms, have worked well in the past but may not satisfy the more stringent and difficult landing requirements imposed by future mission architectures to bring landers very near to specified locations. In this paper, a novel non-linear guidance algorithm for planetary landing is proposed and analyzed. Based on Higher-Order Sliding Control (HOSC) theory, the Multiple Sliding Surface Guidance (MSSG) algorithms has been specifically designed to take advantage of the ability of the system to reach the sliding surface in a finite time. The high control activity seen in typical sliding controllers is avoided in this formulation, resulting in a guidance law that is both globally stable and robust against unknown, but bounded perturbations. The proposed MSSG does not require any off-line trajectory generation and therefore it is flexible enough to target a large variety of point on the planet’s surface without the need for calculation of multiple reference trajectories. However, after initial analysis, it has been seen that the performance of MSSG is very sensitive to the choice in guidance gains. MSSG generated trajectories have been compared to an optimal solution to begin an investigation of the relationship between the optimality and performance of MSSG and the selection of the guidance parameters. A full study has been performed to investigate and tune the parameters of MSSG utilizing reinforcement learning in order to truly optimize the performance of the MSSG algorithm. Results show that the MSSG algorithm can indeed generate trajectories that come very close to the optimal solution in terms of fuel usage. A full comparison of the trajectories is included, as well as a further study examining the capability of the MSSG algorithm under perturbed conditions using the optimized set of parameters.

[1]  S. Ploen,et al.  Performance trades for Mars pinpoint landing , 2006, 2006 IEEE Aerospace Conference.

[2]  Moshe Idan,et al.  Integrated Sliding Mode Guidance and Control for Missile with On-Off Actuators , 2006 .

[3]  Arie Levant,et al.  Principles of 2-sliding mode design , 2007, Autom..

[4]  Roberto Furfaro,et al.  Non-linear Sliding Guidance algorithms for precision lunar landing , 2011 .

[5]  Arie Levant,et al.  Quasi-continuous high-order sliding-mode controllers , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[6]  Kenneth D. Mease,et al.  Fuel Efficient Powered Descent Guidance for Mars Landing , 2005 .

[7]  Yuri B. Shtessel,et al.  Aeronautical and space vehicle control in dynamic sliding manifolds , 2003 .

[8]  Anil V. Rao,et al.  Direct Trajectory Optimization and Costate Estimation via an Orthogonal Collocation Method , 2006 .

[9]  E.C. Wong,et al.  Guidance and Control Design for Powered Descent and Landing on Mars , 2007, 2007 IEEE Aerospace Conference.

[10]  Jos F. Sturm,et al.  A Matlab toolbox for optimization over symmetric cones , 1999 .

[11]  Roberto Furfaro,et al.  Asteroid Precision Landing via Multiple Sliding Surfaces Guidance Techniques , 2013 .

[12]  Arie Levant,et al.  Homogeneity approach to high-order sliding mode design , 2005, Autom..

[13]  Federico Najson,et al.  A Computationally Non-Expensive Guidance Algorithm for Fuel Efficient Soft Landing , 2005 .

[14]  D. Criswell,et al.  Lunar resources and their utilization , 1977 .

[15]  A. Levant Sliding order and sliding accuracy in sliding mode control , 1993 .

[16]  Yuri B. Shtessel,et al.  Aeronautical and space vehicle control in dynamic sliding manifolds , 2003 .

[17]  Stephen P. Banks,et al.  Sliding Mode Control with Optimal Sliding Surfaces for Missile Autopilot Design , 2000 .

[18]  Behcet Acikmese,et al.  Convex programming approach to powered descent guidance for mars landing , 2007 .

[19]  Behçet Açikmese,et al.  Lossless convexification of a class of optimal control problems with non-convex control constraints , 2011, Autom..

[20]  Yuri B. Shtessel,et al.  Integrated Higher-Order Sliding Mode Guidance and Autopilot for Dual Control Missiles , 2009 .

[21]  S. N. Balakrishnan,et al.  Reentry Terminal Guidance Through Sliding Mode Control , 2010 .

[22]  Allan R. Klumpp A manually retargeted automatic landing system for lunar module /LM/. , 2003 .

[23]  Weiping Li,et al.  Applied Nonlinear Control , 1991 .

[24]  Behcet Acikmese,et al.  Minimum-Landing-Error Powered-Descent Guidance for Mars Landing Using Convex Optimization , 2010 .

[25]  Allan R. Klumpp,et al.  Apollo lunar descent guidance , 1974, Autom..

[26]  Christopher N. D'Souza,et al.  AN OPTIMAL GUIDANCE LAW FOR PLANETARY LANDING , 1997 .

[27]  Michael A. Saunders,et al.  SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization , 2002, SIAM J. Optim..

[28]  Arie Levant,et al.  Higher-order sliding modes, differentiation and output-feedback control , 2003 .

[29]  Anil V. Rao,et al.  Algorithm 902: GPOPS, A MATLAB software for solving multiple-phase optimal control problems using the gauss pseudospectral method , 2010, TOMS.

[30]  Yuri B. Shtessel,et al.  Missile Controlled by Lift and Divert Thrusters Using Nonlinear Dynamic Sliding Manifolds , 2006 .