Aspects of intuitive control: Stabilize, optimize, and identify

This paper presents a novel interpretation to intuitive control with aspects of adaptive as well as optimal control. The regulation of system states in an iterative fashion to approach an optimal controller is presented. The system model is assumed to be linear time invariant with unknown internal dynamics. The adaptive portion employs a Model Reference Adaptive controller to stabilize the plant and subsequently provide an initial stabilizing gain for a Reinforcement Learning based policy iteration algorithm. The objective based decomposition of the controller into stabilization and optimization phases can prove instrumental in applications of robotics and aircraft navigation. A novel method for identification of unknown internal dynamics (MIMO plant) is also presented based on information from Optimization phase. The performance of this approach is demonstrated on an aircraft lateral-directional control application.

[1]  Noah J. Cowan,et al.  Autostabilizing airframe articulation: Animal inspired air vehicle control , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[2]  Andreas Pitsillides,et al.  Mimicking the bird flocking behavior for controlling congestion in sensor networks , 2010, 2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010).

[3]  Richard S. Sutton,et al.  Reinforcement Learning is Direct Adaptive Optimal Control , 1992, 1991 American Control Conference.

[4]  Jae Young Lee,et al.  A novel generalized value iteration scheme for uncertain continuous-time linear systems , 2010, 49th IEEE Conference on Decision and Control (CDC).

[5]  D. Bernstein Matrix Mathematics: Theory, Facts, and Formulas , 2009 .

[6]  Frank L. Lewis,et al.  Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..

[7]  Richard S. Sutton,et al.  Neural networks for control , 1990 .

[8]  D. Kleinman On an iterative technique for Riccati equation computations , 1968 .

[9]  Erik Maehle,et al.  SelSta - A Biologically Inspired Approach for Self-Stabilizing Humanoid Robot Walking , 2010, DIPES/BICC.

[10]  George M. Siouris,et al.  Applied Optimal Control: Optimization, Estimation, and Control , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  M. Branicky Multiple Lyapunov functions and other analysis tools for switched and hybrid systems , 1998, IEEE Trans. Autom. Control..

[12]  F. Lewis,et al.  Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.

[13]  Petros A. Ioannou,et al.  Robust Adaptive Control , 2012 .

[14]  Jason L. Speyer,et al.  Primer on Optimal Control Theory , 2010 .