A Unified Approach of Controller Design and System Identification for Optimal Control of Single-Input Nonlinear Systems

This paper presents a novel unified approach of controller design and identification for unknown input affine nonlinear systems. An issue with obtaining the best performance of optimal control is that identification errors degrade control performance. One solution to overcome it is direct controller tuning without system identification, which is expected to give high quality control. However, many experiments with various control inputs are required in the design procedure. The proposed framework simultaneously implements system identification and controller design. This method adopts a weighted least squares method to cope with various identification criteria. An unknown plant system is identified with a weight selected appropriately for control. It is selected in a simple manner based on an analytical discussion of nonlinear optimal control theory, namely the Hamilton-Jacobi-Bellman equation. An iterative calculation procedure for a preliminarily given data set is derived. This iterative algorithm gives a (sub-)optimal pair of the weight and the model parameter that improves the control performance. Numerical results demonstrate that the proposed approach achieves better optimal control performance than the standard least squares method.

[1]  Junichi Murata,et al.  Ecological driving based on preceding vehicle prediction using MPC , 2011 .

[2]  Frank L. Lewis,et al.  2009 Special Issue: Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems , 2009 .

[3]  Jianqiu Li,et al.  A review on the key issues for lithium-ion battery management in electric vehicles , 2013 .

[4]  Arthur J. Krener,et al.  Numerical Solutions to the Bellman Equation of Optimal Control , 2013, Journal of Optimization Theory and Applications.

[5]  Michel Gevers,et al.  Iterative weighted least-squares identification and weighted LQG control design , 1995, Autom..

[6]  G. Saridis,et al.  Approximate Solutions to the Time-Invariant Hamilton–Jacobi–Bellman Equation , 1998 .

[7]  Alexandre Trofino,et al.  A note on optimal control of a class of single input nonlinear systems , 2015, 2015 American Control Conference (ACC).

[8]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[9]  Michel Gevers,et al.  Identification for Control: From the Early Achievements to the Revival of Experiment Design , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[10]  George G. Lendaris,et al.  Adaptive dynamic programming , 2002, IEEE Trans. Syst. Man Cybern. Part C.

[11]  Dennis S. Bernstein,et al.  Direct frequency response function based, uncertainty accomodating optimal controller design , 2009, 2009 American Control Conference.

[12]  Kenji Fujimoto,et al.  Approximate Solutions to Hamilton-Jacobi Equations Based on Chebyshev Polynomials , 2008 .

[13]  Sergio M. Savaresi,et al.  Virtual reference feedback tuning: a direct method for the design of feedback controllers , 2002, Autom..

[14]  Philippe Chevrel,et al.  Shared Steering Control Between a Driver and an Automation: Stability in the Presence of Driver Behavior Uncertainty , 2013, IEEE Transactions on Intelligent Transportation Systems.

[15]  Klaus-Robert Müller,et al.  Covariate Shift Adaptation by Importance Weighted Cross Validation , 2007, J. Mach. Learn. Res..

[16]  Svante Gunnarsson,et al.  Iterative feedback tuning: theory and applications , 1998 .

[17]  Shigeru Yamamoto,et al.  Comparative Study of Model-Free Predictive Control and Its Database Maintenance for Unstable Systems , 2015 .

[18]  Toshiharu Sugie,et al.  LQ Optimal Controller Design Method based on Experimental Data , 1996 .

[19]  Takao Fujii,et al.  Fictitious Reference Iterative Tuning in the Two-Degree of Freedom Control Scheme and Its Application to a Facile Closed Loop System Identification , 2006 .

[20]  Toshiyuki Ohtsuka,et al.  Solutions to the Hamilton-Jacobi Equation With Algebraic Gradients , 2011, IEEE Transactions on Automatic Control.