Adaptive Linear Quadratic Gaussian Control: The Cost-Biased Approach Revisited

In adaptive control, a standard approach is to resort to the so-called certainty equivalence principle which consists of generating some standard parameter estimate and then using it in the control law as if it were the true parameter. As a consequence of this philosophy, the estimation problem is decoupled from the control problem and this substantially simplifies the corresponding adaptive control scheme. On the other hand, the complete absence of dual properties makes certainty equivalent controllers run into an identifiability problem which generally leads to a strictly suboptimal performance. In this paper, we introduce a cost-biased parameter estimator to overcome this difficulty. This estimator is applied to a linear quadratic Gaussian controller. The corresponding adaptive scheme is proven to be stable and optimal when the unknown system parameter lies in an infinite, yet compact, parameter set.

[1]  Michel Loève,et al.  Probability Theory I , 1977 .

[2]  H. Teicher,et al.  Probability theory: Independence, interchangeability, martingales , 1978 .

[3]  V. Borkar,et al.  Adaptive control of Markov chains, I: Finite parameter set , 1979, 1979 18th IEEE Conference on Decision and Control including the Symposium on Adaptive Processes.

[4]  P. Hall,et al.  Martingale Limit Theory and Its Application , 1980 .

[5]  J. Douglas Faires,et al.  Numerical Analysis , 1981 .

[6]  P. R. Kumar,et al.  Stochastic control of a queue with two servers of different rates , 1982 .

[7]  P. Kumar,et al.  A new family of optimal adaptive controllers for Markov chains , 1982 .

[8]  P. Kumar Simultaneous identification and adaptive control of unknown systems over finite parameter sets , 1983 .

[9]  P. R. Kumar,et al.  Optimal Adaptive Control of Linear-Quadratic-Gaussian Systems , 1983 .

[10]  D. Delchamps Analytic feedback control and the algebraic Riccati equation , 1984 .

[11]  Jan Sternby,et al.  Consistency in least-squares estimation: A Bayesian approach , 1984, Autom..

[12]  P. Kumar,et al.  Adaptive control with the stochastic approximation algorithm: Geometry and convergence , 1985 .

[13]  Woei Lin,et al.  Will the self-tuning approach work for general cost criteria? , 1985 .

[14]  Stability in the Sense of Bounded Average Power , 1985 .

[15]  Pravin Varaiya,et al.  Stochastic Systems: Estimation, Identification, and Adaptive Control , 1986 .

[16]  Han-Fu Chen,et al.  Convergence rate of least-squares identification and adaptive control for stochastic systems† , 1986 .

[17]  Han-Fu Chen,et al.  Optimal adaptive control and consistent parameter estimates for ARMAX model with quadratic cost , 1986, 1986 25th IEEE Conference on Decision and Control.

[18]  Dimitri P. Bertsekas,et al.  Dynamic Programming: Deterministic and Stochastic Models , 1987 .

[19]  Han-Fu Chen,et al.  Optimal adaptive control and consistent parameter estimates for ARMAX model withquadratic cost , 1987 .

[20]  Lei Guo,et al.  Asymptotically optimal adaptive control with consistent parameter estimates , 1987 .

[21]  Guo Lei,et al.  A robust stochastic adaptive controller , 1988 .

[22]  Tyrone E. Duncan,et al.  Adaptive control of continuous-time linear stochastic systems , 1990, Math. Control. Signals Syst..

[23]  S. Bittanti,et al.  Recursive least-squares identification algorithms with incomplete excitation: convergence analysis and application to adaptive control , 1990 .

[24]  P. Kumar,et al.  Convergence of Least-Squares Parameter Estimate Based Adaptive Control Schemes , 1990 .

[25]  B. Pasik-Duncan,et al.  Adaptive control of continuous-time linear stochastic systems , 1990, Math. Control. Signals Syst..

[26]  Han-Fu Chen,et al.  Identification and Stochastic Adaptive Control , 1991 .

[27]  Control theory methods for consistency in some least squares identification problems , 1993, IEEE Trans. Autom. Control..

[28]  Lei Guo Self-convergence of weighted least-squares with applications to stochastic adaptive control , 1996, IEEE Trans. Autom. Control..

[29]  Marco C. Campi,et al.  The problem of pole-zero cancellation in transfer function identification and application to adaptive stabilization , 1996, Autom..