System identification approach for inverse optimal control of finite-horizon linear quadratic regulators

Abstract The inverse optimal control for finite-horizon discrete-time linear quadratic regulators is investigated in this paper, which is to estimate the parameters in the objective function using noisy measurements of partial optimal states only. By the Pontryagin’s minimum principle, the concerned inverse optimal control problem is recast as the identification of a parameterized causal-and-anticausal mixed system excited by boundary conditions. Sufficient identifiability conditions for the unknown parameters are provided in terms of the system model itself, rather than relying on the exact values of optimal states or control inputs. In addition, an elegant algebraic solution is provided for the concerned identification problem that is inherently a challenging optimization problem with trilinear equality constraints, and it can recover the true parameters (up to a scalar ambiguity) in the absence of measurement noise or can consistently identify the parameters (up to a scalar ambiguity) in the presence of white measurement noise. The presented algebraic solution relies on recursive matrix calculations so that its computational burden is much less than directly solving a high-dimensional non-convex optimization problem as done in many existing works. The effectiveness of the proposed method as well as its noise sensitivity issue is illustrated by simulation examples.

[1]  Ondřej Straka,et al.  Noise covariance matrices in state‐space models: A survey and comparison of estimation methods—Part I , 2017 .

[2]  Johannes P. Schlöder,et al.  Estimating Parameters in Optimal Control Problems , 2012, SIAM J. Sci. Comput..

[3]  Lennart Ljung,et al.  Constrained Subspace Method for the Identification of Structured State-Space Models (COSMOS) , 2020, IEEE Transactions on Automatic Control.

[4]  Tristan Perez,et al.  Finite-horizon inverse optimal control for discrete-time nonlinear systems , 2018, Autom..

[5]  Jongeun Choi,et al.  Solutions to the Inverse LQR Problem With Application to Biological Systems Analysis , 2015, IEEE Transactions on Control Systems Technology.

[6]  Xiaoming Hu,et al.  Inverse optimal control for discrete-time finite-horizon Linear Quadratic Regulators , 2019, Autom..

[7]  Vishal Gupta,et al.  Data-driven estimation in equilibrium using inverse optimization , 2013, Mathematical Programming.

[8]  Michel Verhaegen,et al.  Data-driven fault estimation of non-minimum phase LTI systems , 2018, Autom..

[9]  R. E. Kalman,et al.  When Is a Linear Control System Optimal , 1964 .

[10]  V. Verdult,et al.  Filtering and System Identification: A Least Squares Approach , 2007 .

[11]  Chuanbo Guo,et al.  Impacts of hatchery-reared mandarin fish Siniperca chuatsi stocking on wild fish community and water quality in a shallow Yangtze lake , 2018, Scientific Reports.

[12]  B. Anderson,et al.  Nonlinear regulator theory and an inverse optimal control problem , 1973 .

[13]  Jean-Paul Laumond,et al.  From human to humanoid locomotion—an inverse optimal control approach , 2010, Auton. Robots.

[14]  Tristan Perez,et al.  Discrete-time inverse optimal control with partial-state information: A soft-optimality approach with constrained state estimation , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[15]  Michel Verhaegen,et al.  Identification of affinely parameterized state-space models with unknown inputs , 2020, Autom..

[16]  Nacim Ramdani,et al.  Towards solving inverse optimal control in a bounded-error framework , 2015, 2015 American Control Conference (ACC).

[17]  John E. R. Staddon,et al.  Optima for animals , 1982 .

[18]  Kalyanmoy Deb,et al.  Handling inverse optimal control problems using evolutionary bilevel optimization , 2016, 2016 IEEE Congress on Evolutionary Computation (CEC).

[19]  Dirk Wollherr,et al.  An Inverse Optimal Control Approach to Explain Human Arm Reaching Control Based on Multiple Internal Models , 2018, Scientific Reports.

[20]  Jongeun Choi,et al.  Determining Human Control Intent Using Inverse LQR Solutions , 2013 .

[21]  Michel Verhaegen,et al.  Subspace identification of individual systems in a large-scale heterogeneous network , 2019, Autom..

[22]  Dana Kulic,et al.  Inverse Optimal Control with Incomplete Observations , 2018, ArXiv.