Numerical methods for infinite horizon risk sensitive stochastic control

We are concerned with numerical methods for infinite-time horizon risk sensitive stochastic control problems. Let x/sub t/ be a controlled Markov process in R/sup n/, governed by a given stochastic differential equation. The controller can observe x/sub t/ (state feedback). In risk sensitive control, exponential running cost criteria are considered. A measure of risk sensitivity is fixed. On a finite time horizon, consider the optimal cost function. Under suitable assumptions, the Isaacs partial differential equation (PDE) for a stochastic differential game, with average cost per unit time payoff criterion, is satisfied. In this game, the minimizing player chooses control u/sub t/. The maximizing player, corresponding to "unfriendly nature" chooses a control v/sub t/. The exponential LQR (LEQR) problem leads to a deterministic differential game, which is the same game which arises in state space robust control formulations of disturbance attenuation problems. LEQR problems reduce to solving matrix Riccati equations. This technique is no longer available for nonlinear dynamics or nonquadratic cases. Instead, we resort to numerical solution of the PDE. An heuristic is used.<<ETX>>

[1]  P. Whittle Risk-Sensitive Optimal Control , 1990 .

[2]  E. Barron,et al.  Total risk aversion, stochastic optimal control, and differential games , 1989 .

[3]  S. Varadhan,et al.  Asymptotic evaluation of certain Markov process expectations for large time , 1975 .

[4]  T. Runolfsson The equivalence between infinite-horizon optimal control of stochastic systems with exponential-of-integral performance index and stochastic differential games , 1994, IEEE Trans. Autom. Control..

[5]  P. Khargonekar,et al.  State-space solutions to standard H/sub 2/ and H/sub infinity / control problems , 1989 .

[6]  Matthew R. James,et al.  Numerical approximation of the H∞ norm for nonlinear systems , 1995, Autom..

[7]  P. Whittle A risk-sensitive maximum principle , 1990 .

[8]  Fabien Campillo,et al.  Numerical methods in ergodic optimal stochastic control application to semi-active vehicle suspensions , 1990, 29th IEEE Conference on Decision and Control.

[9]  Matthew R. James,et al.  Asymptotic analysis of nonlinear stochastic risk-sensitive control and differential games , 1992, Math. Control. Signals Syst..

[10]  W. Fleming,et al.  Risk-Sensitive Control on an Infinite Time Horizon , 1995 .

[11]  Tamer Basar,et al.  H∞-Optimal Control and Related , 1991 .

[12]  W. Fleming,et al.  Controlled Markov processes and viscosity solutions , 1992 .

[13]  Matthew R. James,et al.  The risk-sensitive index and theH2 andH∞, norms for nonlinear systems , 1995, Math. Control. Signals Syst..

[14]  K. Glover,et al.  State-space formulae for all stabilizing controllers that satisfy and H ∞ norm bound and relations to risk sensitivity , 1988 .