Classes of Nonlinear Partially Observable Stochastic Optimal Control Problems with Explicit Optimal Control Laws

This paper introduces certain nonlinear partially observable stochastic optimal control problems which are equivalent to completely observable control problems with finite-dimensional state space. In some cases the optimal control laws are analogous to linear-exponential-quadratic-Gaussian and linear-quadratic-Gaussian tracking problems. The problems discussed allow nonlinearities to enter the unobservable dynamics as gradients of potential functions. The methodology is based on explicit solutions of a modified Duncan--Mortensen--Zakai equation.

[1]  Ioannis Karatzas,et al.  On the Relation of Zakai’s and Mortensen’s Equations , 1983 .

[2]  Robert J. Elliott,et al.  General finite-dimensional risk-sensitive problems and small noise limits , 1996, IEEE Trans. Autom. Control..

[3]  Etienne Pardoux,et al.  A conditionally almost Linear filtering problem with non-Gaussian initial condition , 1988 .

[4]  A. Bensoussan,et al.  Optimal control of partially observable stochastic systems with an exponential-of-integral performance index , 1985 .

[5]  John S. Baras,et al.  Existence, uniqueness, and asymptotic behavior of solutions to a class of Zakai equations with unbounded coefficients , 1983 .

[6]  Alʹbert Nikolaevich Shiri︠a︡ev,et al.  Statistics of random processes , 1977 .

[7]  W. Wonham On the Separation Theorem of Stochastic Control , 1968 .

[8]  D. W. Stroock,et al.  Multidimensional Diffusion Processes , 1979 .

[9]  Charalambos D. Charalambous Partially observable nonlinear risk-sensitive control problems: dynamic programming and verification theorems , 1997, IEEE Trans. Autom. Control..

[10]  Kevin L. Moore,et al.  Solvable risk-sensitive control problems with output feedback , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.

[11]  C. Charalambous,et al.  Minimum principle for partially observable nonlinear risk-sensitive control problems using measure-valued decompositions , 1996 .

[12]  E. Pardoux Non-linear Filtering, Prediction and Smoothing , 1981 .

[13]  Robert J. Elliott,et al.  A Finite-Dimensional Risk-Sensitive Control Problem , 1995 .

[14]  A. Bensoussan Stochastic Control of Partially Observable Systems , 1992 .

[15]  A. Friedman Stochastic Differential Equations and Applications , 1975 .