Optimum adaptive control in an unknown environment

This correspondence describes the formulation and solution of a nonlinear, non-Gaussian stochastic control problem. Dynamic programming is used to obtain the solution to the problem of optimally controlling a robot, equipped with sensors, that is operating in an unknown environment.

[1]  Ronald A. Howard,et al.  Information Value Theory , 1966, IEEE Trans. Syst. Sci. Cybern..