Abstract Let ξn be a sequence of independent vector-valued random variables. Let xn be a vector with components x0(n), …, xr(n), and let θn be a vector control. The maximum principle and canonical equations of Pontryagin are derived for the vector system xn = F(xn − 1, θn, ξn) with loss function E x0(N), where N, the control time, is fixed. In the continuous case, it is derived for the form x = ƒ(x, θ) + σξ , and loss E x0(T), where σ is a nonnegative definite matrix and ξ is vector-valued white Gaussian noise. Under suitable smoothness assumptions, the expectation (conditioned upon xn) of the adjoint variables are the derivatives of the loss (min E x0(N) conditioned upon xn).
[1]
S.S.L. Chang.
DYNAMIC PROGRAMMING AND PONTRYAGIN'S MAXIMUM PRINCIPLE
,
1961
.
[2]
R. Kálmán.
THE THEORY OF OPTIMAL CONTROL AND THE CALCULUS OF VARIATIONS
,
1960
.
[3]
Michel Loève,et al.
Probability Theory I
,
1977
.
[4]
Harold J. Kushner,et al.
Optimal stochastic control
,
1962
.
[5]
S. Katz.
A Discrete Version of Pontrjagin's Maximum Principle†
,
1962
.
[6]
J. Florentin.
Optimal Control of Continuous Time, Markov, Stochastic Systems†
,
1961
.