论文信息 - Stackelberg strategies in linear-quadratic stochastic differential games

Stackelberg strategies in linear-quadratic stochastic differential games

This paper obtains the Stackelberg solution to a class of two-player stochastic differential games described by linear state dynamics and quadratic objective functionals. The information structure of the problem is such that the players make independent noisy measurements of the initial state and are permitted to utilize only this information in constructing their controls. Furthermore, by the very nature of the Stackelberg solution concept, one of the players is assumed to know, in advance, the strategy of the other player (the leader). For this class of problems, we first establish existence and uniqueness of the Stackelberg solution and then relate the derivation of the leader's Stackelberg solution to the optimal solution of a nonstandard stochastic control problem. This stochastic control problem is solved in a more general context, and its solution is utilized in constructing the Stackelberg strategy of the leader. For the special case Gaussian statistics, it is shown that this optimal strategy is affine in observation of the leader. The paper also discusses numerical aspects of the Stackelberg solution under general statistics and develops algorithms which converge to the unique Stackelberg solution.

T. Başar | A. Bagchi

[1] Heinrich von Stackelberg,et al. Stackelberg (Heinrich von) - The Theory of the Market Economy, translated from the German and with an introduction by Alan T. PEACOCK. , 1953 .

[2] H. Kushner. On the Existence of Optimal Stochastic Controls , 1965 .

[3] C. Chen,et al. Stackelburg solution for two-person games with biased information patterns , 1972 .

[4] Contraction-Mapping Algorithm with Guaranteed Convergence , 1972 .

[5] J. Cruz,et al. Additional aspects of the Stackelberg strategy in nonzero-sum games , 1973 .

[6] J. Cruz,et al. On the Stackelberg strategy in nonzero-sum games , 1973 .

[7] Michael Athans,et al. On stochastic dynamic stackelberg strategies , 1975, Autom..

[8] Jr. J. Cruz,et al. Leader-follower strategies for multilevel systems , 1978 .

[9] M Tamer Basar. Stochastic stagewise Stackleberg strategies for linear quadratic systems , 1979 .

[10] T. Başar,et al. Closed-loop Stackelberg strategies with applications in the optimal control of multilevel systems , 1979 .

[11] Tamer Başar. Hierarchical Decisionmaking under Uncertainty , 1980 .

[12] Tamer Basar,et al. Team decision theory for linear continuous-time systems , 1980 .