Nonuniqueness versus Uniqueness of Optimal Policies in Convex Discounted Markov Decision Processes
[1] Jaap Wessels,et al. A Note on Dynamic Programming with Unbounded Rewards , 1975 .
[2] J. Hadamard. Sur les problèmes aux dérivées partielles et leur signification physique , 1902 .
[3] W. Marsden. I and J , 2012 .
[4] O. Hernández-Lerma,et al. Further topics on discrete-time Markov control processes , 1999 .
[5] A. Peressini,et al. The Mathematics Of Nonlinear Programming , 1988 .
[6] Raúl Montes-de-Oca,et al. Conditions for the uniqueness of optimal policies of discounted Markov decision processes , 2004, Math. Methods Oper. Res..
[7] Kensuke Tanaka,et al. On an $\varepsilon$-Optimal Policy of Discrete Time Stochastic Control Processes , 1995 .