Uniqueness of optimal policies as a generic property of discounted Markov decision processes: Ekeland's variational principle approach
[1] Phillipp Bergmann. Dynamic Programming: Deterministic and Stochastic Models, 2016.
[2] J. M. Machorro, et al. Uniform Convergence of Value Iteration Policies for Discounted Markov Decision Processes, 2006.
[3] O. Hernández-Lerma, et al. Discrete-time Markov control processes, 1999.
[4] W. Fleming. Book Review: Discrete-time Markov control processes: Basic optimality criteria, 1997.
[5] Raúl Montes-de-Oca, et al. Nonuniqueness versus Uniqueness of Optimal Policies in Convex Discounted Markov Decision Processes, 2013, J. Appl. Math.
[6] Raúl Montes-de-Oca, et al. An unbounded Berge's minimum theorem with applications to discounted Markov decision processes, 2012, Kybernetika.
[7] Raúl Montes-de-Oca, et al. Conditions for the uniqueness of optimal policies of discounted Markov decision processes, 2004, Math. Methods Oper. Res.
[8] Roberto Lucchetti, et al. Convexity and well-posed problems, 2006.
[9] I. Ekeland. On the variational principle, 1974.
[10] R. Phelps, et al. The Support Functionals of a Convex Set, 1986.
[11] J. Borwein, et al. Techniques of variational analysis, 2005.