Finite-horizon variance penalised Markov decision processes
暂无分享,去创建一个
[1] Cyrus Derman,et al. Finite State Markovian Decision Processes , 1970 .
[2] G. C. Shephard,et al. Convex Polytopes and the Upper Bound Conjecture , 1971 .
[3] M. J. Sobel. The variance of discounted Markov decision processes , 1982 .
[4] Jerzy A. Filar,et al. Variance-Penalized Markov Decision Processes , 1989, Math. Oper. Res..
[5] D. J. White. Computational approaches to variance-penalised Markov decision processes , 1992 .
[6] Ying Huang,et al. On Finding Optimal Policies for Markov Decision Chains: A Unifying Framework for Mean-Variance-Tradeoffs , 1994, Math. Oper. Res..
[7] D. J. White. A mathematical programming approach to a problem in variance penalised Markov decision processes , 1994 .
[8] E. J. Collins,et al. Finite-horizon dynamic optimisation when the terminal reward is a concave functional of the distribution of the final state , 1998, Advances in Applied Probability.