Total Reward Variance in Discrete and Continuous Time Markov Chains
暂无分享,去创建一个
[1] M. J. Sobel. The variance of discounted Markov decision processes , 1982 .
[2] D. G. MacKay. Context-dependent stuttering , 1970, Kybernetik.
[3] Francisco Benito. Calculating the variance in Markov-processes with random reward , 1982 .
[4] K. Sladký,et al. Optimal Solutions for Undiscounted Variance Penalized Markov Decision Chains , 2004 .
[5] Georg Ch. Pflug,et al. Dynamic Stochastic Optimization , 2004 .
[6] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[7] D. White. Mean, variance, and probabilistic criteria in finite Markov decision processes: A review , 1988 .