Cost rate heuristics for semi-Markov decision processes
暂无分享,去创建一个
[1] C. S. Chen,et al. A discounted cost relationship , 1988 .
[2] Michael N. Katehakis,et al. The Multi-Armed Bandit Problem: Decomposition and Computation , 1987, Math. Oper. Res..
[3] Terje Aven,et al. Optimal replacement under a minimal repair strategy—a general failure model , 1983, Advances in Applied Probability.
[4] T. Aven,et al. Optimal replacement times — a general set-up , 1986, Journal of Applied Probability.
[5] D. Blackwell. Discounted Dynamic Programming , 1965 .
[6] C. White. Bounds on optimal cost for a replacement problem with partial observations , 1979 .
[7] S. Christian Albright,et al. Structural Results for Partially Observable Markov Decision Processes , 1979, Oper. Res..
[8] K. Glazebrook. Strategy evaluation for stochastic scheduling problems with order constraints , 1991, Advances in Applied Probability.
[9] David Ruppert,et al. Sequential Nonparametric Age Replacement Policies , 1985 .