Exponential Convergence in Undiscounted Continuous-Time Markov Decision Chains
暂无分享,去创建一个
[1] Daniel P. Heyman,et al. Stochastic models in operations research , 1982 .
[2] John Bather. OPTIMAL STATIONARY POLICIES FOR DENUMERABLE MARKOV CHAINS IN CONTINUOUS TIME , 1976 .
[3] Mark R. Lembersky. On Maximal Rewards and $|varepsilon$-Optimal Policies in Continuous Time Markov Decision Chains , 1974 .
[4] Awi Federgruen,et al. A GENERAL MARKOV DECISION METHOD I: MODEL AND TECHNIQUES , 1977 .
[5] A. Federgruen,et al. A general markov decision method II: Applications , 1977, Advances in Applied Probability.
[6] E. Coddington,et al. Theory of Ordinary Differential Equations , 1955 .
[7] W. Zijm. Asymptotic expansions for dynamic programming recursions with general nonnegative matrices , 1987 .
[8] W. Barry. On the Iterative Method of Dynamic Programming on a Finite Space Discrete Time Markov Process , 1965 .
[9] Bharat T. Doshi,et al. Continuous time control of Markov processes on an arbitrary state space: Average return criterion , 1976 .
[10] Paul J. Schweitzer,et al. The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems , 1977, Math. Oper. Res..
[11] F. A. van der Duyn Schouten. Markov decision processes with continuous time parameter , 1983 .
[12] P. Kakumanu,et al. Nondiscounted Continuous Time Markovian Decision Process with Countable State Space , 1972 .
[13] B. L. Miller. Finite state continuous time Markov decision processes with an infinite planning horizon , 1968 .
[14] R. Bellman. Dynamic programming. , 1957, Science.
[15] Ronald A. Howard,et al. Dynamic Programming and Markov Processes , 1960 .
[16] B. L. Miller. Finite State Continuous Time Markov Decision Processes with a Finite Planning Horizon , 1968 .
[17] P. Schweitzer,et al. Geometric convergence of value-iteration in multichain Markov decision problems , 1979, Advances in Applied Probability.
[18] Kai Lai Chung,et al. Markov Chains with Stationary Transition Probabilities , 1961 .