论文信息 - Continuous time control of Markov processes on an arbitrary state space: Average return criterion

Continuous time control of Markov processes on an arbitrary state space: Average return criterion

The paper deals with continuous time Markov decision processes on a fairly general state space. The economic criterion is the long-run average return. A set of conditions is shown to be sufficient for a constant g to be optimal average return and a stationary policy [pi]* to be optimal. This condition is shown to be satisfied under appropriate assumptions on the optimal discounted return function. A policy improvement algorithm is proposed and its convergence to an optimal policy is proved.

Bharat T. Doshi | B. Doshi

[1] Anders Martin-Löf,et al. Optimal Control of a Continuous-Time Markov Chain with Periodic Transition Probabilities , 1967, Oper. Res..

[2] D. Blackwell. Discrete Dynamic Programming , 1962 .

[3] B. L. Miller. Finite state continuous time Markov decision processes with an infinite planning horizon , 1968 .

[4] S. Ross. Arbitrary State Markovian Decision Processes , 1968 .

[5] C. Derman. DENUMERABLE STATE MARKOVIAN DECISION PROCESSES: AVERAGE COST CRITERION. , 1966 .

[6] Anders Martin-Löf,et al. Existence of a Stationary Control for a Markov Chain Maximizing the Average Reward , 1967, Oper. Res..

[7] P. Mandl. Analytical treatment of one-dimensional Markov processes , 1968 .

[8] H. M. Taylor. Markovian sequential replacement processes , 1965 .

[9] S. S. Chitgopekar. Continuous Time Markovian Sequential Control Processes , 1969 .

[10] Ronald A. Howard,et al. Dynamic Programming and Markov Processes , 1960 .

[11] S. Ross. NON-DISCOUNTED DENUMERABLE MARKOVIAN DECISION MODELS , 1968 .

[12] William Feller,et al. An Introduction to Probability Theory and Its Applications , 1951 .

[13] W. Rudin. Principles of mathematical analysis , 1964 .

[14] P. Kakumanu,et al. Nondiscounted Continuous Time Markovian Decision Process with Countable State Space , 1972 .

[15] B. Doshi. Continuous Time Control of Markov Processes on an Arbitrary State Space: Discounted Rewards , 1976 .

[16] B. L. Miller. Finite State Continuous Time Markov Decision Processes with a Finite Planning Horizon , 1968 .