Continuous time control of Markov processes on an arbitrary state space: Average return criterion

The paper deals with continuous time Markov decision processes on a fairly general state space. The economic criterion is the long-run average return. A set of conditions is shown to be sufficient for a constant g to be optimal average return and a stationary policy [pi]* to be optimal. This condition is shown to be satisfied under appropriate assumptions on the optimal discounted return function. A policy improvement algorithm is proposed and its convergence to an optimal policy is proved.

[1]  Anders Martin-Löf,et al.  Optimal Control of a Continuous-Time Markov Chain with Periodic Transition Probabilities , 1967, Oper. Res..

[2]  D. Blackwell Discrete Dynamic Programming , 1962 .

[3]  B. L. Miller Finite state continuous time Markov decision processes with an infinite planning horizon , 1968 .

[4]  S. Ross Arbitrary State Markovian Decision Processes , 1968 .

[5]  C. Derman DENUMERABLE STATE MARKOVIAN DECISION PROCESSES: AVERAGE COST CRITERION. , 1966 .

[6]  Anders Martin-Löf,et al.  Existence of a Stationary Control for a Markov Chain Maximizing the Average Reward , 1967, Oper. Res..

[7]  P. Mandl Analytical treatment of one-dimensional Markov processes , 1968 .

[8]  H. M. Taylor Markovian sequential replacement processes , 1965 .

[9]  S. S. Chitgopekar Continuous Time Markovian Sequential Control Processes , 1969 .

[10]  Ronald A. Howard,et al.  Dynamic Programming and Markov Processes , 1960 .

[11]  S. Ross NON-DISCOUNTED DENUMERABLE MARKOVIAN DECISION MODELS , 1968 .

[12]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1951 .

[13]  W. Rudin Principles of mathematical analysis , 1964 .

[14]  P. Kakumanu,et al.  Nondiscounted Continuous Time Markovian Decision Process with Countable State Space , 1972 .

[15]  B. Doshi Continuous Time Control of Markov Processes on an Arbitrary State Space: Discounted Rewards , 1976 .

[16]  B. L. Miller Finite State Continuous Time Markov Decision Processes with a Finite Planning Horizon , 1968 .