Ergodic control of Markov chains

The ergodic or long-run average cost problem, one of the classical problems in Markov decision processes, is addressed. A brief description is given of Borkar's convex analytic approach, which treats the problem with countable state space, compact action space, and unbounded cost. The ergodic control problem is related to the finite horizon and discounted cost problems.

Steven I. Marcus | Mrinal K. Ghosh

[1] V. Borkar. Control of Markov chains with long-run average cost criterion: the dynamic programming equations, 1989.

[2] C. Derman. Denumerable State Markovian Decision Processes: Average Cost Criterion, 1966.

[3] R. Cavazos-Cadena. Necessary conditions for the optimality equation in average-reward Markov decision processes, 1989.

[4] H. M. Taylor. Markovian sequential replacement processes, 1965.

[5] C. Derman, et al. A Solution to a Countable System of Equations Arising in Markovian Decision Processes, 1966.

[6] Linn I. Sennott, et al. Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs, 1989, Oper. Res.

[7] A. Arapostathis, et al. On partially observable Markov decision processes with an average cost criterion, 1989, Proceedings of the 28th IEEE Conference on Decision and Control.

[8] C. Derman. On Sequential Decisions and Markov Chains, 1962.

[9] Zvi Rosberg, et al. Optimal control of service in tandem queues, 1982.

[10] Arie Hordijk, et al. Dynamic programming and Markov potential theory, 1974.

[11] S. Ross. Arbitrary State Markovian Decision Processes, 1968.

[12] James Flynn. Averaging vs. Discounting in Dynamic Programming: a Counterexample, 1974.

[13] V. Borkar. On Minimum Cost Per Unit Time Control of Markov Chains, 1984.

[14] R. Cavazos-Cadena. Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains, 1988.

[15] Vivek S. Borkar, et al. Control of Markov Chains with Long-Run Average Cost Criterion, 1988.

[16] Paul J. Schweitzer, et al. Denumerable Undiscounted Semi-Markov Decision Processes with Unbounded Rewards, 1983, Math. Oper. Res.

[17] R. Weber, et al. Optimal control of service rates in networks of queues, 1987, Advances in Applied Probability.

[18] Linn I. Sennott, et al. Average Cost Semi-Markov Decision Processes and the Control of Queueing Systems, 1989, Probability in the Engineering and Informational Sciences.

[19] Ari Arapostathis, et al. On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes, 1991, Ann. Oper. Res.

[20] James Flynn. On optimality criteria for dynamic programs with long finite horizons, 1980.

[21] D. Blackwell. Discrete Dynamic Programming, 1962.

[22] Shaler Stidham, et al. Monotonic and Insensitive Optimal Policies for Control of Queues with Undiscounted Costs, 1989, Oper. Res.