论文信息 - Classification Problems in MDPs

Classification Problems in MDPs

In this paper we investigate classification problems for Markov deci-sion processes (MDPs). These MDPs can be classified in several ways. One way is based on the concept communicating, and distinguishes between communicating, weakly communicating and noncommunicating. Another way of classification is based on the ergodic structure. In this approach the distinction between completely ergodic, unichain and multichain is made. Furthermore, there is a classification based on decomposition of the state space. This decomposition distinguishes between several levels. At each level there is a set of recurrent classes and a (perhaps empty) set of transient states.

L. C. M. Kallenberg | L. Kallenberg

[1] Bennett L. Fox,et al. Scientific Applications: An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix , 1967, Commun. ACM.

[2] Herbert Weinblatt,et al. A New Search Algorithm for Finding the Simple Cycles of a Finite Directed Graph , 1972, JACM.

[3] Robert E. Tarjan,et al. Depth-First Search and Linear Graph Algorithms , 1972, SIAM J. Comput..

[4] J. Bather. Optimal decision procedures for finite Markov chains. Part III: General convex systems , 1973 .

[5] J. Bather. Optimal decision procedures for finite Markov chains. Part II: Communicating systems , 1973, Advances in Applied Probability.

[6] J. Bather. Optimal decision procedures for finite markov chains. Part I: Examples , 1973, Advances in Applied Probability.

[7] Eric V. Denardo,et al. Periods of Connected Networks and Powers of Nonnegative Matrices , 1977, Math. Oper. Res..

[8] Loren Platzman,et al. Technical Note - Improved Conditions for Convergence in Undiscounted Markov Renewal Programming , 1977, Oper. Res..

[9] L. C. M. Kallenberg,et al. Linear programming and finite Markovian control problems , 1984 .

[10] J. Filar,et al. Communicating MDPs: Equivalence and LP properties , 1988 .

[11] Keith W. Ross,et al. Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach , 1991, Math. Oper. Res..