Classification Problems in MDPs
暂无分享,去创建一个
[1] Bennett L. Fox,et al. Scientific Applications: An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix , 1967, Commun. ACM.
[2] Herbert Weinblatt,et al. A New Search Algorithm for Finding the Simple Cycles of a Finite Directed Graph , 1972, JACM.
[3] Robert E. Tarjan,et al. Depth-First Search and Linear Graph Algorithms , 1972, SIAM J. Comput..
[4] J. Bather. Optimal decision procedures for finite Markov chains. Part III: General convex systems , 1973 .
[5] J. Bather. Optimal decision procedures for finite Markov chains. Part II: Communicating systems , 1973, Advances in Applied Probability.
[6] J. Bather. Optimal decision procedures for finite markov chains. Part I: Examples , 1973, Advances in Applied Probability.
[7] Eric V. Denardo,et al. Periods of Connected Networks and Powers of Nonnegative Matrices , 1977, Math. Oper. Res..
[8] Loren Platzman,et al. Technical Note - Improved Conditions for Convergence in Undiscounted Markov Renewal Programming , 1977, Oper. Res..
[9] L. C. M. Kallenberg,et al. Linear programming and finite Markovian control problems , 1984 .
[10] J. Filar,et al. Communicating MDPs: Equivalence and LP properties , 1988 .
[11] Keith W. Ross,et al. Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach , 1991, Math. Oper. Res..