Blackwell Optimality in Denumerable Markov Decision Chains
暂无分享,去创建一个
We consider Markov decision chains with a denumerable state space E, compact metric action sets and the usual continuity properties of the transition probabilities and (possibly unbounded) immediate rewards.