Time-Sharing Policies for Controlled Markov Chains
暂无分享,去创建一个
[1] Manfred SchÄl,et al. Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal , 1975 .
[2] B. Fox,et al. Adaptive Policies for Markov Renewal Programs , 1973 .
[3] Arie Hordijk,et al. Constrained Undiscounted Stochastic Dynamic Programming , 1984, Math. Oper. Res..
[4] J. Kingman. A FIRST COURSE IN STOCHASTIC PROCESSES , 1967 .
[5] E. Altman,et al. Adaptive control of constrained Markov chains: Criteria and policies , 1991 .
[6] F. Beutler,et al. Optimal policies for controlled markov chains with a constraint , 1985 .
[7] Manfred Schäl,et al. ASYMPTOTIC RESULTS FOR SEQUENTIAL MARKOV DECISION MODELS UNDER UNCERTAINTY , 1984 .
[8] E. Altman,et al. Markov decision problems and state-action frequencies , 1991 .
[9] Kai Lai Chung,et al. Markov Chains with Stationary Transition Probabilities , 1961 .
[10] Keith W. Ross,et al. Randomized and Past-Dependent Policies for Markov Decision Processes with Multiple Constraints , 1989, Oper. Res..
[11] Armand M. Makowski,et al. Steering Policies for Markov Decision Processes Under a Recurrence Condition. , 1988 .
[12] Adam Shwartz,et al. Optimal priority assignment: a time sharing approach , 1989 .
[13] Cyrus Derman,et al. Finite State Markovian Decision Processes , 1970 .