Continuous-Time Markov Decision Processes with Controlled Observations
暂无分享,去创建一个
[1] Abraham Wald,et al. Some Generalizations of the Theory of Cumulative Sums of Random Variables , 1945 .
[2] T. Başar. Minimax control of switching systems under sampling , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.
[3] Jr. Shaler Stidham. Optimal control of admission to a queueing system , 1985 .
[4] Jürgen Pannek,et al. Numerical Optimal Control of Nonlinear Systems , 2011 .
[5] Peter E. Caines,et al. Stochastic optimal control under Poisson-distributed observations , 2000, IEEE Trans. Autom. Control..
[6] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[7] R. Durrett. Probability: Theory and Examples , 1993 .
[8] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[9] Vikram Krishnamurthy,et al. Partially observed Markov decision processes (POMDPs) , 2016 .
[10] Tamer Basar,et al. Optimal control of LTI systems over unreliable communication links , 2006, Autom..
[11] Eitan Altman,et al. Applications of Markov Decision Processes in Communication Networks , 2000 .
[12] Edwin K. P. Chong,et al. UAV Path Planning in a Dynamic Environment via Partially Observable Markov Decision Process , 2013, IEEE Transactions on Aerospace and Electronic Systems.
[13] Daniel Liberzon,et al. Calculus of Variations and Optimal Control Theory: A Concise Introduction , 2012 .