Marek Petrik | Dharmashankar Subramanian | Janusz Marecki
[1] Martin L. Puterman, et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.
[2] E. Altman. Constrained Markov Decision Processes, 1999.
[3] Marek Petrik, et al. Linear Dynamic Programs for Resource Management, 2011, AAAI.
[4] Garud Iyengar, et al. Robust Dynamic Programming, 2005, Math. Oper. Res.
[5] Michail G. Lagoudakis, et al. Binary action search for learning continuous-action control policies, 2009, ICML '09.
[6] L. Ghaoui, et al. Robust Markov decision processes with uncertain transition matrices, 2004.
[7] Michael L. Littman, et al. Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes, 2012, ICAPS.
[8] Naoki Abe, et al. Optimizing debt collections using constrained reinforcement learning, 2010, KDD.
[9] Marek Petrik, et al. An Approximate Solution Method for Large Risk-Averse Markov Decision Processes, 2012, UAI.
[10] Jean-Philippe P. Richard, et al. KRANNERT GRADUATE SCHOOL OF MANAGEMENT, 2010.
[11] Stephen P. Boyd, et al. Convex Optimization, 2004, Algorithms and Theory of Computation Handbook.