Hierarchical decision making in semiconductor fabs using multi-time scale Markov decision processes
[1] Dimitri P. Bertsekas, et al. Dynamic Programming and Optimal Control, Two Volume Set, 1995.
[2] Shalabh Bhatnagar, et al. Approximate Policy Iteration for Semiconductor Fab-Level Decision Making - a Case Study, 2000.
[3] S. Marcus, et al. Multi-time Scale Markov Decision Processes, 2005.
[4] John N. Tsitsiklis, et al. Analysis of temporal-difference learning with function approximation, 1996, NIPS 1996.
[5] John N. Tsitsiklis, et al. Neuro-Dynamic Programming, 1996, Encyclopedia of Machine Learning.
[6] M.C. Fu, et al. A Markov decision process model for capacity expansion and allocation, 1999, Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304).
[7] Mark A. Shayman, et al. Multitime scale Markov decision processes, 2003, IEEE Trans. Autom. Control.
[8] Martin L. Puterman, et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.