Rollout Algorithms for Combinatorial Optimization
[1] G. G. Stokes. "J.", 1890, The New Yale Book of Quotations.
[2] Krishna R. Pattipati, et al. Application of heuristic search and information theory to sequential fault diagnosis, 1990, IEEE Trans. Syst. Man Cybern.
[3] Fred W. Glover, et al. A user's guide to tabu search, 1993, Ann. Oper. Res.
[4] Andrew G. Barto, et al. Learning to Act Using Real-Time Dynamic Programming, 1995, Artif. Intell.
[5] Gerald Tesauro, et al. On-line Policy Improvement using Monte-Carlo Search, 1996, NIPS.
[6] John N. Tsitsiklis, et al. Neuro-Dynamic Programming, 1996, Encyclopedia of Machine Learning.
[7] Andrew G. Barto, et al. Reinforcement learning, 1998.
[8] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.