Reinforcement learning in scheduling
Thomas G. Dietterich | Prasad Tadepalli | Wei Zhang | DoKyeong Ok
[1] Gerald Tesauro, et al. Temporal Difference Learning of Backgammon Strategy, 1992, ML Workshop.
[2] Andrew G. Barto, et al. Learning to Act Using Real-Time Dynamic Programming, 1995, Artif. Intell.
[3] Norman Sadeh, et al. Look-ahead techniques for micro-opportunistic job shop scheduling, 1992.
[4] David S. Johnson, et al. Computers and Intractability: A Guide to the Theory of NP-Completeness, 1978.
[5] A. J. Clewett, et al. Introduction to sequencing and scheduling, 1974.
[6] Monte Zweben, et al. Learning to Improve Constraint-Based Scheduling, 1992, Artif. Intell.
[7] Andrew W. Moore, et al. Memory-based Reinforcement Learning: Converging with Less Data and Less Real Time, 1993.
[8] Mark S. Fox, et al. Constraint-Directed Search: A Case Study of Job-Shop Scheduling, 1987.