Optimized scheduling of complex resource allocation systems through approximate dynamic programming

In spite of the tremendous progress that has been attained by the scientific community on the sequencing and scheduling problems that underlie the operation of many technological applications, there is a remaining gap between the analytical insights and results offered by the existing scheduling theory and the solutions to the scheduling problems that are typically adopted in the industrial practice. The work undertaken in this project seeks to bridge this gap by tapping upon recent developments in the Supervisory Control of Resource Allocation Systems and Approximate Dynamic Programming. When combined, these two areas provide high-fidelity representations of the underlying dynamics and effective tools for developing (near-)optimal scheduling policies while explicitly managing the complexity that underlies the development and implementation of these policies. This write-up outlines the basic framework that supports the proposed approach and reports upon the current progress of the work.

[1]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[2]  E. Mark Gold,et al.  Deadlock Prediction: Easy and Difficult Cases , 1978, SIAM J. Comput..

[3]  Spyros A. Reveliotis,et al.  The destabilizing effect of blocking due to finite buffering capacity in multi-class queueing networks , 2000, IEEE Trans. Autom. Control..

[4]  Sean P. Meyn Control Techniques for Complex Networks: Workload , 2007 .

[5]  Thomas E. Morton,et al.  Heuristic scheduling systems : with applications to production systems and project management , 1993 .

[6]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[7]  Spyros Reveliotis Real-time management of resource allocation systems : a discrete event systems approach , 2004 .

[8]  P. Schweitzer,et al.  Generalized polynomial approximations in Markovian decision processes , 1985 .

[9]  P. R. Kumar Scheduling Manufacturing Systems of Re-Entrant Lines , 1994 .

[10]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[11]  Richard Sutton,et al.  Learning to Maximize Rewards: A Review of "Reinforcement Learning: An Introduction , 2000 .

[12]  Sean P. Meyn,et al.  Duality and linear programs for stability and performance analysis of queuing networks and scheduling policies , 1996, IEEE Trans. Autom. Control..

[13]  Warren B. Powell,et al.  Approximate Dynamic Programming - Solving the Curses of Dimensionality , 2007 .

[14]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[15]  Warren B. Powell,et al.  Handbook of Learning and Approximate Dynamic Programming , 2006, IEEE Transactions on Automatic Control.

[16]  D. C. Gardner,et al.  Semiconductor factory automation: designing for phased automation , 1996, IEEE/SEMI 1996 Advanced Semiconductor Manufacturing Conference and Workshop. Theme-Innovative Approaches to Growth in the Semiconductor Industry. ASMC 96 Proceedings.

[17]  Michail G. Lagoudakis,et al.  Least-Squares Policy Iteration , 2003, J. Mach. Learn. Res..

[18]  Spyros Reveliotis,et al.  Deadlock Avoidance for Sequential Resource Allocation Systems: Hard and Easy Cases , 2001 .

[19]  Xi-Ren Cao Stochastic Learning and Optimization , 2007 .

[20]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[21]  Christos G. Cassandras,et al.  Introduction to Discrete Event Systems , 1999, The Kluwer International Series on Discrete Event Dynamic Systems.

[22]  Upendra Dave,et al.  Heuristic Scheduling Systems , 1993 .

[23]  Wallace J. Hopp,et al.  Factory physics : foundations of manufacturing management , 1996 .

[24]  Spyros A. Reveliotis,et al.  Relative value function approximation for the capacitated re-entrant line scheduling problem , 2004, IEEE Transactions on Automation Science and Engineering.

[25]  Jin Young Choi,et al.  Performance Modeling, Analysis and Control of Capacitated Re-entrant Lines , 2004 .

[26]  Sheldon M. Ross,et al.  Introduction to probability models , 1975 .

[27]  Sean P. Meyn,et al.  Duality and linear programs for stability and performance analysis of queueing networks and scheduling policies , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.