Relative value function approximation for the capacitated re-entrant line scheduling problem

The problem addressed in this study is that of determining how to allocate the workstation processing and buffering capacity in a capacitated re-entrant line to the job instances competing for it, in order to maximize its long-run/steady-state throughput, while maintaining the logical correctness of the underlying material flow, i.e., deadlock-free operations. An approximation scheme for the optimal policy that is based on neuro-dynamic programming theory is proposed, and its performance is assessed through a numerical experiment. The derived results indicate that the proposed method holds considerable promise for providing a viable, computationally efficient approach to the problem and highlight directions for further investigation. Note to Practitioners-Sequencing and scheduling problems arising in the context of contemporary manufacturing environments are known to be extremely hard. For this reason, in most practical situations, these problems have been resolved through the application of a number of heuristics-i.e., "rules of thumb" that are expected to provide reasonable performance. Things are complicated even further in the automated versions of these environments, since the applied sequencing and scheduling logic must guarantee, in addition to good performance, logically correct and smooth operation. Both the logical and the performance-oriented control problem of flexibly automated production systems can be-and have been-addressed through formal systems theory. However, a challenging remaining problem is the approximation of the derived optimal policies in a way that will maintain near optimality, and at the same time, it will be computationally tractable in the context of the "real-world" applications. Our past work has addressed this approximation problem primarily with respect to the optimal logical control policy. The work presented in this paper undertakes the complementary problem of approximating the optimal scheduling policy. To this end, we employ some recently emerged results from a field known as neuro-dynamic programming (which is essentially a systematic approximation framework for dynamic programming). Our results indicate that the proposed approximation framework holds considerable promise toward developing a systematic analytical methodology for deriving near-optimal and logically correct scheduling policies for flexibly automated production systems. More specifically, it is shown that, when applied to some prototypical problems concerning the scheduling of re-entrant lines with finite buffering capacity at their workstations, the proposed approximation framework: 1) effectively integrates past results concerning the logical control of these environments and 2) the obtained performance is consistently superior to the performance provided by the typically used heuristics.

[1]  P. R. Kumar,et al.  Performance bounds for queueing networks and scheduling policies , 1994, IEEE Trans. Autom. Control..

[2]  Jin Young Choi,et al.  Performance Modeling, Analysis and Control of Capacitated Re-entrant Lines , 2004 .

[3]  M. Degroot,et al.  Probability and Statistics , 2021, Examining an Operational Approach to Teaching Probability.

[4]  S. Fotopoulos Stochastic modeling and analysis of manufacturing systems , 1996 .

[5]  Sunil Kumar,et al.  Queueing network models in the design and analysis of semiconductor wafer fabs , 2001, IEEE Trans. Robotics Autom..

[6]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[7]  Spyros A. Reveliotis,et al.  The destabilizing effect of blocking due to finite buffering capacity in multi-class queueing networks , 2000, IEEE Trans. Autom. Control..

[8]  Sunil Kumar,et al.  Fluctuation smoothing policies are stable for stochastic re-entrant lines , 1996, Discret. Event Dyn. Syst..

[9]  Spyros Reveliotis,et al.  Deadlock Avoidance for Sequential Resource Allocation Systems: Hard and Easy Cases , 2001 .

[10]  John N. Tsitsiklis,et al.  Feature-based methods for large scale dynamic programming , 2004, Machine Learning.

[11]  Benjamin Van Roy Learning and value function approximation in complex decision processes , 1998 .

[12]  P. R. Kumar Scheduling Manufacturing Systems of Re-Entrant Lines , 1994 .

[13]  S. Sushanth Kumar,et al.  Performance bounds for queueing networks and scheduling policies , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.

[14]  Walter L. Smith Probability and Statistics , 1959, Nature.

[15]  Benjamin Van Roy,et al.  The linear programming approach to approximate dynamic programming: theory and application , 2002 .

[16]  P. Schweitzer,et al.  Generalized polynomial approximations in Markovian decision processes , 1985 .

[17]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[18]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[19]  John N. Tsitsiklis,et al.  The complexity of optimal queueing network control , 1994, Proceedings of IEEE 9th Annual Conference on Structure in Complexity Theory.

[20]  John N. Tsitsiklis,et al.  Optimization of multiclass queuing networks: polyhedral and nonlinear characterizations of achievable performance , 1994 .

[21]  Spyros Reveliotis,et al.  Structural Control of Large-Scale Flexibly Automated Manufacturing Systems , 2019, The Design of Manufacturing Systems.

[22]  P.R. Kumar Scheduling semiconductor manufacturing plants , 1994, IEEE Control Systems.

[23]  John N. Tsitsiklis,et al.  The Complexity of Optimal Queuing Network Control , 1999, Math. Oper. Res..

[24]  Christos G. Cassandras,et al.  Introduction to Discrete Event Systems , 1999, The Kluwer International Series on Discrete Event Dynamic Systems.

[25]  Christine Westall Performance Modeling , 2002 .

[26]  Spyros A. Reveliotis,et al.  A generalized stochastic Petri net model for performance analysis and control of capacitated reentrant lines , 2003, IEEE Trans. Robotics Autom..

[27]  Sunil Kumar,et al.  Fluctuation smoothing policies are stable for stochastic re-entrant lines , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.

[28]  S.C.H. Lu,et al.  Efficient scheduling policies to reduce mean and variance of cycle-time in semiconductor manufacturing plants , 1994 .

[29]  Lawrence M. Wein,et al.  Scheduling semiconductor wafer fabrication , 1988 .