Approximate solutions for large-scale piecewise deterministic control systems arising in manufacturing flow control models

We propose a numerical technique for approximately solving large-scale piecewise deterministic control systems that are typically related to manufacturing flow control problems in unreliable production systems. The method consists of reformulating the stochastic control problem under study into a Markov decision process. Then we exploit the associated dynamic programming conditions and we propose an "approximate" policy iteration algorithm. This will be based on an approximation of the Bellman functions by a combination of a set of base functions, using a specific decomposition technique. The numerical method is applicable whenever a turnpike property holds for some associated infinite horizon deterministic control problem. To illustrate the approach, we solve an example and compare this new approximation method with a more classical approximation-by-decomposition technique. >

[1]  R. Rishel Dynamic Programming and Minimum Principles for Systems with Jump Markov Disturbances , 1975 .

[2]  Alain Haurie,et al.  Overtaking optimal regulation and tracking of piecewise diffusion linear systems , 1992 .

[3]  Christian van Delft,et al.  Turnpike properties for a class of piecewise deterministic systems arising in manufacturing flow control , 1989, Ann. Oper. Res..

[4]  Stanley B. Gershwin,et al.  Performance of hierarchical production scheduling policy , 1984 .

[5]  P. L’Ecuyer Computing Approximate Solutions to Markov Renewal Programs with Continuous State Spaces , 1989 .

[6]  Christian van Delft,et al.  A turnpike improvement algorithm for piecewise deterministic control , 1991 .

[7]  George Liberopoulos,et al.  Perturbation Analysis for the Design of Flexible Manufacturing System Flow Controllers , 1992, Oper. Res..

[8]  Juan Ye Optimal control of piecewise deterministic Markov processes. , 1990 .

[9]  O. Costa Impulse control of piecewise-deterministic processes via linear programming , 1991 .

[10]  W. Fleming,et al.  An Optimal Stochastic Production Planning Problem with Randomly Fluctuating Demand , 1987 .

[11]  Martin L. Puterman,et al.  On the Convergence of Policy Iteration in Stationary Dynamic Programming , 1979, Math. Oper. Res..

[12]  Mark H. A. Davis Piecewise‐Deterministic Markov Processes: A General Class of Non‐Diffusion Stochastic Models , 1984 .

[13]  Oded Maimon,et al.  Dynamic Scheduling and Routing for Flexible Manufacturing Systems that Have Unreliable Machines , 1988, Oper. Res..

[14]  M. Puterman,et al.  Modified Policy Iteration Algorithms for Discounted Markov Decision Problems , 1978 .

[15]  Bruce H. Krogh,et al.  Efficient computation of coordinating controls in hierarchical structures for failure-prone multicell flexible assembly systems , 1990, IEEE Trans. Robotics Autom..

[16]  M. Frisén A note on alternating estimation in non-linear regression , 1979 .

[17]  Panganamala Ramana Kumar,et al.  Optimality of Zero-Inventory Policies for Unreliable Manufacturing Systems , 1988, Oper. Res..

[18]  Stanley B. Gershwin,et al.  An algorithm for the computer control of a flexible manufacturing system , 1983 .

[19]  P. Schweitzer,et al.  Generalized polynomial approximations in Markovian decision processes , 1985 .

[20]  Oded Maimon,et al.  Value function approximation via linear programming for FMS scheduling , 1990 .

[21]  El-Kébir Boukas,et al.  Manufacturing flow control and preventing maintenance: a stochastic control approach , 1988 .

[22]  Ram Akella,et al.  Optimal control of production rate in a failure prone manufacturing system , 1985 .