The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems

In just-in-time (JIT) production systems, there is both input stock in the form of parts and output stock in the form of product at each stage. These activities are controlled by production-ordering and withdrawal kanbans. This paper discusses a discrete-time optimal control problem in a multistage JIT-based production and distribution system with stochastic demand and capacity, developed to minimize the expected total cost per unit of time. The problem can be formulated as an undiscounted Markov decision process (UMDP); however, the curse of dimensionality makes it very difficult to find an exact solution. The author proposes a new neuro-dynamic programming (NDP) algorithm, the simulation-based modified policy iteration method (SBMPIM), to solve the optimal control problem. The existing NDP algorithms and SBMPIM are numerically compared with a traditional UMDP algorithm for a single-stage JIT production system. It is shown that all NDP algorithms except the SBMPIM fail to converge to an optimal control. Additionally, a new algorithm for finding the optimal parameters of pull systems is proposed. Numerical comparisons between near-optimal controls computed using the SBMPIM and optimized pull systems are conducted for three-stage JIT-based production and distribution systems. UMDPs with 42 million states are solved using the SBMPIM. The pull systems discussed are the kanban, base stock, CONWIP, hybrid and extended kanban.

[1]  S. Marcus,et al.  A Simulation-Based Policy Iteration Algorithm for Average Cost Unichain Markov Decision Processes , 2000 .

[2]  Jack P. C. Kleijnen,et al.  An evolutionary approach to select a pull system among Kanban, Conwip and Hybrid , 2000, J. Intell. Manuf..

[3]  Abhijit Gosavi,et al.  Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning , 2003 .

[4]  P. Shahabudeen,et al.  Design of a Two-Card Dynamic Kanban System Using a Simulated Annealing Algorithm , 2003 .

[5]  Steven I. Marcus,et al.  Simulation-based Algorithms for Markov Decision Processes/ Hyeong Soo Chang ... [et al.] , 2013 .

[6]  Yves Dallery,et al.  Extended kanban control system: combining kanban and base stock , 2000 .

[7]  Fulya Altiparmak,et al.  A comparison of the performance of artificial intelligence techniques for optimizing the number of kanbans , 2002, J. Oper. Res. Soc..

[8]  Taiichi Ohno,et al.  Toyota Production System : Beyond Large-Scale Production , 1988 .

[9]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[10]  C. Sendil Kumar,et al.  Literature review of JIT-KANBAN system , 2007 .

[11]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[12]  David L. Woodruff,et al.  CONWIP: a pull alternative to kanban , 1990 .

[13]  J. George Shanthikumar,et al.  A GENERAL APPROACH FOR COORDINATING PRODUCTION IN MULTIPLE‐CELL MANUFACTURING SYSTEMS , 1992 .

[14]  Yves Dallery,et al.  A unified framework for pull control mechanisms in multi‐stage manufacturing systems , 2000, Ann. Oper. Res..

[15]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[16]  Steve New,et al.  Celebrating the enigma: the continuing puzzle of the Toyota Production System , 2007 .

[17]  J. Banks,et al.  Handbook of Simulation , 1998 .

[18]  John A. Buzacott,et al.  Stochastic models of manufacturing systems , 1993 .

[19]  R. D. Hurrion,et al.  An example of simulation optimisation using a neural network metamodel: finding the optimum number of kanbans in a manufacturing system , 1997 .

[20]  José Luis González Velarde,et al.  Computing Tools for Modeling, Optimization and Simulation , 2000 .

[21]  Y. Sugimori,et al.  Toyota production system and Kanban system Materialization of just-in-time and respect-for-human system , 1977 .

[22]  Yannick Frein,et al.  Comparison among three pull control policies: kanban, base stock, and generalized kanban , 2000, Ann. Oper. Res..

[23]  Tapas K. Das,et al.  A reinforcement learning approach to a single leg airline revenue management problem with multiple fare classes and overbooking , 2002 .

[24]  Christos G. Cassandras,et al.  Optimization of kanban-based manufacturing systems , 1999, Autom..

[25]  Maria Di Mascolo,et al.  On the design of generalized kanban control systems , 1995 .

[26]  S. Mahadevan,et al.  Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning , 1999 .

[27]  Yves Dallery,et al.  A performance comparison of pull type control mechanisms for multi-stage manufacturing , 2000 .

[28]  Warren B. Powell,et al.  “Approximate dynamic programming: Solving the curses of dimensionality” by Warren B. Powell , 2007, Wiley Series in Probability and Statistics.

[29]  John F. Magee,et al.  Production planning and inventory control , 1958 .

[30]  Yannick Frein,et al.  Unified Framework for Describing and Comparing the Dynamics of Pull Control Policies , 2004, Ann. Oper. Res..

[31]  Takuji Nishimura,et al.  Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator , 1998, TOMC.

[32]  Dimitrios E. Koulouriotis,et al.  Simulation optimisation of pull control policies for serial manufacturing lines and assembly manufacturing systems using genetic algorithms , 2010 .

[33]  Paul Glasserman,et al.  The Stability of a Capacitated, Multi-Echelon Production-Inventory System Under a Base-Stock Policy , 1994, Oper. Res..

[34]  Katsuhisa Ohno,et al.  Computing Optimal Policies for Controlled Tandem Queueing Systems , 1987, Oper. Res..

[35]  Asbjoern M. Bonvik,et al.  A comparison of production-line control mechanisms , 1997 .

[36]  Paul H. Zipkin,et al.  Foundations of Inventory Management , 2000 .

[37]  Christos Alexopoulos,et al.  Output Data Analysis , 2007 .

[38]  Mitsutoshi Kojima,et al.  Optimal numbers of two kinds of kanbans in a JIT production system , 1995 .

[39]  P. Shahabudeen,et al.  Design of bi-criteria Kanban system using simulated annealing technique , 2002 .

[40]  John B. Kidd,et al.  Toyota Production System , 1993 .

[41]  Fulya Altiparmak,et al.  The optimization of number of kanbans with genetic algorithms, simulated annealing and tabu search , 2000, Proceedings of the 2000 Congress on Evolutionary Computation. CEC00 (Cat. No.00TH8512).