Paper information - The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems

In just-in-time (JIT) production systems, there is both input stock in the form of parts and output stock in the form of product at each stage. These activities are controlled by production-ordering and withdrawal kanbans. This paper discusses a discrete-time optimal control problem in a multistage JIT-based production and distribution system with stochastic demand and capacity, developed to minimize the expected total cost per unit of time. The problem can be formulated as an undiscounted Markov decision process (UMDP); however, the curse of dimensionality makes it very difficult to find an exact solution. The author proposes a new neuro-dynamic programming (NDP) algorithm, the simulation-based modified policy iteration method (SBMPIM), to solve the optimal control problem. The existing NDP algorithms and SBMPIM are numerically compared with a traditional UMDP algorithm for a single-stage JIT production system. It is shown that all NDP algorithms except the SBMPIM fail to converge to an optimal control. Additionally, a new algorithm for finding the optimal parameters of pull systems is proposed. Numerical comparisons between near-optimal controls computed using the SBMPIM and optimized pull systems are conducted for three-stage JIT-based production and distribution systems. UMDPs with 42 million states are solved using the SBMPIM. The pull systems discussed are the kanban, base stock, CONWIP, hybrid and extended kanban.
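To make the phrase "undiscounted Markov decision process minimizing the expected cost per unit of time" concrete, the following is a minimal sketch of classical relative value iteration on a toy single-stage system. It is not the author's SBMPIM, which is simulation-based and aimed at far larger state spaces. Everything in it is an illustrative assumption: the function name `relative_value_iteration`, the lost-sales and instantaneous-production model, the deterministic capacity (limited only by the stock cap), and the demand distribution and cost values.

```python
def relative_value_iteration(max_stock, demand_pmf, hold_cost, shortage_cost,
                             tol=1e-9, max_iter=10_000):
    """Average-cost (undiscounted) value iteration for a toy single-stage system.

    State  x: output stock at the start of a period (0..max_stock).
    Action a: units produced this period, available immediately (assumption).
    Unmet demand is lost and penalized (assumption).
    Returns (gain, policy, h): estimated average cost per period, the greedy
    production quantity for each state, and the relative values with h[0] = 0.
    """
    n = max_stock + 1
    h = [0.0] * n
    policy = [0] * n
    gain = 0.0
    for _ in range(max_iter):
        new_h = []
        for x in range(n):
            best_cost, best_a = float("inf"), 0
            for a in range(max_stock - x + 1):  # stock may not exceed the cap
                q = 0.0
                for d, p in demand_pmf.items():
                    end_stock = max(x + a - d, 0)
                    shortage = max(d - (x + a), 0)
                    q += p * (hold_cost * end_stock
                              + shortage_cost * shortage
                              + h[end_stock])
                if q < best_cost:  # strict '<' keeps the smallest a on ties
                    best_cost, best_a = q, a
            new_h.append(best_cost)
            policy[x] = best_a
        # The one-step differences bracket the optimal average cost.
        diffs = [new - old for new, old in zip(new_h, h)]
        gain = 0.5 * (max(diffs) + min(diffs))
        span = max(diffs) - min(diffs)
        # Subtract the value of reference state 0 so the iterates stay bounded.
        h = [v - new_h[0] for v in new_h]
        if span < tol:
            break
    return gain, policy, h


# Hypothetical data: demand is 0, 1, or 2 units per period.
demand = {0: 0.3, 1: 0.4, 2: 0.3}
gain, policy, h = relative_value_iteration(
    max_stock=4, demand_pmf=demand, hold_cost=1.0, shortage_cost=5.0)
print("average cost per period:", round(gain, 6))
print("production quantity by stock level:", policy)
```

With these assumed numbers the greedy policy produces up to a stock level of 2 units, at an average cost of about 1.0 per period, which matches a hand calculation of the one-period expected cost. The table `h` has one entry per state, so exact methods of this kind stop being practical for the multistage systems described above; this is the curse of dimensionality that motivates simulation-based approximations.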

Katsuhisa Ohno | K. Ohno
