Reinforcement learning for an intelligent and autonomous production control of complex job-shops under time constraints

Reinforcement learning (RL) offers promising opportunities to handle the ever-increasing complexity in managing modern production systems. We apply a Q -learning algorithm in combination with a process-based discrete-event simulation in order to train a self-learning, intelligent, and autonomous agent for the decision problem of order dispatching in a complex job shop with strict time constraints. For the first time, we combine RL in production control with strict time constraints. The simulation represents the characteristics of complex job shops typically found in semiconductor manufacturing. A real-world use case from a wafer fab is addressed with a developed and implemented framework. The performance of an RL approach and benchmark heuristics are compared. It is shown that RL can be successfully applied to manage order dispatching in a complex environment including time constraints. An RL-agent with a gain function rewarding the selection of the least critical order with respect to time-constraints beats heuristic rules strictly by picking the most critical lot first. Hence, this work demonstrates that a self-learning agent can successfully manage time constraints with the agent performing better than the traditional benchmark, a time-constraint heuristic combining due date deviations and a classical first-in-first-out approach.

[1]  Volker Stich,et al.  Produktionsplanung und -steuerung 2 , 2012 .

[2]  Dong-Seok Sun,et al.  Scheduling and control for time-constrained processes in semiconductor manufacturing , 2005, ISSM 2005, IEEE International Symposium on Semiconductor Manufacturing, 2005..

[3]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[4]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[5]  Andreas Klemmt,et al.  Scheduling jobs with time constraints between consecutive process steps in semiconductor manufacturing , 2012, Proceedings Title: Proceedings of the 2012 Winter Simulation Conference (WSC).

[6]  Andreas Kuhnle,et al.  Reinforcement learning for adaptive order dispatching in the semiconductor industry , 2018 .

[7]  John W. Fowler,et al.  Production Planning and Control for Semiconductor Wafer Fabrication Facilities - Modeling, Analysis, and Systems , 2013, Operations research / computer science interfaces series.

[8]  Lihui Wang,et al.  Global production networks: Design and operation , 2019, CIRP Annals.

[9]  Sebastian Thiede,et al.  Matrix structures for high volumes and flexibility in production systems , 2014 .

[10]  Birgit Vogel-Heuser,et al.  Industrie 4.0 in Produktion, Automatisierung und Logistik , 2014 .

[11]  Gisela Lanza,et al.  Reinforcement learning for opportunistic maintenance optimization , 2018, Prod. Eng..

[12]  Thomas Bauernhansl,et al.  Production Scheduling in Complex Job Shops from an Industry 4.0 Perspective: A Review and Challenges in the Semiconductor Industry , 2016, SAMI@iKNOW.

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  Peter Nyhuis,et al.  Changeable Manufacturing - Classification, Design and Operation , 2007 .

[15]  Manfred Weiß,et al.  Produktions-Management: Grundlagen der Produktionsplanung und -steuerung , 2012 .

[16]  Lenz Belzner,et al.  Deep reinforcement learning for semiconductor production scheduling , 2018, 2018 29th Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC).

[17]  Andreas Kuhnle,et al.  Design, Implementation and Evaluation of Reinforcement Learning for an Adaptive Order Dispatching in Job Shop Manufacturing Systems , 2019, Procedia CIRP.

[18]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[19]  Gisela Lanza,et al.  Autonomous order dispatching in the semiconductor industry using reinforcement learning , 2019, Procedia CIRP.

[20]  Sebastian Knopp,et al.  Complex Job-Shop Scheduling with Batching in Semiconductor Manufacturing , 2016 .