Deep-Reinforcement-Learning-Based Production Scheduling in Industrial Internet of Things

The unprecedented prosperity of the Industrial Internet of Things (IIoT) promotes the traditional industry transforming into intelligent manufacturing so that the whole production process can be comprehensively controlled to achieve flexible production. Intelligent scheduling, as one of the key enabling techniques, is desired to allocate the production of several machines by an efficient solution with minimum makespan. Existing approaches adopt a fixed search paradigm based on expert knowledge to seek satisfactory solutions. However, considering the varying data distribution and large sized of the practical problems, these methods fail to guarantee the quality of the obtained solution under the real-time requirement. To address this challenge, we formulate the production scheduling problem as a Markov decision process (MDP) and specifically design a job scheduling model made up of a job batching module for the hybrid flow-shop scheduling problem on batch processing machines (HFSP-BPM). Our proposed model consists of an actor network that learns the action under different conditions and a critic network that evaluates the action of the actor. We analyze the convergence of the model under different parameter settings to determine the optimal parameter. Extensive numerical experiments on both publicly available data set and real steel plant production data set demonstrate that the proposed deep reinforcement learning (DRL) approach compared with other baselines, more than 6% average improvements can be observed in many instances.

[1]  Weipeng Jing,et al.  Throughput Optimization in Heterogeneous Swarms of Unmanned Aircraft Systems for Advanced Aerial Mobility , 2022, IEEE Transactions on Intelligent Transportation Systems.

[2]  H. Song,et al.  Reinforcement Learning based Scheduling for Heterogeneous UAV Networking , 2021, 2021 17th International Conference on Mobility, Sensing and Networking (MSN).

[3]  H. Song,et al.  Reinforcement Learning Optimized Throughput for 5G Enhanced Swarm UAS Networking , 2021, ICC 2021 - IEEE International Conference on Communications.

[4]  Weipeng Jing,et al.  Blockchain enabled verification for cellular-connected unmanned aircraft system networking , 2021, Future Gener. Comput. Syst..

[5]  Laurence T. Yang,et al.  Cognitive Optimal-Setting Control of AIoT Industrial Applications With Deep Reinforcement Learning , 2021, IEEE Transactions on Industrial Informatics.

[6]  Jie Zhang,et al.  Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning , 2020, NeurIPS.

[7]  İbrahim Muter,et al.  Exact algorithms to minimize makespan on single and parallel batch processing machines , 2020, Eur. J. Oper. Res..

[8]  Houbing Song,et al.  Counter-Unmanned Aircraft System(s) (C-UAS): State of the Art, Challenges, and Future Trends , 2020, IEEE Aerospace and Electronic Systems Magazine.

[9]  M. K. Marichelvam,et al.  A literature review on hybrid flow shop scheduling , 2020, Int. J. Adv. Oper. Manag..

[10]  Quan-Ke Pan,et al.  Hybrid Artificial Bee Colony Algorithm for a Parallel Batching Distributed Flow-Shop Problem With Deteriorating Jobs , 2020, IEEE Transactions on Cybernetics.

[11]  Nei Kato,et al.  Ten Challenges in Advancing Machine Learning Technologies toward 6G , 2020, IEEE Wireless Communications.

[12]  Panos M. Pardalos,et al.  Two-stage hybrid flow shop scheduling on parallel batching machines considering a job-dependent deteriorating effect and non-identical job sizes , 2019, Appl. Soft Comput..

[13]  Der-Jiunn Deng,et al.  Smart Manufacturing Scheduling With Edge Computing Using Multiclass Deep Q Network , 2019, IEEE Transactions on Industrial Informatics.

[14]  Shijin Wang,et al.  An energy-efficient two-stage hybrid flow shop scheduling problem in a glass production , 2019, Int. J. Prod. Res..

[15]  Nei Kato,et al.  An Intelligent Route Computation Approach Based on Real-Time Deep Learning Strategy for Software Defined Communication Systems , 2019, IEEE Transactions on Emerging Topics in Computing.

[16]  Benjamim Fonseca,et al.  Deep Reinforcement Learning as a Job Shop Scheduling Solver: A Literature Review , 2018, HIS.

[17]  Benlian Xu,et al.  Ant Colony Optimization , 2001, Handbook of Machine Learning.

[18]  Chen-Yang Cheng,et al.  Multi-objective genetic algorithm for energy-efficient hybrid flow shop scheduling with lot streaming , 2018, Ann. Oper. Res..

[19]  김명재,et al.  Genetic Algorithm , 2018, Handbook of Machine Learning.

[20]  Alexandre Lacoste,et al.  Learning Heuristics for the TSP by Policy Gradient , 2018, CPAIOR.

[21]  Lawrence V. Snyder,et al.  Reinforcement Learning for Solving the Vehicle Routing Problem , 2018, NeurIPS.

[22]  Yinghui Xu,et al.  Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method , 2017, ArXiv.

[23]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[24]  Alasdair Gilchrist Industry 4.0: The Industrial Internet of Things , 2016 .

[25]  Andy Davis,et al.  This Paper Is Included in the Proceedings of the 12th Usenix Symposium on Operating Systems Design and Implementation (osdi '16). Tensorflow: a System for Large-scale Machine Learning Tensorflow: a System for Large-scale Machine Learning , 2022 .

[26]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[27]  Jinjiang Yuan,et al.  Online Over Time Scheduling on Parallel-Batch Machines: A Survey , 2014 .

[28]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[29]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[30]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[31]  Jung Woo Jung,et al.  Flowshop-scheduling problems with makespan criterion: a review , 2005 .

[32]  John L. Hunsucker,et al.  Comparative performance analysis of priority rules in a constrained flow shop with multiple processors environment , 1994 .

[33]  Éric D. Taillard,et al.  Benchmarks for basic scheduling problems , 1993 .

[34]  Reha Uzsoy,et al.  Efficient Algorithms for Scheduling Semiconductor Burn-In Operations , 1992, Oper. Res..

[35]  Ravi Sethi,et al.  The Complexity of Flowshop and Jobshop Scheduling , 1976, Math. Oper. Res..

[36]  Jatinder N. D. Gupta,et al.  A Functional Heuristic Algorithm for the Flowshop Scheduling Problem , 1971 .

[37]  R. A. Dudek,et al.  A Heuristic Algorithm for the n Job, m Machine Sequencing Problem , 1970 .

[38]  S. M. Johnson,et al.  Optimal two- and three-stage production schedules with setup times included , 1954 .

[39]  B. Eggers Computers And Intractability A Guide To The Theory Of Np Completeness , 2016 .

[40]  Inyong Ham,et al.  A heuristic algorithm for the m-machine, n-job flow-shop sequencing problem , 1983 .