Reinforcement learning for dynamic condition-based maintenance of a system with individually repairable components

Abstract In this article, a reinforcement learning approach is used to develop a new dynamic maintenance policy for multi-component systems with individually repairable components, where each component experiences two competing failure processes of degradation and random shocks. The gamma process is used to model the degradation path of each component in the system. It is assumed that each incoming shock causes damage to the degradation path of all the components. A combination of component degradation is then used to define the system states. The optimal maintenance action for each component at each specific state is found by modeling the problem as Markov decision process and solving it by using a Q-learning algorithm. Using a reinforcement learning approach provides a more time-efficient and cost-effective method compared to the traditional maintenance optimization solutions, and it can also provide a dynamic maintenance policy for each specific degradation state of the system which is more useful and beneficial compared to the fixed or stationary maintenance plan which is often proposed by previous studies. A numerical example shows how the reinforcement learning can be used to find the optimal maintenance action for systems with different system configurations.

[1]  Ajith Kumar Parlikad,et al.  Condition-based maintenance of multi-component systems with degradation state-rate interactions , 2016, Reliab. Eng. Syst. Saf..

[2]  Heping Li,et al.  A condition-based maintenance policy for multi-component systems with Lévy copulas dependence , 2016, Reliab. Eng. Syst. Saf..

[3]  Tongdan Jin,et al.  Adaptive Opportunistic Maintenance for Multi-unit Systems Subject to Stochastic Degradation , 2018, 2018 Annual Reliability and Maintainability Symposium (RAMS).

[4]  Enrico Zio,et al.  A reinforcement learning framework for optimal operation and maintenance of power grids , 2019, Applied Energy.

[5]  Samuel H. Huang,et al.  System health monitoring and prognostics — a review of current paradigms and practices , 2006 .

[6]  Tangbin Xia,et al.  Degradation modeling and classification of mixed populations using segmental continuous hidden Markov models , 2018, Qual. Reliab. Eng. Int..

[7]  Yisha Xiang,et al.  A sequential inspection and replacement policy for degradation-based systems , 2017, 2017 Annual Reliability and Maintainability Symposium (RAMS).

[8]  Xiao Wang,et al.  Multi-agent reinforcement learning based maintenance policy for a resource constrained flow line system , 2016, J. Intell. Manuf..

[9]  Seyyed Danial Nazemi,et al.  Economic and Operational Evaluation of PV and CHP combined with Energy Storage Systems considering Energy and Regulation Markets , 2020, 2020 IEEE Texas Power and Energy Conference (TPEC).

[10]  Qianmei Feng,et al.  Reliability modeling for dependent competing failure processes with changing degradation rate , 2014 .

[11]  Ying Peng,et al.  Current status of machine prognostics in condition-based maintenance: a review , 2010 .

[12]  Vitaly Levdik,et al.  Time Limits in Reinforcement Learning , 2017, ICML.

[13]  David W. Coit,et al.  Reliability and maintenance modeling for systems subject to multiple dependent competing failure processes , 2010 .

[14]  Norman F. Schneidewind Reliability Modeling , 2010, Encyclopedia of Software Engineering.

[15]  Mohsen A. Jafari,et al.  Evaluation of the behind-the-meter benefits of energy storage systems with consideration of ancillary market opportunities , 2020 .

[16]  Shahrul Kamaruddin,et al.  An overview of time-based and condition-based maintenance in industrial application , 2012, Comput. Ind. Eng..

[17]  Nan Chen,et al.  Condition-based maintenance using the inverse Gaussian degradation model , 2015, Eur. J. Oper. Res..

[18]  Subramaniam Ganesan,et al.  Condition based maintenance: a survey , 2012 .

[19]  David W. Coit,et al.  Dynamic Inspection Planning for Systems with Individually Repairable Components , 2019 .

[20]  David W. Coit,et al.  Inspection plan prediction for multi-repairable component systems using neural network , 2020, ArXiv.

[21]  Firoozeh Haghighi,et al.  Copula-based reliability analysis of gamma degradation process and Weibull failure time , 2019, International Journal of Quality & Reliability Management.

[22]  Liang Tang,et al.  An intelligent agent-based self-evolving maintenance and operations reasoning system , 2006, 2006 IEEE Aerospace Conference.

[23]  Yisha Xiang,et al.  Optimal maintenance policies for multi-level preventive maintenance with complex effects , 2019, IISE Trans..

[24]  R. Cooke,et al.  A Bayesian failure model based on isotropic deterioration , 1995 .

[25]  David W. Coit,et al.  Reliability for systems of degrading components with distinct component shock sets , 2014, Reliab. Eng. Syst. Saf..

[26]  D. Coit,et al.  Reliability Analysis of Systems Subject To Mutually Dependent Competing Failure Processes With Changing Degradation Rate , 2019, 1903.00076.

[27]  Lirong Cui,et al.  Reliability modeling for degradation-shock dependence systems with multiple species of shocks , 2019, Reliab. Eng. Syst. Saf..

[28]  Vipul Jain,et al.  Availability Analysis of Repairable Mechanical Systems Using Analytical Semi-Markov Approach , 2013 .

[29]  Yu Li,et al.  Gamma Degradation Process and Accelerated Model Combined Reliability Analysis Method for Rubber O-Rings , 2018, IEEE Access.

[30]  David W. Coit,et al.  Dynamic maintenance policy for systems with repairable components subject to mutually dependent competing failure processes , 2020, Comput. Ind. Eng..

[31]  Daishi Harada,et al.  Reinforcement Learning with Time , 1997, AAAI/IAAI.

[32]  Irad Ben-Gal,et al.  Condition-Based Maintenance via Simulation and a Targeted Bayesian Network Metamodel , 2013 .

[33]  Bo Guo,et al.  A maintenance optimization model for mission-oriented systems based on Wiener degradation , 2013, Reliab. Eng. Syst. Saf..

[34]  Qianmei Feng,et al.  Optimization of on-condition thresholds for a system of degrading components with competing dependent failure processes , 2019, Reliab. Eng. Syst. Saf..

[35]  Dimitris Kiritsis,et al.  Advances in Production Management Systems. Innovative and Knowledge-Based Production Management in a Global-Local World , 2014, IFIP Advances in Information and Communication Technology.

[36]  David W. Coit,et al.  Condition-Based Maintenance for Repairable Deteriorating Systems Subject to a Generalized Mixed Shock Model , 2015, IEEE Transactions on Reliability.

[37]  O. Arda Vanli,et al.  A Failure Time Prediction Method for Condition-Based Maintenance , 2014 .

[38]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[39]  Eduardo F. Morales,et al.  An Introduction to Reinforcement Learning , 2011 .

[40]  Dipak Kumar Jana,et al.  Availability and cost-centered preventive maintenance scheduling of continuous operating series systems using multi-objective genetic algorithm: A case study , 2016 .

[41]  Weiwen Peng,et al.  Inverse Gaussian process models for degradation analysis: A Bayesian perspective , 2014, Reliab. Eng. Syst. Saf..

[42]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.