Hidden Markov Models and Their Application for Predicting Failure Events

We show how Markov mixed membership models (MMMM) can be used to predict the degradation of assets. We model the degradation path of individual assets, to predict overall failure rates. Instead of a separate distribution for each hidden state, we use hierarchical mixtures of distributions in the exponential family. In our approach the observation distribution of the states is a finite mixture distribution of a small set of (simpler) distributions shared across all states. Using tied-mixture observation distributions offers several advantages. The mixtures act as a regularization for typically very sparse problems, and they reduce the computational effort for the learning algorithm since there are fewer distributions to be found. Using shared mixtures enables sharing of statistical strength between the Markov states and thus transfer learning. We determine for individual assets the trade-off between the risk of failure and extended operating hours by combining a MMMM with a partially observable Markov decision process (POMDP) to dynamically optimize the policy for when and how to maintain the asset.

[1]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[2]  Somnath Datta,et al.  Inference Based on Imputed Failure Times for the Proportional Hazards Model with Interval-Censored Data , 1998 .

[3]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[4]  L. J. Wei,et al.  Regression analysis of multivariate incomplete failure time data by modeling marginal distributions , 1989 .

[5]  Michael I. Jordan,et al.  Mixed Membership Models for Time Series , 2013, Handbook of Mixed Membership Models and Their Applications.

[6]  Xin Zhou,et al.  A Survey of Predictive Maintenance: Systems, Purposes and Approaches , 2019, ArXiv.

[7]  Zoubin Ghahramani,et al.  An Introduction to Hidden Markov Models and Bayesian Networks , 2001, Int. J. Pattern Recognit. Artif. Intell..

[8]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[9]  Li Lin,et al.  Remaining useful life estimation of engineered systems using vanilla LSTM neural networks , 2018, Neurocomputing.

[10]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[11]  Yee Whye Teh,et al.  Sharing Clusters among Related Groups: Hierarchical Dirichlet Processes , 2004, NIPS.

[12]  Warren B. Powell,et al.  Tutorial on Stochastic Optimization in Energy—Part I: Modeling and Policies , 2016, IEEE Transactions on Power Systems.

[13]  Jr. G. Forney,et al.  Viterbi Algorithm , 1973, Encyclopedia of Machine Learning.

[14]  Leslie Pack Kaelbling,et al.  Acting Optimally in Partially Observable Stochastic Domains , 1994, AAAI.

[15]  P. Koprinkova-Hristova,et al.  Reinforcement Learning for Predictive Maintenance of Industrial Plants , 2013 .

[16]  Andrew Gelman,et al.  The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo , 2011, J. Mach. Learn. Res..

[17]  Warren B. Powell,et al.  A Unified Framework for Optimization Under Uncertainty , 2016 .

[18]  Guy Shani,et al.  Noname manuscript No. (will be inserted by the editor) A Survey of Point-Based POMDP Solvers , 2022 .

[19]  John W. Paisley,et al.  Markov Mixed Membership Models , 2015, ICML.

[20]  Marek J. Druzdzel,et al.  How to interpret the results of medical time series data analysis: Classical statistical approaches versus dynamic Bayesian network modeling , 2016, Journal of pathology informatics.

[21]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.