Failure Prediction of Hard Disk Drives Based on Adaptive Rao–Blackwellized Particle Filter Error Tracking Method

Active failure prediction of hard disk drives (HDDs) is critical to prevent data loss and spare parts replacement decisions. Existing methods for failure predictions of HDDs always used a binary classifier to distinguish the healthy or failed HDDs and cannot address the problem of variable degradation states. In this article, an adaptive error tracking method is proposed for the HDD failure prediction. This method regards the extracted degradation feature as time serials and uses a state filter to estimate the real-time HDD's health status. Then, the HDD failure online prediction is achieved according to the alarm threshold determined by the adaptive error tracking. The degradation of an HDD is described by a first-order Markov hybrid jump degradation model, and the advanced Rao–Blackwellized particle filter algorithm, together with the expectation-maximization (EM) algorithm, is derived to estimate the model parameters adaptively. Finally, to verify the effectiveness of the proposed method, an accelerated degradation test (ADT) based on the vibration was carried out. And the data from ADT and real data center show that the proposed method performs much better than the previous methods, such as Kalman filter, SVM, MD, and recurrent neural network (RNN) based methods, with respect to failure prediction and the alarm distance, which helps to backup data and optimize maintenance decision costs for users.

[1]  Wei Zheng,et al.  Fault diagnosis of train sensors based on evolutionary genetic Particle Filter , 2013, 2013 IEEE International Conference on Intelligent Rail Transportation Proceedings.

[2]  Gang Wang,et al.  Hard Drive Failure Prediction Using Classification and Regression Trees , 2014, 2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks.

[3]  Seiichi Sugaya Trends in enterprise hard disk drives , 2006 .

[4]  Joseph F. Murray,et al.  Improved disk-drive failure warnings , 2002, IEEE Trans. Reliab..

[5]  N. de Freitas Rao-Blackwellised particle filtering for fault diagnosis , 2002, Proceedings, IEEE Aerospace Conference.

[6]  Qiang Miao,et al.  Health monitoring of hard disk drive based on Mahalanobis distance , 2011, 2011 Prognostics and System Health Managment Confernece.

[7]  Yanyang Zi,et al.  Switching State-Space Degradation Model With Recursive Filter/Smoother for Prognostics of Remaining Useful Life , 2019, IEEE Transactions on Industrial Informatics.

[8]  Tie-Yan Liu,et al.  Health Status Assessment and Failure Prediction for Hard Drives with Recurrent Neural Networks , 2016, IEEE Transactions on Computers.

[9]  Nando de Freitas,et al.  Real-Time Monitoring of Complex Industrial Processes with Particle Filters , 2002, NIPS.

[10]  Weimin Zheng,et al.  Predicting Disk Failures with HMM- and HSMM-Based Approaches , 2010, ICDM.

[11]  Joseph F. Murray,et al.  Hard drive failure prediction using non-parametric statistical methods , 2003 .

[12]  Greg Hamerly,et al.  Bayesian approaches to failure prediction for disk drives , 2001, ICML.

[13]  Gang Wang,et al.  Proactive drive failure prediction for large scale storage systems , 2013, 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies (MSST).

[14]  Yu Wang,et al.  Tribological Degradation of Head–Disk Interface in Hard Disk Drives Under Accelerated Wear Condition , 2014, IEEE Transactions on Magnetics.

[15]  Joseph F. Murray,et al.  Machine Learning Methods for Predicting Failures in Hard Drives: A Multiple-Instance Application , 2005, J. Mach. Learn. Res..

[16]  Simon J. Godsill,et al.  On sequential Monte Carlo sampling methods for Bayesian filtering , 2000, Stat. Comput..

[17]  Tommy W. S. Chow,et al.  A Two-Step Parametric Method for Failure Prediction in Hard Disk Drives , 2014, IEEE Transactions on Industrial Informatics.

[18]  Fredrik Lindsten,et al.  Recursive Maximum Likelihood Identification of Jump Markov Nonlinear Systems , 2013, IEEE Transactions on Signal Processing.