A clustering approach for mining reliability big data for asset management

Big data from very large fleets of assets challenge the asset management, as the number of maintenance strategies to optimize and administrate may become very large. To address this issue, we exploit a clustering approach that identifies a small number of sets of assets with similar reliability behaviors. This enables addressing the maintenance strategy optimization issue once for all the assets belonging to the same cluster and, thus, introduces a strong simplification in the asset management. However, the clustering approach may lead to additional maintenance costs, due to the loss of refinement in the cluster reliability model. For this, we propose a cost model to support asset managers in trading off the simplification brought by the cluster-based approach against the related extra costs. The proposed approach is applied to a real case study concerning a set of more than 30,000 switch point machines.

[1]  Enrico Zio,et al.  Comparison of Weibayes and Markov Chain Monte Carlo methods for the reliability analysis of turbine nozzle components with right censored data only , 2015 .

[2]  Christian Bauckhage,et al.  Computing the Kullback-Leibler Divergence between two Weibull Distributions , 2013, ArXiv.

[3]  John C. Gower,et al.  Measures of Similarity, Dissimilarity and Distance , 1985 .

[4]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[5]  Roberto Nappi Integrated Maintenance: analysis and perspective of innovation in railway sector , 2014, ArXiv.

[6]  Yan-Fu Li,et al.  A SVM framework for fault detection of the braking system in a high speed train , 2017, Mechanical Systems and Signal Processing.

[7]  Yili Hong,et al.  Reliability Meets Big Data: Opportunities and Challenges , 2014 .

[8]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[9]  Patrice Aknin,et al.  Floating train data systems for preventive maintenance: A data mining approach , 2013, Proceedings of 2013 International Conference on Industrial Engineering and Systems Management (IESM).

[10]  Saeed Maghsoodloo,et al.  Renewal and Renewal-Intensity Functions with Minimal Repair , 2014 .

[11]  Enrico Zio,et al.  Handling reliability big data: A similarity-based approach for clustering a large fleet of assets , 2015 .

[12]  Enrico Zio,et al.  Some Challenges and Opportunities in Reliability Engineering , 2016, IEEE Transactions on Reliability.

[13]  J. Gower,et al.  Metric and Euclidean properties of dissimilarity coefficients , 1986 .

[14]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[15]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[16]  Enrico Zio,et al.  Unsupervised Clustering for Fault Diagnosis in Nuclear Power Plant Components , 2013, Int. J. Comput. Intell. Syst..

[17]  Uday Kumar,et al.  Railway Assets: A Potential Domain for Big Data Analytics , 2015, INNS Conference on Big Data.

[18]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Miguel Figueres-Esteban,et al.  The role of data visualization in Railway Big Data Risk Analysis , 2015 .

[20]  Nii O. Attoh-Okine Big data challenges in railway engineering , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[21]  Davide Anguita,et al.  Condition Based Maintenance in Railway Transportation Systems Based on Big Data Streaming Analysis , 2015, INNS Conference on Big Data.

[22]  Enrico Zio,et al.  Evaluating maintenance policies by quantitative modeling and analysis , 2013, Reliab. Eng. Syst. Saf..

[23]  Marvin Zelen,et al.  Mathematical Theory of Reliability , 1965 .