An Integrated Model of kNN and GBDT for Fault Diagnosis of Wheel on Railway Vehicle

Wheelsets are critically important for the safety operation of rail vehicle. The real vibration data of railway vehicle wheel, obtained from a Chinese subway company, is used in this article. Principal component analysis (PCA) is conducted to reduce the dimension of the feature indexes. We propose an integrating algorithm with k-Nearest Neighbours (kNN) and gradient boosting decision tree (GBDT) to deal with the typical imbalance and divergent characteristics and satisfy the high requirement of fault detection accuracy. The results show that the classification accuracy of kNN-GBDT reaches 94.66%, 82.35%. While, the kNN and the SVM miss classified all fault samples into normal condition, and GBDT got an accuracy of 56.82% in fault detection. The entire process of the proposed model finished in about 0.35s. Our kNN-GBDT integrated algorithm satisfies the requirements of real-time performance and accuracy for online fault detection.

[1]  Stephen V. Stehman,et al.  Selecting and interpreting measures of thematic classification accuracy , 1997 .

[2]  Chen Lu,et al.  Fault diagnosis of gearbox using empirical mode decomposition and multi-fractal detrended cross-correlation analysis , 2016 .

[3]  Gary M. Bone,et al.  Fault detection and diagnosis of diesel engine valve trains , 2016 .

[4]  Paolo Pennacchi,et al.  The relationship between kurtosis- and envelope-based indexes for the diagnostic of rolling element bearings , 2014 .

[5]  Lixiang Duan,et al.  Segmented infrared image analysis for rotating machinery fault diagnosis , 2016 .

[6]  Haibo He,et al.  ADASYN: Adaptive synthetic sampling approach for imbalanced learning , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[7]  Tielin Shi,et al.  Nondestructive diagnosis of flip chips based on vibration analysis using PCA-RBF , 2017 .

[8]  N Qin Ensemble empirical mode decomposition and fuzzy entropy in fault feature analysis for high-speed train bogie , 2014 .

[9]  Ankit Bansal,et al.  Chemometrics tools used in analytical chemistry: an overview. , 2014, Talanta.

[10]  Nathalie Japkowicz,et al.  The class imbalance problem: A systematic study , 2002, Intell. Data Anal..

[11]  Hari Wijayanto,et al.  SMOTE Bagging Algorithm for Imbalanced Dataset in Logistic Regression Analysis (Case: Credit of Bank X) , 2015 .

[12]  V. Sugumaran,et al.  Exploiting sound signals for fault diagnosis of bearings using decision tree , 2013 .

[13]  David E. Booth,et al.  Chemometrics: Data Analysis for the Laboratory and Chemical Plant , 2004, Technometrics.

[14]  Anna Barbati,et al.  Stochastic gradient boosting classification trees for forest fuel types mapping through airborne laser scanning and IRS LISS-III imagery , 2013, Int. J. Appl. Earth Obs. Geoinformation.

[15]  R. A. Mollineda,et al.  The class imbalance problem in pattern classification and learning , 2009 .

[16]  Ai Yanting,et al.  Fusion information entropy method of rolling bearing fault diagnosis based on n-dimensional characteristic parameter distance , 2017 .

[17]  Tung Khac Truong,et al.  A Roller Bearing Fault Diagnosis Method Based on LCD Energy Entropy and ACROA-SVM , 2014 .

[18]  Mir Mohammad Ettefagh,et al.  Diagnosis of combined faults in Rotary Machinery by Non-Naive Bayesian approach , 2017 .

[19]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[20]  Guiming Mei,et al.  A Novel Fault Diagnosis Model for Bearing of Railway Vehicles Using Vibration Signals Based on Symmetric Alpha-Stable Distribution Feature Extraction , 2016 .

[21]  Jing Tian,et al.  Motor Bearing Fault Detection Using Spectral Kurtosis-Based Feature Extraction Coupled With K-Nearest Neighbor Distance Analysis , 2016, IEEE Transactions on Industrial Electronics.