Maintaining model efficiency, avoiding bias and reducing input parameter volume in compressor fault classification

With the exponential growth in data collection and storage and the necessity for timely prognostic health monitoring of industrial processes traditional methods of data analysis are becoming redundant. Big data sets and huge volumes of inputs give rise to equally massive computational requirements. In this paper the differences in input parameter selection using a subset of the original variables and using data reduction techniques are compared. Each selection procedure being illustrated by both statistical methods and machine learning techniques. It is shown that the subsequent classification models are highly comparable. Finally the merits of a combined multivariate statistical and wavelet decomposition approach is considered. Techniques are applied to output signals from an experimental compressor rig.

[1]  Tian Han,et al.  Fault Diagnosis System of Induction Motors Based on Multiscale Entropy and Support Vector Machine with Mutual Information Algorithm , 2016 .

[2]  M. Victor Wickerhauser,et al.  Adapted wavelet analysis from theory to software , 1994 .

[3]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[4]  Michael E. Tipping Sparse Bayesian Learning and the Relevance Vector Machine , 2001, J. Mach. Learn. Res..

[5]  J. E. Jackson Multivariate Statistical Methods , 1986 .

[6]  S. Joe Qin,et al.  Multivariate process monitoring and fault diagnosis by multi-scale PCA , 2002 .

[7]  B. Everitt,et al.  Applied Multivariate Data Analysis: Everitt/Applied Multivariate Data Analysis , 2001 .

[8]  William Volk,et al.  Applied statistics for engineers , 1971 .

[9]  Fengshou Gu,et al.  Selection of Input Parameters for Multivariate Classifiersin Proactive Machine Health Monitoring by Clustering Envelope Spectrum Harmonics , 2015 .

[10]  Stephen R. Marsland,et al.  Machine Learning - An Algorithmic Perspective , 2009, Chapman and Hall / CRC machine learning and pattern recognition series.

[11]  B. Everitt,et al.  Applied Multivariate Data Analysis. , 1993 .

[12]  Bo-Suk Yang,et al.  Condition classification of small reciprocating compressor for refrigeration using artificial neural networks and support vector machines , 2005 .

[13]  O. Weck,et al.  A COMPARISON OF PARTICLE SWARM OPTIMIZATION AND THE GENETIC ALGORITHM , 2005 .

[14]  B. Manly Multivariate Statistical Methods : A Primer , 1986 .

[15]  B. Bakshi Multiscale PCA with application to multivariate statistical process monitoring , 1998 .

[16]  C. Williams Applied Multivariate Data Analysis (2nd Edition) , 2002 .

[17]  Fengshou Gu,et al.  Fault diagnosis of reciprocating compressors using revelance vector machines with a genetic algorithm based on vibration data , 2014, 2014 20th International Conference on Automation and Computing.

[18]  Karsten P. Ulland,et al.  Vii. References , 2022 .