Fault-diagnosis for reciprocating compressors using big data and machine learning

Abstract Reciprocating compressors are widely used in petroleum industry. A small fault in reciprocating compressor may cause serious issues in operation. Traditional regular maintenance and fault diagnosis solutions cannot efficiently detect potential faults in reciprocating compressors. This paper proposes a fault-diagnosis system for reciprocating compressors. It applies machine-learning techniques to data analysis and fault diagnosis. The raw data is denoised first. Then the denoised data is sparse coded to train a dictionary. Based on the learned dictionary, potential faults are finally recognized and classified by support vector machine (SVM). The system is evaluated by using 5-year operation data collected from an offshore oil corporation in a cloud environment. The collected data is evenly divided into two halves. One half is used for training, and the other half is used for testing. The results demonstrate that the proposed system can efficiently diagnose potential faults in compressors with more than 80% accuracy, which represents a better result than the current practice.

[1]  Yuefei Wang,et al.  Fault diagnosis of reciprocating compressor valve with the method integrating acoustic emission signal and simulated valve motion , 2015 .

[2]  AbdiHervé,et al.  Principal Component Analysis , 2010, Essentials of Pattern Recognition.

[3]  Yinong Chen,et al.  A Novel Multi-Focus Image Fusion Method Based on Stochastic Coordinate Coding and Local Density Peaks Clustering , 2016, Future Internet.

[4]  Chih-Jen Lin,et al.  Training and Testing Low-degree Polynomial Data Mappings via Linear SVM , 2010, J. Mach. Learn. Res..

[5]  Jian Sun,et al.  A direct method for power system corrective control to relieve current violation in transient with UPFCs by barrier functions , 2016 .

[6]  Wei-Tek Tsai,et al.  Fault-Diagnosis for Reciprocating Compressors Using Big Data , 2016, 2016 IEEE Second International Conference on Big Data Computing Service and Applications (BigDataService).

[7]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[8]  Andrew Ball,et al.  Numerical simulation and experimental study of a two-stage reciprocating compressor for condition monitoring , 2008 .

[9]  Yi Chai,et al.  A Geometric Dictionary Learning Based Approach for Fluorescence Spectroscopy Image Fusion , 2017 .

[10]  Wei-Tek Tsai,et al.  Test-Algebra Execution in a Cloud Environment , 2014, 2014 IEEE 8th International Symposium on Service Oriented System Engineering.

[11]  John Wright,et al.  Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Matrices via Convex Optimization , 2009, NIPS.

[12]  Genady Grabarnik,et al.  Sparse Modeling: Theory, Algorithms, and Applications , 2014 .

[13]  Qiong Zhang,et al.  An Integrated Dictionary-Learning Entropy-Based Medical Image Fusion Framework , 2017, Future Internet.

[14]  Wei-Tek Tsai,et al.  Integrated Adaptive Reasoning Testing Framework with Automated Fault Detection , 2015, 2015 IEEE Symposium on Service-Oriented System Engineering.

[15]  Ángel Fernando Kuri Morales,et al.  Evolutionary Training of SVM for Multiple Category Classification Problems with Self-adaptive Parameters , 2006, IBERAMIA-SBIA.

[16]  Wei-Tek Tsai,et al.  Concurrent Test Algebra Execution with Combinatorial Testing , 2014, 2014 IEEE 8th International Symposium on Service Oriented System Engineering.

[17]  Yi Chai,et al.  A Novel Geometric Dictionary Construction Approach for Sparse Representation Based Image Fusion , 2017, Entropy.

[18]  Qingyang Li,et al.  Stochastic Coordinate Coding and Its Application for Drosophila Gene Expression Pattern Annotation , 2014, ArXiv.

[19]  Fatos Xhafa,et al.  Stochastic simulation and modelling of metabolic networks in a machine learning framework , 2011, Simul. Model. Pract. Theory.

[20]  Hong Zhu,et al.  Tenant-based access control model for multi-tenancy and sub-tenancy architecture in Software-as-a-Service , 2017, Frontiers of Computer Science.

[21]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[22]  H. Abdi,et al.  Principal component analysis , 2010 .

[23]  Yong Luo,et al.  A cloud-based triage log analysis and recovery framework , 2017, Simul. Model. Pract. Theory.

[24]  G. Sapiro,et al.  A collaborative framework for 3D alignment and classification of heterogeneous subvolumes in cryo-electron tomography. , 2013, Journal of structural biology.

[25]  Yi Chai,et al.  A novel multi-modality image fusion method based on image decomposition and sparse representation , 2017, Inf. Sci..

[26]  B. Walker Deepwater horizon oil spill. , 2010, Journal of environmental health.

[27]  Jian Sun,et al.  L-infinity event-triggered networked control under time-varying communication delay with communication cost reduction , 2015, Journal of the Franklin Institute.

[28]  Wei-Tek Tsai,et al.  Scalable SaaS Indexing Algorithms with Automated Redundancy and Recovery Management , 2013, Int. J. Softw. Informatics.

[29]  Wei-Tek Tsai,et al.  Integrated fault detection and test algebra for combinatorial testing in TaaS (Testing-as-a-Service) , 2016, Simul. Model. Pract. Theory.

[30]  Wei-Tek Tsai,et al.  Autonomous Decentralized Combinatorial Testing , 2015, 2015 IEEE Twelfth International Symposium on Autonomous Decentralized Systems.

[31]  Wei-Tek Tsai,et al.  DICB: Dynamic Intelligent Customizable Benign Pricing Strategy for Cloud Computing , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[32]  Jian Sun,et al.  Frequency Regulation of Power Systems with Self-Triggered Control under the Consideration of Communication Costs , 2017 .

[33]  Jian Sun,et al.  BLDC motor speed control system fault diagnosis based on LRGF neural network and adaptive lifting scheme , 2014, Appl. Soft Comput..

[34]  Helen D. Karatza,et al.  High-performance modelling and simulation for big data applications , 2017, Simul. Model. Pract. Theory.