Ensemble semi-supervised Fisher discriminant analysis model for fault classification in industrial processes.

In this paper, an ensemble form of the semi-supervised Fisher Discriminant Analysis (FDA) model is developed for fault classification in industrial processes. This method uses the K Nearest Neighbor (KNN) algorithm to merge the metric level outputs, which are obtained by the sub-classifiers in the ensemble model, to get the final classification result. An adaptive form is further proposed to improve the classification performance by putting forward to a new method of weight adjustment. While semi-supervised learning can generate a better model by exploiting additional information contained in unlabeled data, ensemble learning achieves the promotion of algorithm robustness by integrating a series of weak learners. In addition, the property of diversity in ensemble learning can be boosted by incorporating different unlabeled data to different weak learners. Therefore, the combination of those two methods can achieve great generalization for the fault classification model. The performances of two proposed methods are evaluated through an industrial benchmark process.

[1]  Zhiqiang Ge,et al.  Distributed Parallel PCA for Modeling and Monitoring of Large-Scale Plant-Wide Processes With Big Data , 2017, IEEE Transactions on Industrial Informatics.

[2]  Zhiqiang Ge,et al.  Review and big data perspectives on robust data mining approaches for industrial process modeling with outliers and missing data , 2018, Annu. Rev. Control..

[3]  Xin Yao,et al.  Ensemble learning via negative correlation , 1999, Neural Networks.

[4]  Zhiqiang Ge,et al.  Semi-supervised Fisher discriminant analysis model for fault classification in industrial processes , 2014 .

[5]  Anoushiravan Farshidianfar,et al.  Rolling element bearings multi-fault classification based on the wavelet denoising and support vector machine , 2007 .

[6]  O. Chapelle,et al.  Semi-Supervised Learning (Chapelle, O. et al., Eds.; 2006) [Book reviews] , 2009, IEEE Transactions on Neural Networks.

[7]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[8]  Christos Georgakis,et al.  Plant-wide control of the Tennessee Eastman problem , 1995 .

[9]  Zhiqiang Ge,et al.  Review on data-driven modeling and monitoring for plant-wide industrial processes , 2017 .

[10]  Bruce E. Rosen,et al.  Ensemble Learning Using Decorrelated Neural Networks , 1996, Connect. Sci..

[11]  B. Kulicke,et al.  Neural network approach to fault classification for high speed protective relaying , 1995 .

[12]  E. F. Vogel,et al.  A plant-wide industrial process control problem , 1993 .

[13]  Zhiqiang Ge,et al.  Process Data Analytics via Probabilistic Latent Variable Models: A Tutorial Review , 2018, Industrial & Engineering Chemistry Research.

[14]  Feng Jian,et al.  Process monitoring for chemical process based on semi-supervised principal component analysis , 2013, 2013 25th Chinese Control and Decision Conference (CCDC).

[15]  Yuan Yao,et al.  Semi-supervised mixture discriminant monitoring for chemical batch processes , 2014 .

[16]  Yi Liu,et al.  SemiBoost: Boosting for Semi-Supervised Learning , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Moisès Graells,et al.  A semi-supervised approach to fault diagnosis for chemical processes , 2010, Comput. Chem. Eng..

[18]  N.S.D. Brito,et al.  Fault detection and classification in transmission lines based on wavelet transform and ANN , 2006, IEEE Transactions on Power Delivery.

[19]  Ayhan Demiriz,et al.  Semi-Supervised Support Vector Machines , 1998, NIPS.

[20]  Furong Gao,et al.  Review of Recent Research on Data-Based Process Monitoring , 2013 .

[21]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[22]  Peter Meer,et al.  Semi-Supervised Kernel Mean Shift Clustering , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[24]  Zhiqiang Ge,et al.  Large-scale plant-wide process modeling and hierarchical monitoring: A distributed Bayesian network approach , 2017 .

[25]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[26]  Nikunj C. Oza,et al.  Online Ensemble Learning , 2000, AAAI/IAAI.

[27]  Zhiqiang Ge,et al.  Scalable Semisupervised GMM for Big Data Quality Prediction in Multimode Processes , 2019, IEEE Transactions on Industrial Electronics.

[28]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[29]  Donghai Guan,et al.  Semi-supervised learning using frequent itemset and ensemble learning for SMS classification , 2015, Expert Syst. Appl..

[30]  Richard D. Braatz,et al.  Fault Detection and Diagnosis in Industrial Systems , 2001 .

[31]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[32]  Jian Ma,et al.  Sentiment classification: The contribution of ensemble learning , 2014, Decis. Support Syst..

[33]  Zhiqiang Ge,et al.  Distributed predictive modeling framework for prediction and diagnosis of key performance index in plant-wide processes , 2017 .

[34]  Xin Yao,et al.  The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift , 2010, IEEE Transactions on Knowledge and Data Engineering.

[35]  Zhiqiang Ge,et al.  Data Mining and Analytics in the Process Industry: The Role of Machine Learning , 2017, IEEE Access.

[36]  S. Joe Qin,et al.  Process data analytics in the era of big data , 2014 .

[37]  Zhiqiang Ge,et al.  Big data quality prediction in the process industry: A distributed parallel modeling framework , 2018, Journal of Process Control.