A Stacked Autoencoder With Sparse Bayesian Regression for End-Point Prediction Problems in Steelmaking Process

The steelmaking process in the iron and steel industry involves complicated physicochemical reactions. The main aim of steelmaking is to adjust the quality of molten steel. During the steel-tapping process, the temperature and carbon content are the most essential quality indices for end-point prediction. This article presents a novel machine learning framework for the end-point prediction problems in the smelting process. Considering the importance of data representation in modeling, the original data are inputted to a stacked autoencoder (SAE) to extract the essential features in an unsupervised manner. The top layer is then designed as a sparse Bayesian regression (SBR) layer to obtain the predicted mean values and error bars that measure the uncertainty in the prediction. To improve the generalization ability of the prediction model, an intelligent optimization algorithm based on improved differential evolution (DE) is used to optimize the hyperparameters of the model. The main advantage of this model is that the underlying characteristics of the samples can be learned automatically layer by layer, instead of designing them manually. Finally, the effectiveness of the proposed method is verified using real data collected from two steel plants. The experimental results show that the proposed model gives a more precise prediction than other existing models and can provide error bars for the end-point prediction. Note to Practitioners—The end-point prediction is critical to the quality of products produced in the iron and steel production process. Because of the complexity of the physiochemical reactions during smelting, it is difficult to build mechanism models for the complex environment. Generally, expert knowledge and experience are needed for real production. The motivation behind this article was to establish a data-driven model through machine learning methods to solve end-point prediction problems. First, a stacked autoencoder (SAE) was used to extract the essential information from the original data. Second, sparse Bayesian regression (SBR) was applied to the top layer. Thus, the predicted mean values and error bars could be obtained to improve the robustness of the prediction model. Furthermore, an improved differential evolution (DE) algorithm was designed to adaptively optimize the hyperparameters of the model. In the experiments, the proposed method was applied to two real steelmaking production processes. The results verify the effectiveness of the proposed model. This article can be extended to other processes such as continuous casting and reheating furnace. The model can also be generalized for other practical industries to improve the product quality and enable safer production.

[1]  Henry C. W. Lau,et al.  Application of Genetic Algorithms to Solve the Multidepot Vehicle Routing Problem , 2010, IEEE Transactions on Automation Science and Engineering.

[2]  Avimanyu Sahoo,et al.  Neural Network-Based Event-Triggered State Feedback Control of Nonlinear Continuous-Time Systems , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Zhi-Long Chen,et al.  Integrated Charge Batching and Casting Width Selection at Baosteel , 2014, Oper. Res..

[4]  Hong Wang,et al.  Data-Driven Nonlinear Subspace Modeling for Prediction and Control of Molten Iron Quality Indices in Blast Furnace Ironmaking , 2017, IEEE Transactions on Control Systems Technology.

[5]  Jhareswar Maiti,et al.  Process control strategies for a steel making furnace using ANN with bayesian regularization and ANFIS , 2010, Expert Syst. Appl..

[6]  Jiu-sun Zeng,et al.  Data-driven predictive control for blast furnace ironmaking process , 2010, Comput. Chem. Eng..

[7]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[8]  Chang Liu,et al.  A Dynamic Analytics Method Based on Multistage Modeling for a BOF Steelmaking Process , 2019, IEEE Transactions on Automation Science and Engineering.

[9]  Yajun Wang,et al.  Multiscale Neighborhood Normalization-Based Multiple Dynamic PCA Monitoring Method for Batch Processes With Frequent Operations , 2018, IEEE Transactions on Automation Science and Engineering.

[10]  Lixin Tang,et al.  An Improved Differential Evolution Algorithm for Practical Dynamic Scheduling in Steelmaking-Continuous Casting Production , 2014, IEEE Transactions on Evolutionary Computation.

[11]  Fang Deng,et al.  Sensor Multifault Diagnosis With Improved Support Vector Machines , 2017, IEEE Transactions on Automation Science and Engineering.

[12]  Weisi Lin,et al.  Learning a referenceless stereopair quality engine with deep nonnegativity constrained sparse autoencoder , 2018, Pattern Recognit..

[13]  Xianpeng Wang,et al.  A Hybrid Multiobjective Evolutionary Algorithm for Multiobjective Optimization Problems , 2013, IEEE Transactions on Evolutionary Computation.

[14]  Hong Zhang,et al.  Facial expression recognition via learning deep sparse autoencoders , 2018, Neurocomputing.

[15]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[16]  Zhiqiang Ge,et al.  Scalable Semisupervised GMM for Big Data Quality Prediction in Multimode Processes , 2019, IEEE Transactions on Industrial Electronics.

[17]  Zheng Tang,et al.  An artificial bee colony algorithm search guided by scale-free networks , 2019, Inf. Sci..

[18]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[19]  Walter Ukovich,et al.  An Integrated System for Production Scheduling in Steelmaking and Casting Plants , 2016, IEEE Transactions on Automation Science and Engineering.

[20]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[21]  Javier R. Movellan,et al.  Tracking Motion, Deformation, and Texture Using Conditionally Gaussian Processes , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Peter B. Luh,et al.  An Effective Subgradient Method for Scheduling a Steelmaking-Continuous Casting Process , 2015, IEEE Transactions on Automation Science and Engineering.

[23]  Christian Igel,et al.  Evolutionary tuning of multiple SVM parameters , 2005, ESANN.

[24]  Manabu Kano,et al.  Data-based process monitoring, process control, and quality improvement: Recent developments and applications in steel industry , 2008, Comput. Chem. Eng..

[25]  Fei He,et al.  Prediction model of end-point phosphorus content in BOF steelmaking process based on PCA and BP neural network , 2018, Journal of Process Control.

[26]  Leni J. Matos,et al.  The Relevance Vector Machine Applied to the Modeling of Wireless Channels , 2013, IEEE Transactions on Antennas and Propagation.

[27]  Hui Liu,et al.  Basic oxygen furnace steelmaking end-point prediction based on computer vision and general regression neural network , 2014 .

[28]  I. Arvanitidis,et al.  Physical modelling and control of dynamic foaming in an LD-converter process , 2000, Conference Record of the 2000 IEEE Industry Applications Conference. Thirty-Fifth IAS Annual Meeting and World Conference on Industrial Applications of Electrical Energy (Cat. No.00CH37129).

[29]  Yang Bai,et al.  A novel electrocardiogram arrhythmia classification method based on stacked sparse auto-encoders and softmax regression , 2017, International Journal of Machine Learning and Cybernetics.

[30]  Xiaolei Ma,et al.  Probabilistic Prediction of Bus Headway Using Relevance Vector Machine Regression , 2017, IEEE Transactions on Intelligent Transportation Systems.

[31]  Jacek M. Zurada,et al.  Nonredundant sparse feature extraction using autoencoders with receptive fields clustering , 2017, Neural Networks.

[32]  Oral Büyüköztürk,et al.  Deep Learning‐Based Crack Damage Detection Using Convolutional Neural Networks , 2017, Comput. Aided Civ. Infrastructure Eng..

[33]  Quan-Ke Pan,et al.  An Effective Artificial Bee Colony Algorithm for a Real-World Hybrid Flowshop Problem in Steelmaking Process , 2013, IEEE Transactions on Automation Science and Engineering.

[34]  Lin Li,et al.  Nonlinear Dynamic Soft Sensor Modeling With Supervised Long Short-Term Memory Network , 2020, IEEE Transactions on Industrial Informatics.

[35]  Hsien-Pin Hsu,et al.  Solving Feeder Assignment and Component Sequencing Problems for Printed Circuit Board Assembly Using Particle Swarm Optimization , 2017, IEEE Transactions on Automation Science and Engineering.

[36]  Fei He,et al.  Hybrid Model of Molten Steel Temperature Prediction Based on Ladle Heat Status and Artificial Neural Network , 2014 .

[37]  Soumik Sarkar,et al.  LLNet: A deep autoencoder approach to natural low-light image enhancement , 2015, Pattern Recognit..

[38]  Zhiqiang Ge,et al.  Big data quality prediction in the process industry: A distributed parallel modeling framework , 2018, Journal of Process Control.

[39]  Zhiqiang Ge,et al.  Deep Learning of Semisupervised Process Data With Hierarchical Extreme Learning Machine and Soft Sensor Application , 2018, IEEE Transactions on Industrial Electronics.

[40]  Zhiqiang Ge,et al.  Local Parameter Optimization of LSSVM for Industrial Soft Sensing With Big Data and Cloud Implementation , 2020, IEEE Transactions on Industrial Informatics.

[41]  Ling Jian,et al.  Exploiting Expertise Rules for Statistical Data-Driven Modeling , 2017, IEEE Transactions on Industrial Electronics.

[42]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevance Vector Machine , 2001 .

[43]  Haidong Shao,et al.  A novel deep autoencoder feature learning method for rotating machinery fault diagnosis , 2017 .

[44]  T. Brestovič,et al.  Measuring temperature of the atmosphere in the steelmaking furnace , 2015 .

[45]  Adel Nadjaran Toosi,et al.  Artificial fish swarm algorithm: a survey of the state-of-the-art, hybridization, combinatorial and indicative applications , 2012, Artificial Intelligence Review.

[46]  Qi Zhao,et al.  BOF endpoint prediction based on the flame radiation by hybrid SVC and SVR modeling , 2014 .

[47]  Kaoru Hirota,et al.  Softmax regression based deep sparse autoencoder network for facial emotion recognition in human-robot interaction , 2018, Inf. Sci..

[48]  Alexander Medvedev,et al.  Physical modelling and control of dynamic foaming in an LD-converter process , 2000 .

[49]  Chang Liu,et al.  Least squares support vector machine with self-organizing multiple kernel learning and sparsity , 2019, Neurocomputing.

[50]  Zhiqiang Ge,et al.  Probabilistic Sequential Network for Deep Learning of Complex Process Data and Soft Sensor Application , 2019, IEEE Transactions on Industrial Informatics.

[51]  Yangyu Fan,et al.  Unsupervised feature learning and automatic modulation classification using deep learning model , 2017, Phys. Commun..

[52]  Yangyu Fan,et al.  Automatic Modulation Classification Using Deep Learning Based on Sparse Autoencoders With Nonnegativity Constraints , 2017, IEEE Signal Processing Letters.

[53]  Shu-Tao Xia,et al.  A Novel Consistent Random Forest Framework: Bernoulli Random Forests , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[54]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[55]  Biao Huang,et al.  Deep Learning-Based Feature Representation and Its Application for Soft Sensor Modeling With Variable-Wise Weighted SAE , 2018, IEEE Transactions on Industrial Informatics.

[56]  Zhi-Zhong Mao,et al.  An Ensemble ELM Based on Modified AdaBoost.RT Algorithm for Predicting the Temperature of Molten Steel in Ladle Furnace , 2010, IEEE Transactions on Automation Science and Engineering.