A comparative study of deep and shallow predictive techniques for hot metal temperature prediction in blast furnace ironmaking

Abstract To realize stable operation of the ironmaking process, it is important to predict hot metal temperature (HMT) in a blast furnace. Recently, deep learning is emerging as a highly active area of research. Nonetheless, no thorough study has yet appeared comparing the performance of deep learning methods to the shallow learning methods in predicting HMT. This paper provides a comparative study on the deep and shallow predictive methods for the current time and multi-step-ahead HMT predictions. Three advanced deep predictive methods and seven effective shallow predictive methods are investigated from the application point of view. Both the deep and shallow predictive methods were applied to an industrial blast furnace, where the prediction performance and computational time of ten methods were evaluated. The results demonstrated that (1) shallow neural network is preferred for current time HMT prediction; (2) Gaussian process regression and support vector regression are preferred for multi-step-ahead HMT predictions.

[1]  Biao Huang,et al.  Dynamic Modeling, Predictive Control and Performance Monitoring: A Data-driven Subspace Approach , 2008 .

[2]  Manabu Kano,et al.  Estimation of active pharmaceutical ingredients content using locally weighted partial least squares and statistical wavelength selection. , 2011, International journal of pharmaceutics.

[3]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[4]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[5]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[6]  Jay H. Lee,et al.  Machine learning: Overview of the recent progresses and implications for the process systems engineering field , 2017, Comput. Chem. Eng..

[7]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[8]  Xinmin Zhang,et al.  Quality Prediction in Complex Batch Processes with Just-in-Time Learning Model Based on Non-Gaussian Dissimilarity Measure , 2015 .

[9]  Pierre Alliez,et al.  Convolutional Neural Networks for Large-Scale Remote-Sensing Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[10]  D. Basak,et al.  Support Vector Regression , 2008 .

[11]  Yi Yang,et al.  A discriminative CNN video representation for event detection , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[13]  Jianzhou Wang,et al.  Short-term wind speed prediction using empirical wavelet transform and Gaussian process regression , 2015 .

[14]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[15]  Klaus-Robert Müller,et al.  Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.

[16]  Martin T. Hagan,et al.  Gauss-Newton approximation to Bayesian learning , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[17]  Lin Shi,et al.  Model of Hot Metal Silicon Content in Blast Furnace Based on Principal Component Analysis Application and Partial Least Square , 2011 .

[18]  Xiangang Li,et al.  Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition , 2014, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[20]  Manabu Kano,et al.  Defect Data Modeling and Analysis for Improving Product Quality and Productivity in Steel Industry , 2018 .

[21]  Zhiqiang Ge,et al.  Data Mining and Analytics in the Process Industry: The Role of Machine Learning , 2017, IEEE Access.

[22]  Koichi Fujiwara,et al.  Virtual sensing technology in process industries: Trends and challenges revealed by recent industria , 2013 .

[23]  Jin Wang,et al.  A feature-based soft sensor for spectroscopic data analysis , 2019, Journal of Process Control.

[24]  T. Bhattacharya Prediction of Silicon Content in Blast Furnace Hot Metal Using Partial Least Squares (PLS) , 2005 .

[25]  M. Kano,et al.  Development and Validation of Kinematical Blast Furnace Model with Long-term Operation Data , 2018, ISIJ International.

[26]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[27]  H. Abdi Partial least squares regression and projection on latent structure regression (PLS Regression) , 2010 .

[28]  Andreas Krause,et al.  A tutorial on Gaussian process regression: Modelling, exploring, and exploiting functions , 2016, bioRxiv.

[29]  Chuanhou Gao,et al.  Modeling of the Thermal State Change of Blast Furnace Hearth With Support Vector Machines , 2012, IEEE Transactions on Industrial Electronics.

[30]  A. Yu,et al.  Modeling of Internal State and Performance of an Ironmaking Blast Furnace: Slot vs Sector Geometries , 2016, Metallurgical and Materials Transactions B.

[31]  Tao Wang,et al.  End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[32]  T. H. Kjeldsen A Contextualized Historical Analysis of the Kuhn–Tucker Theorem in Nonlinear Programming: The Impact of World War II , 2000 .

[33]  G. De’ath Boosted trees for ecological modeling and prediction. , 2007, Ecology.

[34]  Laurence T. Yang,et al.  A survey on deep learning for big data , 2018, Inf. Fusion.

[35]  Jean Ponce,et al.  A Theoretical Analysis of Feature Pooling in Visual Recognition , 2010, ICML.

[36]  Antanas Verikas,et al.  Mining data with random forests: A survey and results of new tests , 2011, Pattern Recognit..

[37]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[38]  Bogdan Gabrys,et al.  Data-driven Soft Sensors in the process industry , 2009, Comput. Chem. Eng..

[39]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[40]  S. Wold,et al.  PLS-regression: a basic tool of chemometrics , 2001 .

[41]  Peter H. N. de With,et al.  Fast Training of Object Detection Using Stochastic Gradient Descent , 2010, 2010 20th International Conference on Pattern Recognition.

[42]  Morimasa Ogawa,et al.  The state of the art in chemical process control in Japan: Good practice and questionnaire survey , 2010 .