Development of Robust and Physically Interpretable Soft Sensor for Industrial Distillation Column Using Transfer Learning with Small Datasets

In the development of soft sensors for industrial processes, the availability of data for data-driven modeling is usually limited, which led to overfitting and lack of interpretability when conventional deep learning models were used. In this study, the proposed soft sensor development methodology combining first-principle simulations and transfer learning was used to address these problems. Source-domain models were obtained using a large amount of data generated by dynamic simulations. They were then fine-tuned by a limited amount of real plant data to improve their prediction accuracies on the target domain and guaranteed the models with correct domain knowledge. An industrial C4 separation column operating at a refining unit was used as an example to illustrate the effectiveness of this approach. Results showed that fine-tuned networks could obtain better accuracy and improved interpretability compared to a simple feedforward network with or without regularization, especially when the amount of actual data available was small. For some secondary effects, such as interaction gain, its interpretability is mainly based on the interpretability of the corresponding source models.

[1]  Luigi Fortuna,et al.  Comparison of Soft-Sensor Design Methods for Industrial Plants Using Small Data Sets , 2009, IEEE Transactions on Instrumentation and Measurement.

[2]  Yoshua Bengio,et al.  Deep Learning of Representations for Unsupervised and Transfer Learning , 2011, ICML Unsupervised and Transfer Learning.

[3]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[4]  Chandan Singh,et al.  Definitions, methods, and applications in interpretable machine learning , 2019, Proceedings of the National Academy of Sciences.

[5]  Bogdan Gabrys,et al.  Data-driven Soft Sensors in the process industry , 2009, Comput. Chem. Eng..

[6]  Rogério Schmidt Feris,et al.  SpotTune: Transfer Learning Through Adaptive Fine-Tuning , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Manabu Kano,et al.  Soft‐sensor development using correlation‐based just‐in‐time modeling , 2009 .

[8]  R. K. Wood,et al.  Terminal composition control of a binary distillation column , 1973 .

[9]  Xia Hu,et al.  Techniques for interpretable machine learning , 2018, Commun. ACM.

[10]  Xuefeng Yan,et al.  Hybrid Model of Industrial p-Xylene Oxidation Incorporated Fractional Kinetic Model with Intelligent Models , 2013 .

[11]  Xiong Luo,et al.  Regression and classification using extreme learning machine based on L1-norm and L2-norm , 2016, Neurocomputing.

[12]  Jiaying Liu,et al.  Revisiting Batch Normalization For Practical Domain Adaptation , 2016, ICLR.

[13]  William L. Luyben,et al.  Simple Dynamic Gasifier Model That Runs in Aspen Dynamics , 2008 .

[14]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[15]  Murat Kulahci,et al.  The Role of Big Data in Industrial (Bio)chemical Process Operations , 2020 .

[16]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[17]  Hiromasa Kaneko,et al.  Moving Window and Just-in-Time Soft Sensor Model Based on Time Differences Considering a Small Number of Measurements , 2015 .

[18]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[19]  Peng Qiu,et al.  Data-driven analysis and optimization of externally heat-integrated distillation columns (EHIDiC) , 2019 .

[20]  Bogdan Gabrys,et al.  Local learning‐based adaptive soft sensor for catalyst activation prediction , 2011 .

[21]  Rafiqul Gani,et al.  A generalized model for distillation columns—I: Model description and applications , 1986 .

[22]  Liwei Wang,et al.  The Expressive Power of Neural Networks: A View from the Width , 2017, NIPS.

[23]  Salvatore Ingrassia,et al.  Neural Network Modeling for Small Datasets , 2005, Technometrics.

[24]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[25]  Xue-feng Yan,et al.  Hybrid model for main and side reactions of p-xylene oxidation with factor influence based monotone additive SVR , 2014 .

[26]  Peter L. Bartlett,et al.  The Sample Complexity of Pattern Classification with Neural Networks: The Size of the Weights is More Important than the Size of the Network , 1998, IEEE Trans. Inf. Theory.