Preparation of comprehensive data from huge data sets for predictive soft sensors

Abstract Soft sensors predict values of process variables that are difficult to measure in real time. Predictive ability of adaptive soft sensors depends on databases. However, there is no way to construct initial databases from huge data sets that are measured in plants and that are data rich, but information poor. Therefore, we propose a method to select comprehensive data from huge data sets to build soft sensors with high predictive ability. A genetic algorithm and the Kennard–Stone algorithm are modified for data selection considering predictive ability of regression models and data distribution. Through the analyses of numerical simulation data and real industrial data, we confirm that initial databases could be appropriately constructed from huge data sets and predictive accuracy of soft sensors subsequently increased.

[1]  Manabu Kano,et al.  Estimation of active pharmaceutical ingredients content using locally weighted partial least squares and statistical wavelength selection. , 2011, International journal of pharmaceutics.

[2]  Jie Yu,et al.  A quality relevant non‐Gaussian latent subspace projection method for chemical process monitoring and fault detection , 2014 .

[3]  S. Qin Recursive PLS algorithms for adaptive data modeling , 1998 .

[4]  Stefan Schaal,et al.  Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning , 2002, Applied Intelligence.

[5]  Hiromasa Kaneko,et al.  Database monitoring index for adaptive soft sensors and the application to industrial process , 2014 .

[6]  Shapour Azarm,et al.  An accumulative error based adaptive design of experiments for offline metamodeling , 2009 .

[7]  L. A. Stone,et al.  Computer Aided Design of Experiments , 1969 .

[8]  Hiromasa Kaneko,et al.  Development of a New Index to Monitor Database for Soft Sensors , 2013 .

[9]  Roberto Kawakami Harrop Galvão,et al.  A method for calibration and validation subset partitioning. , 2005, Talanta.

[10]  Hiromasa Kaneko,et al.  Adaptive soft sensor based on online support vector regression and Bayesian ensemble learning for various states in chemical plants , 2014 .

[11]  Manabu Kano,et al.  Data-based process monitoring, process control, and quality improvement: Recent developments and applications in steel industry , 2008, Comput. Chem. Eng..

[12]  Hiromasa Kaneko,et al.  Development of Soft Sensor Models Based on Time Difference of Process Variables with Accounting for Nonlinear Relationship , 2011 .

[13]  Andrey Bogomolov,et al.  Selecting optimal wavelength intervals for an optical sensor: A case study of milk fat and total protein analysis in the region 400-1100 nm , 2015 .

[14]  Hiromasa Kaneko,et al.  Maintenance-free soft sensor models with time difference of process variables , 2011 .

[15]  Hiromasa Kaneko,et al.  Application of Online Support Vector Regression for Soft Sensors , 2014 .

[16]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[17]  Lei Wu,et al.  Adaptive soft sensor modeling framework based on just-in-time learning and kernel partial least squares regression for nonlinear multiphase batch processes , 2014, Comput. Chem. Eng..

[18]  S. Joe Qin,et al.  Subspace approach to multidimensional fault identification and reconstruction , 1998 .

[19]  Desire L. Massart,et al.  Representative subset selection , 2002 .

[20]  Hiromasa Kaneko,et al.  Adaptive database management based on the database monitoring index for long-term use of adaptive soft sensors , 2015 .

[21]  Min-Sen Chiu,et al.  Adaptive generalized predictive control based on JITL technique , 2009 .

[22]  Xuefeng Yan,et al.  Just‐in‐time reorganized PCA integrated with SVDD for chemical process monitoring , 2014 .

[23]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[24]  Jizhen Liu,et al.  An adaptive least squares support vector machine model with a novel update for NOx emission prediction , 2015 .

[25]  Bogdan Gabrys,et al.  Local learning‐based adaptive soft sensor for catalyst activation prediction , 2011 .

[26]  Zhiqiang Ge,et al.  External analysis‐based regression model for robust soft sensing of multimode chemical processes , 2014 .

[27]  Bogdan Gabrys,et al.  Data-driven Soft Sensors in the process industry , 2009, Comput. Chem. Eng..

[28]  Bogdan Gabrys,et al.  Review of adaptation mechanisms for data-driven soft sensors , 2011, Comput. Chem. Eng..

[29]  B. Bakshi Multiscale PCA with application to multivariate statistical process monitoring , 1998 .

[30]  Manabu Kano,et al.  Monitoring independent components for fault detection , 2003 .

[31]  Hiromasa Kaneko,et al.  Classification of the Degradation of Soft Sensor Models and Discussion on Adaptive Models , 2013 .

[32]  Manabu Kano,et al.  Soft‐sensor development using correlation‐based just‐in‐time modeling , 2009 .

[33]  Min-Sen Chiu,et al.  Partitioned model-based IMC design using JITL modeling technique , 2007 .

[34]  Hiromasa Kaneko,et al.  Fast optimization of hyperparameters for support vector regression models with highly predictive ability , 2015 .

[35]  Thomas F. Edgar,et al.  Process monitoring using principal components in parallel coordinates , 2013 .

[36]  Hiromasa Kaneko,et al.  Development of a new soft sensor method using independent component analysis and partial least squares , 2009 .

[37]  Zhiqiang Ge,et al.  Robust modeling of mixture probabilistic principal component analysis and process monitoring application , 2014 .

[38]  Hiromasa Kaneko,et al.  A soft sensor method based on values predicted from multiple intervals of time difference for improvement and estimation of prediction accuracy , 2011 .

[39]  M. Chiu,et al.  A new data-based methodology for nonlinear process modeling , 2004 .

[40]  John F. MacGregor,et al.  Process monitoring and diagnosis by multiblock PLS methods , 1994 .

[41]  Luigi Fortuna,et al.  Soft Sensors for Monitoring and Control of Industrial Processes (Advances in Industrial Control) , 2006 .

[42]  D. Massart,et al.  The Mahalanobis distance , 2000 .