Multiple Tensor-on-Tensor Regression: An Approach for Modeling Processes With Heterogeneous Sources of Data

With advancements in sensor technology, heterogeneous sets of data such as those containing scalars, waveform signals, images, and even structured point clouds, are becoming increasingly popular. Developing statistical models based on such heterogeneous sets of data to represent the behavior of an underlying system can be used in the monitoring, control, and optimization of the system. Unfortunately, available methods only focus on the scalar and curve data and do not provide a general framework for integrating different sources of data to construct a model. This paper addresses the problem of estimating a process output, measured by a scalar, curve, image, or structured point cloud by a set of heterogeneous process variables such as scalar process setting, sensor readings, and images. We introduce a general multiple tensoron-tensor regression (MTOT) approach in which each set of input data (predictor) and output measurements are represented by tensors. We formulate a linear regression model between the input and output tensors and estimate the parameters by minimizing a least square loss function. In order to avoid overfitting and reduce the number of parameters to be estimated, we decompose the model parameters using several bases that span the input and output spaces. Next, we learn the bases and their spanning coefficients when minimizing the loss function using a block coordinate descent algorithm combined with an alternating least square (ALS) algorithm. We show that such a minimization has a closed-form solution in each iteration and can be computed very efficiently. Through several simulation and case studies, we evaluate the performance of the proposed method. The results reveal the advantage of the proposed method over some benchmarks in the literature in terms of the mean square prediction error.

[1]  Eric F. Lock,et al.  Tensor-on-Tensor Regression , 2017, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[2]  Xiaoshan Li,et al.  Tucker Tensor Regression and Neuroimaging Analysis , 2018, Statistics in Biosciences.

[3]  Ruiyan Luo,et al.  Function-on-Function Linear Regression by Signal Compression , 2017 .

[4]  Vatsal Sharan,et al.  Orthogonalized ALS: A Theoretically Principled Tensor Decomposition Algorithm for Practical Use , 2017, ICML.

[5]  A. Çeçen,et al.  Development of high throughput assays for establishing process-structure-property linkages in multiphase polycrystalline metals: Application to dual-phase steels , 2017 .

[6]  Ciro Cattuto,et al.  Detecting Anomalies in Time-Varying Networks Using Tensor Decomposition , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[7]  Ana-Maria Staicu,et al.  Penalized function-on-function regression , 2015, Comput. Stat..

[8]  Hao Yan,et al.  Image-Based Process Monitoring Using Low-Rank Tensor Decomposition , 2015, IEEE Transactions on Automation Science and Engineering.

[9]  Wolfgang Jank,et al.  FUNCTIONAL RESPONSE ADDITIVE MODEL ESTIMATION WITH ONLINE VIRTUAL STOCK MARKETS , 2014, 1502.00818.

[10]  Nelson Felix,et al.  Characterization of wafer geometry and overlay error on silicon wafers with nonuniform stress , 2013 .

[11]  Kevin T. Turner,et al.  Role of wafer geometry in wafer chucking , 2013 .

[12]  Hongtu Zhu,et al.  Tensor Regression with Applications in Neuroimaging Data Analysis , 2012, Journal of the American Statistical Association.

[13]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[14]  Waldemar Wójcik,et al.  Combustion diagnosis by image processing , 2009 .

[15]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[16]  Y. Nishi,et al.  Handbook of Semiconductor Manufacturing Technology , 2007 .

[17]  Philip S. Yu,et al.  Window-based Tensor Analysis on High-dimensional and Multi-aspect Streams , 2006, Sixth International Conference on Data Mining (ICDM'06).

[18]  Carl Duchesne,et al.  Multivariate Image Analysis of Flames for Product Quality and Combustion Control in Rotary Kilns , 2006 .

[19]  T. Kolda Multilinear operators for higher-order decompositions , 2006 .

[20]  Jane-ling Wang,et al.  Functional linear regression analysis for longitudinal data , 2005, math/0603132.

[21]  H. Müller,et al.  FUNCTIONAL RESPONSE MODELS , 2004 .

[22]  Honglu Yu,et al.  Multivariate image analysis and regression for prediction of coating content and distribution in the production of snack foods , 2003 .

[23]  R. Carroll,et al.  The relationship between virologic and immunologic responses in AIDS clinical research using mixed-effects varying-coefficient models with measurement error. , 2003, Biostatistics.

[24]  H. Kiers Towards a standardized notation and terminology in multiway analysis , 2000 .

[25]  G. He,et al.  EXTENDING CORRELATION AND REGRESSION FROM MULTIVARIATE TO FUNCTIONAL DATA , 2000 .

[26]  E Bellon,et al.  Experimental Teleradiology. Novel Telematics Services Using Image Processing, Hypermedia and Remote Cooperation to Improve Image-Based Medical Decision Making , 1995, Journal of telemedicine and telecare.