MLCTR: A Fast Scalable Coupled Tensor Completion Based on Multi-Layer Non-Linear Matrix Factorization

Firms earning prediction plays a vital role in investment decisions, dividends expectation, and share price. It often involves multiple tensor-compatible datasets with non-linear multi-way relationships, spatiotemporal structures, and different levels of sparsity. Current non-linear tensor completion algorithms tend to learn noisy embedding and incur overfitting. This paper focuses on the embedding learning aspect of the tensor completion problem and proposes a new multi-layer neural network architecture for tensor factorization and completion (MLCTR). The network architecture entails multiple advantages: a series of low-rank matrix factorizations (MF) building blocks to minimize overfitting, interleaved transfer functions in each layer for non-linearity, and by-pass connections to reduce the gradient diminishing problem and increase the depths of neural networks. Furthermore, the model employs Stochastic Gradient Descent (SGD) based optimization for fast convergence in training. Our algorithm is highly efficient for imputing missing values in the EPS data. Experiments confirm that our strategy of incorporating non-linearity in factor matrices demonstrates impressive performance in embedding learning and end-toend tensor models, and outperforms approaches with nonlinearity in the phase of reconstructing tensors from factor

[1]  Richard A. Harshman,et al.  Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-model factor analysis , 1970 .

[2]  Nitesh V. Chawla,et al.  Neural Tensor Factorization for Temporal Interaction Learning , 2019, WSDM.

[3]  Pierre-Antoine Absil,et al.  Coupled tensor decomposition: A step towards robust components , 2016, 2016 24th European Signal Processing Conference (EUSIPCO).

[4]  Rong Pan,et al.  Personalized Tag Recommendation through Nonlinear Tensor Factorization Using Gaussian Kernel , 2015, AAAI.

[5]  L. Tucker,et al.  Some mathematical notes on three-mode factor analysis , 1966, Psychometrika.

[6]  Xiaoli Li,et al.  Rank-GeoFM: A Ranking based Geographical Factorization Method for Point of Interest Recommendation , 2015, SIGIR.

[7]  Rasmus Bro,et al.  Coupled Matrix Factorization with Sparse Factors to Identify Potential Biomarkers in Metabolomics , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[8]  Jieping Ye,et al.  Tensor Completion for Estimating Missing Values in Visual Data , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Sundaresh Ramnath,et al.  The Financial Analyst Forecasting Literature: A Taxonomy with Suggestions for Further Research , 2008 .

[10]  D. Bradley,et al.  Before an Analyst Becomes an Analyst: Does Industry Experience Matter? , 2015 .

[11]  Samuel Kaski,et al.  Bayesian multi-tensor factorization , 2016, Machine Learning.

[12]  Yan Liu,et al.  CoSTCo: A Neural Tensor Completion Model for Sparse Tensors , 2019, KDD.

[13]  Lee Sael,et al.  Scalable Tucker Factorization for Sparse Tensors - Algorithms and Discoveries , 2017, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[14]  Zenglin Xu,et al.  NeuralCP: Bayesian Multiway Data Analysis with Neural Tensor Decomposition , 2018, Cognitive Computation.

[15]  P. Corredor,et al.  The role of sentiment and stock characteristics in the translation of analysts’ forecasts into recommendations , 2019, The North American Journal of Economics and Finance.

[16]  Tamara G. Kolda,et al.  All-at-once Optimization for Coupled Matrix and Tensor Factorizations , 2011, ArXiv.

[17]  Hisashi Kashima,et al.  Tensor factorization using auxiliary information , 2011, Data Mining and Knowledge Discovery.

[18]  Eric C. So,et al.  Uncovering Expected Returns: Information in Analyst Coverage Proxies , 2016 .

[19]  Jin Fan,et al.  Improved Coupled Tensor Factorization with Its Applications in Health Data Analysis , 2019, Complex..

[20]  Evangelos E. Papalexakis,et al.  Constrained Coupled Matrix-Tensor Factorization and its Application in Pattern and Topic Detection , 2018, 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[21]  Philip S. Yu,et al.  DuSK: A Dual Structure-preserving Kernel for Supervised Tensor Learning with Applications to Neuroimages , 2014, SDM.

[22]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Michael S. Drake,et al.  A re-examination of analysts’ superiority over time-series forecasts of annual earnings , 2012 .

[24]  Eric Ghysels,et al.  Automated Earnings Forecasts:- Beat Analysts or Combine and Conquer? , 2017, Manag. Sci..

[25]  James Caverlee,et al.  Multi-Aspect Streaming Tensor Completion , 2017, KDD.

[26]  Zenglin Xu,et al.  Distributed Flexible Nonlinear Tensor Factorization , 2016, NIPS.

[27]  Yan Liu,et al.  Learning Temporal Causal Graphs for Relational Time-Series Analysis , 2010, ICML.

[28]  Yan Liu,et al.  Spatial-temporal causal modeling for climate change attribution , 2009, KDD.

[29]  Tamara G. Kolda,et al.  Scalable Tensor Factorizations for Incomplete Data , 2010, ArXiv.

[30]  B. Recht,et al.  Tensor completion and low-n-rank tensor recovery via convex optimization , 2011 .

[31]  Hwanjo Yu,et al.  Discriminative and Distinct Phenotyping by Constrained Tensor Factorization , 2017, Scientific Reports.