Performing Multi-Target Regression via a Parameter Sharing-Based Deep Network

Multi-target regression (MTR) comprises the prediction of multiple continuous target variables from a common set of input variables. There are two major challenges when addressing the MTR problem: the exploration of the inter-target dependencies and the modeling of complex input-output relationships. This paper proposes a neural network model that is able to simultaneously address these two challenges in a flexible way. A deep architecture well suited for learning multiple continuous outputs is designed, providing some flexibility to model the inter-target relationships by sharing network parameters as well as the possibility to exploit target-specific patterns by learning a set of nonshared parameters for each target. The effectiveness of the proposal is analyzed through an extensive experimental study on 18 datasets, demonstrating the benefits of using a shared representation that exploits the commonalities between target variables. According to the experimental results, the proposed model is competitive with respect to the state-of-the-art in MTR.

[1]  Jonathan Baxter,et al.  A Bayesian/Information Theoretic Model of Learning to Learn via Multiple Task Sampling , 1997, Machine Learning.

[2]  Concha Bielza,et al.  A survey on multi‐output regression , 2015, WIREs Data Mining Knowl. Discov..

[3]  S. Džeroski,et al.  Using single- and multi-target regression trees and ensembles to model a compound index of vegetation condition , 2009 .

[4]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[5]  Leonardo Franco,et al.  Layer multiplexing FPGA implementation for deep back-propagation learning , 2017, Integr. Comput. Aided Eng..

[6]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[7]  Maria L. Rizzo,et al.  Measuring and testing dependence by correlation of distances , 2007, 0803.4101.

[8]  Jonathan Baxter,et al.  Learning internal representations , 1995, COLT '95.

[9]  Hadi Meidani,et al.  Deep Learning for Accelerated Seismic Reliability Analysis of Transportation Networks , 2017, Comput. Aided Civ. Infrastructure Eng..

[10]  D. Rom A sequentially rejective test procedure based on a modified Bonferroni inequality , 1990 .

[11]  Oral Büyüköztürk,et al.  Deep Learning‐Based Crack Damage Detection Using Convolutional Neural Networks , 2017, Comput. Aided Civ. Infrastructure Eng..

[12]  Saeid Sanei,et al.  Deep Neural Architectures for Mapping Scalp to Intracranial EEG , 2018, Int. J. Neural Syst..

[13]  Xiaofei He,et al.  Multi-Target Regression via Robust Low-Rank Learning , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Ivan Bratko,et al.  First Order Regression , 1997, Machine Learning.

[15]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[16]  Grigorios Tsoumakas,et al.  MULAN: A Java Library for Multi-Label Learning , 2011, J. Mach. Learn. Res..

[17]  Yicheng Li,et al.  A Fast Detection Method via Region‐Based Fully Convolutional Neural Networks for Shield Tunnel Lining Defects , 2018, Comput. Aided Civ. Infrastructure Eng..

[18]  Donato Malerba,et al.  Leveraging the power of local spatial autocorrelation in geophysical interpolative clustering , 2014, Data Mining and Knowledge Discovery.

[19]  Hossein Hashemi,et al.  End‐to‐End Deep Learning Methodology for Real‐Time Traffic Network Management , 2018, Comput. Aided Civ. Infrastructure Eng..

[20]  Habib Fardoun,et al.  A locally weighted learning method based on a data gravitation model for multi-target regression , 2018, Int. J. Comput. Intell. Syst..

[21]  Saso Dzeroski,et al.  Ensembles of Multi-Objective Decision Trees , 2007, ECML.

[22]  Vojislav Kecman,et al.  Multi-target support vector regression via correlation regressor chains , 2017, Inf. Sci..

[23]  Dinggang Shen,et al.  Cognitive Assessment Prediction in Alzheimer’s Disease by Multi-Layer Multi-Target Regression , 2018, Neuroinformatics.

[24]  Yang Liu,et al.  Automated Pixel‐Level Pavement Crack Detection on 3D Asphalt Surfaces Using a Deep‐Learning Network , 2017, Comput. Aided Civ. Infrastructure Eng..

[25]  Sebastián Ventura,et al.  Evolutionary Strategy to Perform Batch-Mode Active Learning on Multi-Label Data , 2018, ACM Trans. Intell. Syst. Technol..

[26]  Y. Yao,et al.  On Early Stopping in Gradient Descent Learning , 2007 .

[27]  Albert Bifet,et al.  Deep learning in partially-labeled data streams , 2015, SAC.

[28]  Grigorios Tsoumakas,et al.  Multi-target Regression via Random Linear Target Combinations , 2014, ECML/PKDD.

[29]  Fernando Pérez-Cruz,et al.  SVM multiregression for nonlinear channel estimation in multiple-input multiple-output systems , 2004, IEEE Transactions on Signal Processing.

[30]  Athanasios Tsanas,et al.  Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools , 2012 .

[31]  Hojjat Adeli,et al.  A novel unsupervised deep learning model for global and local health condition assessment of structures , 2018 .

[32]  Chaoyang Zhang,et al.  Deep learning architectures for multi-label classification of intelligent health risk prediction , 2017, BMC Bioinformatics.

[33]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[34]  Francesco Carlo Morabito,et al.  Deep Learning Representation from Electroencephalography of Early-Stage Creutzfeldt-Jakob Disease and Features for Differentiation from Rapidly Progressive Dementia , 2017, Int. J. Neural Syst..

[35]  G. Hommel,et al.  Improvements of General Multiple Test Procedures for Redundant Systems of Hypotheses , 1988 .

[36]  Rich Caruana,et al.  Multitask Learning: A Knowledge-Based Source of Inductive Bias , 1993, ICML.

[37]  Ying Liu,et al.  Real time prediction for converter gas tank levels based on multi-output least square support vector regressor , 2012 .

[38]  Paul Roe,et al.  Multi-Label Classification of Frog Species via Deep Learning , 2017, 2017 IEEE 13th International Conference on e-Science (e-Science).

[39]  Saso Dzeroski,et al.  Predicting Chemical Parameters of River Water Quality from Bioindicator Data , 2000, Applied Intelligence.

[40]  Boguslaw Cyganek,et al.  Image recognition with deep neural networks in presence of noise - Dealing with and taking advantage of distortions , 2017, Integr. Comput. Aided Eng..

[41]  Hojjat Adeli,et al.  A novel machine learning‐based algorithm to detect damage in high‐rise building structures , 2017 .

[42]  Khalid M. Mosalam,et al.  Deep Transfer Learning for Image‐Based Structural Damage Recognition , 2018, Comput. Aided Civ. Infrastructure Eng..

[43]  Grigorios Tsoumakas,et al.  Multi-target regression via input space expansion: treating targets as inputs , 2012, Machine Learning.

[44]  Grigorios Tsoumakas,et al.  An empirical study on sea water quality prediction , 2008, Knowl. Based Syst..

[45]  I-Cheng Yeh,et al.  Modeling slump flow of concrete using second-order regressions and artificial neural networks , 2007 .

[46]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[47]  Pang-Ning Tan,et al.  Position Preserving Multi-Output Prediction , 2013, ECML/PKDD.