Paper Information - Learning Multiple Tasks with Multilinear Relationship Networks

Deep networks trained on large-scale data can learn transferable features that benefit multi-task learning. Because deep features transition from general to specific along the layers of a deep network, a fundamental problem of multi-task learning is how to exploit the task relatedness underlying the parameter tensors and improve feature transferability in the multiple task-specific layers. This paper presents Multilinear Relationship Networks (MRN), which discover task relationships through novel tensor normal priors placed over the parameter tensors of multiple task-specific layers in deep convolutional networks. By jointly learning transferable features and the multilinear relationships of tasks and features, MRN alleviates the dilemma of negative transfer in the feature layers and under-transfer in the classifier layer. Experiments show that MRN yields state-of-the-art results on three multi-task learning datasets.
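The abstract does not spell out the prior, so the following is only a minimal sketch of the general idea behind a tensor normal prior, not the paper's actual objective. It assumes a task-specific layer whose weights for T tasks are stacked into a tensor `W` of shape (d1, d2, T), and a zero-mean prior whose covariance factorizes as a Kronecker product of one covariance matrix per mode. The function name `tensor_normal_penalty`, the helper `random_spd`, and all shapes are hypothetical choices made for illustration. The sketch computes the quadratic term of the negative log prior mode by mode and checks it against the explicit Kronecker form.

```python
import numpy as np

def tensor_normal_penalty(W, cov1, cov2, cov3):
    """Quadratic form vec(W)^T (cov1 kron cov2 kron cov3)^{-1} vec(W).

    Computed mode by mode, so the full Kronecker covariance is never built.
    The name and argument layout are illustrative assumptions.
    """
    P1, P2, P3 = (np.linalg.inv(c) for c in (cov1, cov2, cov3))
    return float(np.einsum("ijk,ia,jb,kc,abc->", W, P1, P2, P3, W))

def random_spd(n, rng):
    """Random symmetric positive definite matrix (test data only)."""
    A = rng.standard_normal((n, n))
    return A @ A.T + n * np.eye(n)

rng = np.random.default_rng(0)
d1, d2, T = 4, 3, 2                      # assumed: two feature modes, T tasks
W = rng.standard_normal((d1, d2, T))     # stacked task-specific weights
cov1 = random_spd(d1, rng)               # covariance over the first feature mode
cov2 = random_spd(d2, rng)               # covariance over the second feature mode
cov_tasks = random_spd(T, rng)           # covariance over tasks

fast = tensor_normal_penalty(W, cov1, cov2, cov_tasks)

# Reference: build the full (d1*d2*T) x (d1*d2*T) covariance explicitly.
# With C-order flattening, W.ravel() matches the ordering of kron(cov1, kron(cov2, cov_tasks)).
full_cov = np.kron(cov1, np.kron(cov2, cov_tasks))
w = W.ravel()
slow = float(w @ np.linalg.solve(full_cov, w))

print(np.isclose(fast, slow))
```

In this sketch, the task-mode covariance `cov_tasks` plays the role of a task relationship matrix: off-diagonal entries couple the weights of different tasks, while the identity matrix would penalize each task independently. The mode-by-mode form matters in practice because the explicit Kronecker covariance grows with the square of the total number of parameters.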
