论文信息 - Flexible Modeling of Latent Task Structures in Multitask Learning - 字舞流文

Flexible Modeling of Latent Task Structures in Multitask Learning

Multitask learning algorithms are typically designed assuming some fixed, a priori known latent structure shared by all the tasks. However, it is usually unclear what type of latent task structure is the most appropriate for a given multitask learning problem. Ideally, the "right" latent task structure should be learned in a data-driven manner. We present a flexible, nonparametric Bayesian model that posits a mixture of factor analyzers structure on the tasks. The nonparametric aspect makes the model expressive enough to subsume many existing models of latent task structures (e.g, mean-regularized tasks, clustered tasks, low-rank or linear/non-linear subspace assumption on tasks, etc.). Moreover, it can also learn more general task structures, addressing the shortcomings of such models. We present a variational inference algorithm for our model. Experimental results on synthetic and real-world datasets, on both regression and classification problems, demonstrate the effectiveness of the proposed method.

Jacques Wainer | Hal Daumé | Piyush Rai | Alexandre Passos | Alexandre Passos | Hal Daumé | Piyush Rai | J. Wainer

[1] Jorge Nocedal,et al. Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[2] Lawrence Carin,et al. Nonparametric factor analysis with beta process priors , 2009, ICML '09.

[3] Yee Whye Teh,et al. Stick-breaking Construction for the Indian Buffet Process , 2007, AISTATS.

[4] Andrew Slater,et al. The Learning Behind Gmail Priority Inbox , 2010 .

[5] Michael I. Jordan,et al. Hierarchical Beta Processes and the Indian Buffet Process , 2007, AISTATS.

[6] Thomas L. Griffiths,et al. Infinite latent feature models and the Indian buffet process , 2005, NIPS.

[7] Tong Zhang,et al. A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , 2005, J. Mach. Learn. Res..

[8] Michael I. Jordan,et al. Variational inference for Dirichlet process mixtures , 2006 .

[9] M. Elad,et al. $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[10] Michael I. Jordan,et al. An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[11] Lancelot F. James,et al. Gibbs Sampling Methods for Stick-Breaking Priors , 2001 .

[12] Massimiliano Pontil,et al. Multi-Task Feature Learning , 2006, NIPS.

[13] Jean-Philippe Vert,et al. Clustered Multi-Task Learning: A Convex Formulation , 2008, NIPS.

[14] Rich Caruana,et al. Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[15] Zoubin Ghahramani,et al. Variational Inference for Bayesian Mixtures of Factor Analysers , 1999, NIPS.

[16] A. Bruckstein,et al. K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[17] Edwin V. Bonilla,et al. Multi-task Gaussian Process Prediction , 2007, NIPS.

[18] T. Ferguson. A Bayesian Analysis of Some Nonparametric Problems , 1973 .

[19] Hal Daumé,et al. Learning Task Grouping and Overlap in Multi-task Learning , 2012, ICML.

[20] Alex Acero,et al. Adaptation of Maximum Entropy Capitalizer: Little Data Can Help a Lo , 2006, Comput. Speech Lang..

[21] Hal Daumé,et al. Infinite Predictor Subspace Models for Multitask Learning , 2010, AISTATS.

[22] Kristen Grauman,et al. Learning with Whom to Share in Multi-task Feature Learning , 2011, ICML.

[23] Rajat Raina,et al. Constructing informative priors using transfer learning , 2006, ICML.

[24] Michael I. Jordan,et al. A Variational Approach to Bayesian Logistic Regression Models and their Extensions , 1997, AISTATS.

[25] David B. Dunson,et al. Compressive Sensing on Manifolds Using a Nonparametric Mixture of Factor Analyzers: Algorithm and Performance Bounds , 2010, IEEE Transactions on Signal Processing.

[26] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.

[27] Hal Daumé,et al. Learning Multiple Tasks using Manifold Regularization , 2010, NIPS.

[28] Yoshua Bengio,et al. Bias learning, knowledge sharing , 2003, IEEE Trans. Neural Networks.

[29] Lawrence Carin,et al. Multi-Task Learning for Classification with Dirichlet Process Priors , 2007, J. Mach. Learn. Res..

[30] George Karypis,et al. Multi-task learning for recommender systems , 2010, ACML 2010.

[31] Dit-Yan Yeung,et al. A Convex Formulation for Learning Task Relationships in Multi-Task Learning , 2010, UAI.

[32] Yiming Yang,et al. Learning Multiple Related Tasks using Latent Independent Component Analysis , 2005, NIPS.

[33] Massimiliano Pontil,et al. An Algorithm for Transfer Learning in a Heterogeneous Environment , 2008, ECML/PKDD.

[34] Thomas L. Griffiths,et al. Modeling Transfer Learning in Human Categorization with the Hierarchical Dirichlet Process , 2010, ICML.

[35] Yee Whye Teh,et al. Variational Inference for the Indian Buffet Process , 2009, AISTATS.