The matrix stick-breaking process for flexible multi-task learning

In multi-task learning our goal is to design regression or classification models for each of the tasks and appropriately share information between tasks. A Dirichlet process (DP) prior can be used to encourage task clustering. However, the DP prior does not allow local clustering of tasks with respect to a subset of the feature vector without making independence assumptions. Motivated by this problem, we develop a new multitask-learning prior, termed the matrix stick-breaking process (MSBP), which encourages cross-task sharing of data. However, the MSBP allows separate clustering and borrowing of information for the different feature components. This is important when tasks are more closely related for certain features than for others. Bayesian inference proceeds by a Gibbs sampling algorithm and the approach is illustrated using a simulated example and a multi-national application.

[1]  L. Carin,et al.  The Matrix Stick-Breaking Process , 2008 .

[2]  J Ashby,et al.  The OECD program to validate the rat uterotrophic bioassay to screen compounds for in vivo estrogenic responses: phase 1. , 2001, Environmental health perspectives.

[3]  Lawrence Carin,et al.  Multi-Task Learning for Classification with Dirichlet Process Priors , 2007, J. Mach. Learn. Res..

[4]  J. E. Griffin,et al.  Order-Based Dependent Dirichlet Processes , 2006 .

[5]  S. MacEachern,et al.  An ANOVA Model for Dependent Random Measures , 2004 .

[6]  Rich Caruana,et al.  Multitask Learning , 1997, Machine-mediated learning.

[7]  N. Pillai,et al.  Bayesian density regression , 2007 .

[8]  Sebastian Thrun,et al.  Learning to Learn , 1998, Springer US.

[9]  John Geweke,et al.  Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments , 1991 .

[10]  S. MacEachern Decision Theoretic Aspects of Dependent Nonparametric Processes , 2000 .

[11]  J. Sethuraman A CONSTRUCTIVE DEFINITION OF DIRICHLET PRIORS , 1991 .

[12]  A. Gelfand,et al.  Dirichlet Process Mixed Generalized Linear Models , 1997 .

[13]  Volker Tresp,et al.  A nonparametric hierarchical bayesian framework for information filtering , 2004, SIGIR '04.

[14]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[15]  Lancelot F. James,et al.  Gibbs Sampling Methods for Stick-Breaking Priors , 2001 .

[16]  T. Ferguson A Bayesian Analysis of Some Nonparametric Problems , 1973 .

[17]  A. Raftery,et al.  How Many Iterations in the Gibbs Sampler , 1991 .

[18]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[19]  P. Müller,et al.  A method for combining inference across related nonparametric Bayesian models , 2004 .