Paper Information - Multitask Feature Selection with Task Descriptors

Multitask Feature Selection with Task Descriptors

Machine learning applications in precision medicine are severely limited by the scarcity of data to learn from. Indeed, training data often contains many more features than samples. To alleviate the resulting statistical issues, the multitask learning framework proposes to learn different but related tasks jointly, rather than independently, by sharing information between them. Within this framework, jointly regularizing the model parameters yields models that have few non-zero coefficients and share similar sparsity patterns across tasks. We propose a new regularized multitask approach that incorporates task descriptors and thereby modulates the amount of information shared between tasks according to their similarity. We show on simulated data that this method outperforms other multitask feature selection approaches, particularly when data is scarce. In addition, we demonstrate on peptide-MHC-I binding data that the proposed approach can make predictions for new tasks for which no training data is available.
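As an illustration of how joint regularization can produce a sparsity pattern shared across tasks, here is a minimal sketch of row-wise group soft-thresholding, the proximal operator of the standard l2,1 penalty used in multitask lasso. This is a generic building block and not the method proposed in the paper (it uses no task descriptors); the function name `group_soft_threshold`, the toy coefficient matrix, and the threshold value are all assumptions chosen for illustration.

```python
import math


def group_soft_threshold(B, threshold):
    """Row-wise proximal operator of the l2,1 norm.

    B is a list of rows: one row per feature, one column per task.
    A feature whose coefficient vector has Euclidean norm at most
    `threshold` is zeroed for every task at once; other rows are
    shrunk toward zero while keeping their direction.
    """
    result = []
    for row in B:
        norm = math.sqrt(sum(b * b for b in row))
        if norm <= threshold:
            # Feature is dropped jointly for all tasks.
            result.append([0.0] * len(row))
        else:
            scale = 1.0 - threshold / norm
            result.append([scale * b for b in row])
    return result


# Hypothetical coefficients: 3 features (rows) x 2 tasks (columns).
B = [
    [3.0, 4.0],    # strong in both tasks (norm 5)
    [0.3, -0.4],   # weak in both tasks (norm 0.5)
    [0.0, 2.0],    # used by the second task only (norm 2)
]

shrunk = group_soft_threshold(B, threshold=1.0)

# A feature is selected if it is non-zero for at least one task.
support = [any(b != 0.0 for b in row) for row in shrunk]
print(support)
```

Because the penalty acts on whole rows, a feature is either kept for all tasks or discarded for all tasks, which is what makes the selected feature set common to the tasks.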

Véronique Stoven | Chloé-Agathe Azencott | Victor Bellon
