Convolved Multi-output Gaussian Processes for Semi-Supervised Learning

Multi-output learning has become in a strong field of research in machine learning community during the last years. This setup considers the occurrence of multiple and related tasks in real-world problems. Another approach called semi-supervised learning (SSL) is the middle point between the case where all training samples are labeled (supervised learning) and the case where all training samples are unlabeled (unsupervised learning). In many applications it is difficult or impossible to access to fully labeled data. At these scenarios, SSL becomes a very useful methodology to achieve successful results, either for regression or for classification. In this paper, we propose the use of kernels for vector-valued functions for Gaussian process multi-output regression in the context of semi-supervised learning. We combine a Gaussian process with process convolution (PC) type of covariance function with techniques commonly used in semi-supervised learning like the Expectation-Maximization (EM) algorithm, and Graph-based regularization. We test our proposed method in two widely used databases for multi-output regression. Results obtained by our method exhibit a better performance compared to supervised methods based on Gaussian processes in scenarios where there are not available a good amount of labeled data.

[1]  Stefan Schaal,et al.  Incremental Online Learning in High Dimensions , 2005, Neural Computation.

[2]  Neil D. Lawrence,et al.  Computationally Efficient Convolved Multiple Output Gaussian Processes , 2011, J. Mach. Learn. Res..

[3]  Kian Ming Adam Chai,et al.  Variational Multinomial Logit Gaussian Process , 2012, J. Mach. Learn. Res..

[4]  Marco Loog,et al.  Semi-Supervised Nearest Mean Classification Through a Constrained Log-Likelihood , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[5]  D. Higdon Space and Space-Time Modeling using Process Convolutions , 2002 .

[6]  Marco Loog Semi-supervised linear discriminant analysis through moment-constraint parameter estimation , 2014, Pattern Recognit. Lett..

[7]  M. Álvarez,et al.  Variational Inducing Kernels for Sparse Convolved Multiple Output G aussian Processes , 2009, 0912.3268.

[8]  Rich Caruana,et al.  Multitask Learning , 1997, Machine Learning.

[9]  Guido Sanguinetti,et al.  Semisupervised Multitask Learning With Gaussian Processes , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[10]  Jason Weston,et al.  Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins , 2010, Bioinform..

[11]  Neil D. Lawrence,et al.  Kernels for Vector-Valued Functions: a Review , 2011, Found. Trends Mach. Learn..

[12]  S. Sathiya Keerthi,et al.  Semi-supervised multi-task learning of structured prediction models for web information extraction , 2011, CIKM '11.

[13]  Massimiliano Pontil,et al.  Convex multi-task feature learning , 2008, Machine Learning.

[14]  Anton Schwaighofer,et al.  Learning Gaussian processes from multiple tasks , 2005, ICML.

[15]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[16]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[17]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[18]  Neil D. Lawrence,et al.  Learning to learn with the informative vector machine , 2004, ICML.

[19]  Dit-Yan Yeung,et al.  Semi-Supervised Multi-Task Regression , 2009, ECML/PKDD.

[20]  Guido Sanguinetti,et al.  Bayesian Multitask Classification With Gaussian Process Priors , 2011, IEEE Transactions on Neural Networks.

[21]  Elisa Guerrero Vázquez,et al.  Multiple comparison procedures applied to model selection , 2002, Neurocomputing.

[22]  Nicu Sebe,et al.  Semisupervised learning of classifiers: theory, algorithms, and their application to human-computer interaction , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Edwin V. Bonilla,et al.  Kernel Multi-task Learning using Task-specific Features , 2007, AISTATS.