On Random Subsampling of Gaussian Process Regression: A Graphon-Based Analysis

In this paper, we study random subsampling for Gaussian process regression, one of the simplest approximation baselines, from a theoretical perspective. Although subsampling discards a large part of the training data, we show provable guarantees on the accuracy of the predictive mean and variance and on their generalization ability. For the analysis, we embed kernel matrices into graphons, which absorb differences in sample size and enable us to evaluate the approximation and generalization errors in a unified manner. The experimental results show that the subsampling approximation achieves a better trade-off between accuracy and runtime than the Nystr\"{o}m and random Fourier feature methods.
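To make the baseline concrete, the following is a minimal sketch of the subsampling approximation: fit an exact GP regressor, but only on a uniform random subsample of the training set. The function names (`rbf_kernel`, `subsampled_gp_predict`), the choice of an RBF kernel, and the noise and lengthscale values are illustrative assumptions, not the paper's exact setup.

```python
import numpy as np

def rbf_kernel(A, B, lengthscale=1.0):
    # Squared-exponential (RBF) kernel matrix between the rows of A and B.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2.0 * lengthscale ** 2))

def subsampled_gp_predict(X, y, X_test, m, noise=0.1, seed=0):
    """GP predictive mean/variance computed from a uniform random
    subsample of size m, so the linear solve costs O(m^3), not O(n^3)."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(X), size=m, replace=False)
    Xs, ys = X[idx], y[idx]

    # Standard GP regression equations, restricted to the subsample.
    K = rbf_kernel(Xs, Xs) + noise ** 2 * np.eye(m)
    K_star = rbf_kernel(X_test, Xs)
    alpha = np.linalg.solve(K, ys)
    mean = K_star @ alpha
    v = np.linalg.solve(K, K_star.T)
    var = rbf_kernel(X_test, X_test).diagonal() - np.einsum("ij,ji->i", K_star, v)
    return mean, var
```

The subsample size `m` controls the accuracy/runtime trade-off the abstract refers to: prediction cost drops from cubic in the full sample size n to cubic in m, at the price of discarding n - m observations.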
