论文信息 - Optimization for Gaussian Processes via Chaining

Optimization for Gaussian Processes via Chaining

In this paper, we consider the problem of stochastic optimization under a bandit feedback model. We generalize the GP-UCB algorithm [Srinivas and al., 2012] to arbitrary kernels and search spaces. To do so, we use a notion of localized chaining to control the supremum of a Gaussian process, and provide a novel optimization scheme based on the computation of covering numbers. The theoretical bounds we obtain on the cumulative regret are more generic and present the same convergence rates as the GP-UCB algorithm. Finally, the algorithm is shown to be empirically more efficient than its natural competitors on simple and complex input spaces.

N. Vayatis | C. Malherbe | E. Contal

[1] R. Dudley. The Sizes of Compact Subsets of Hilbert Space and Continuity of Gaussian Processes , 1967 .

[2] David S. Johnson. Approximation algorithms for combinatorial problems , 1973, STOC '73.

[3] Jonas Mockus,et al. Bayesian Approach to Global Optimization , 1989 .

[4] D. Pollard. Empirical Processes: Theory and Applications , 1990 .

[5] Ran Raz,et al. A sub-constant error-probability low-degree test, and a sub-constant error-probability PCP characterization of NP , 1997, STOC '97.

[6] Donald R. Jones,et al. Efficient Global Optimization of Expensive Black-Box Functions , 1998, J. Glob. Optim..

[7] Hans-Peter Kriegel,et al. Shortest-path kernels on graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[8] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[9] Michael A. Osborne. Bayesian Gaussian processes for sequential prediction, optimisation and quadrature , 2010 .

[10] John Shawe-Taylor,et al. Regret Bounds for Gaussian Process Bandit Problems , 2010, AISTATS 2010.

[11] Andreas Krause,et al. Contextual Gaussian Process Bandit Optimization , 2011, NIPS.