Gaussian Processes for Regression and Optimisation

Gaussian processes have proved to be useful and powerful constructs for the purposes of regression. The classical method proceeds by parameterising a covariance function and then inferring its parameters given the training data. In this thesis, the classical approach is augmented by interpreting Gaussian processes as the outputs of linear filters excited by white noise. This enables a straightforward definition of dependent Gaussian processes as the outputs of a multiple output linear filter excited by multiple noise sources. We show how dependent Gaussian processes defined in this way can also be used for the purposes of system identification.

One well-known problem with Gaussian process regression is that the computational complexity scales poorly with the amount of training data. We review one approximate solution that alleviates this problem, namely reduced rank Gaussian processes, and show how the reduced rank approximation can be applied to allow for the efficient computation of dependent Gaussian processes.

We then examine the application of Gaussian processes to other machine learning problems. To do so, we review methods for the parameterisation of full covariance matrices. Furthermore, we discuss how improvements can be made by marginalising over alternative models, and introduce methods to perform these computations efficiently. In particular, we introduce sequential annealed importance sampling as a method for calculating model evidence in an on-line fashion as new data arrives.

Gaussian process regression can also be applied to optimisation. We describe an algorithm that uses model comparison between multiple models to find the optimum of a function while taking as few samples as possible. This algorithm shows impressive performance on the standard control problem of double pole balancing. Finally, we describe how Gaussian processes can be used to efficiently estimate gradients of noisy functions and to numerically estimate integrals.
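For concreteness, the sketch below illustrates the classical approach referred to above: a zero-mean Gaussian process with a squared-exponential covariance function, whose posterior mean and variance are computed from noisy training data. The hyperparameter values, function names, and the toy sine data are illustrative assumptions, not taken from the thesis, which considers more general covariance functions and infers the hyperparameters rather than fixing them.

    import numpy as np

    def sq_exp_cov(x1, x2, lengthscale=1.0, signal_var=1.0):
        # Squared-exponential covariance k(x, x') = s^2 exp(-(x - x')^2 / (2 l^2)).
        d = x1[:, None] - x2[None, :]
        return signal_var * np.exp(-0.5 * (d / lengthscale) ** 2)

    def gp_predict(x_train, y_train, x_test,
                   lengthscale=1.0, signal_var=1.0, noise_var=0.1):
        # Posterior mean and variance of the latent function at the test inputs,
        # for a zero-mean GP prior with Gaussian observation noise.
        K = sq_exp_cov(x_train, x_train, lengthscale, signal_var) \
            + noise_var * np.eye(len(x_train))
        K_s = sq_exp_cov(x_test, x_train, lengthscale, signal_var)
        K_ss = sq_exp_cov(x_test, x_test, lengthscale, signal_var)
        # Cholesky factorisation for numerically stable solves with K.
        L = np.linalg.cholesky(K)
        alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
        mean = K_s @ alpha
        v = np.linalg.solve(L, K_s.T)
        var = np.diag(K_ss) - np.sum(v ** 2, axis=0)
        return mean, var

    # Toy usage: noisy samples of a sine function.
    x_train = np.linspace(0, 2 * np.pi, 20)
    y_train = np.sin(x_train) + 0.1 * np.random.randn(20)
    x_test = np.linspace(0, 2 * np.pi, 100)
    mean, var = gp_predict(x_train, y_train, x_test)

Dependent Gaussian processes and the reduced rank approximation discussed in the thesis replace this single covariance function with covariances constructed from linear filters and low-rank factorisations, but the prediction equations retain the same structure.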
