Gaussian Process Regression for Binned Data

Many datasets are in the form of tables of binned data. Performing regression on these data usually involves either reading off bin heights, ignoring data from neighbouring bins or interpolating between bins thus over or underestimating the true bin integrals. In this paper we propose an elegant method for performing Gaussian Process (GP) regression given such binned data, allowing one to make probabilistic predictions of the latent function which produced the binned data. We look at several applications. First, for differentially private regression; second, to make predictions over other integrals; and third when the input regions are irregularly shaped collections of polytopes. In summary, our method provides an effective way of analysing binned data such that one can use more information from the histogram representation, and thus reconstruct a more useful and precise density for making predictions.

[1]  Aaron Roth,et al.  The Algorithmic Foundations of Differential Privacy , 2014, Found. Trends Theor. Comput. Sci..

[2]  David Hsu,et al.  Computing the largest inscribed isothetic rectangle , 1995, CCCG.

[3]  Dan Roth,et al.  Finding the Largest Area Axis-parallel Rectangle in a Polygon , 1997, Comput. Geom..

[4]  Daniela Marinescu,et al.  COVERING WITH RECTANGULAR PIECES , 2003 .

[5]  Lynne Billard,et al.  Principal component analysis for histogram-valued data , 2017, Adv. Data Anal. Classif..

[6]  Noel A Cressie,et al.  Some topics in convolution-based spatial modeling , 2007 .

[7]  P. Stein A Note on the Volume of a Simplex , 1966 .

[8]  Hans Raj Tiwary,et al.  Largest inscribed rectangles in convex polygons , 2012, J. Discrete Algorithms.

[9]  Kristjan Aû,et al.  Comprising Prior Knowledge in Dynamic Gaussian Process Models , 2005 .

[10]  A. O'Hagan,et al.  Bayes–Hermite quadrature , 1991 .

[11]  P. Kyriakidis A Geostatistical Framework for Area-to-Point Spatial Interpolation , 2004 .

[12]  Anthony O'Hagan,et al.  Uncertainty in prior elicitations: a nonparametric approach , 2007 .

[13]  Otfried Cheong,et al.  Finding largest rectangles in convex polygons , 2016, Comput. Geom..

[14]  Jeremy E. Oakley,et al.  Nonparametric elicitation for heavy-tailed prior distributions , 2007 .

[15]  Neil D. Lawrence,et al.  Latent Force Models , 2009, AISTATS.

[16]  Aki Vehtari,et al.  Gaussian processes with monotonicity information , 2010, AISTATS.