Active Uncertainty Calibration in Bayesian ODE Solvers

There is resurging interest, in statistics and machine learning, in solvers for ordinary differential equations (ODEs) that return probability measures instead of point estimates. Recently, Conrad et al. introduced a sampling-based class of methods that are 'well-calibrated' in a specific sense. But the computational cost of these methods is significantly above that of classic methods. On the other hand, Schober et al. pointed out a precise connection between classic Runge-Kutta ODE solvers and Gaussian filters, which gives only a rough probabilistic calibration, but at negligible cost overhead. By formulating the solution of ODEs as approximate inference in linear Gaussian SDEs, we investigate a range of probabilistic ODE solvers, that bridge the trade-off between computational cost and probabilistic calibration, and identify the inaccurate gradient measurement as the crucial source of uncertainty. We propose the novel filtering-based method Bayesian Quadrature filtering (BQF) which uses Bayesian quadrature to actively learn the imprecision in the gradient measurement by collecting multiple gradient evaluations.

[1]  Michael A. Osborne,et al.  Probabilistic Integration: A Role for Statisticians in Numerical Analysis? , 2015 .

[2]  Ernst Hairer,et al.  Solving Ordinary Differential Equations I: Nonstiff Problems , 2009 .

[3]  E. Hairer,et al.  Solving ordinary differential equations I (2nd revised. ed.): nonstiff problems , 1993 .

[4]  Søren Hauberg,et al.  Probabilistic Solutions to Differential Equations and their Application to Riemannian Statistics , 2013, AISTATS.

[5]  Michael A. Osborne,et al.  Probabilistic Integration , 2015, ArXiv.

[6]  Jouni Hartikainen,et al.  On the relation between Gaussian process quadratures and sigma-point methods , 2015, 1504.05994.

[7]  Marc Peter Deisenroth,et al.  Efficient reinforcement learning using Gaussian processes , 2010 .

[8]  J. Skilling Bayesian Solution of Ordinary Differential Equations , 1992 .

[9]  S. Aachen Stochastic Differential Equations An Introduction With Applications , 2016 .

[10]  Andrew M. Stuart,et al.  Statistical analysis of differential equations: introducing probability measures on numerical solutions , 2016, Statistics and Computing.

[11]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[12]  Jean-Pierre Bourguignon,et al.  Mathematische Annalen , 1893 .

[13]  M. Girolami,et al.  Bayesian Solution Uncertainty Quantification for Differential Equations , 2013 .

[14]  Mark Girolami,et al.  Bayesian Uncertainty Quantification for Differential Equations , 2013 .

[15]  David Duvenaud,et al.  Probabilistic ODE Solvers with Runge-Kutta Means , 2014, NIPS.

[16]  Ioannis Karatzas,et al.  Brownian Motion and Stochastic Calculus , 1987 .

[17]  William H. Press,et al.  Numerical Recipes 3rd Edition: The Art of Scientific Computing , 2007 .

[18]  C. Runge Ueber die numerische Auflösung von Differentialgleichungen , 1895 .

[19]  Simo Särkkä,et al.  Bayesian Filtering and Smoothing , 2013, Institute of Mathematical Statistics textbooks.

[20]  Patrick R. Conrad,et al.  Probability Measures for Numerical Solutions of Differential Equations , 2015, 1506.04592.

[21]  Michael A. Osborne,et al.  Probabilistic Integration: A Role in Statistical Computation? , 2015, Statistical Science.

[22]  Miroslav Simandl,et al.  Bayesian quadrature in nonlinear filtering , 2015, 2015 12th International Conference on Informatics in Control, Automation and Robotics (ICINCO).

[23]  Simo Särkkä,et al.  Recursive Bayesian inference on stochastic differential equations , 2006 .

[24]  Simo Srkk,et al.  Bayesian Filtering and Smoothing , 2013 .