Nonparametric Adjoint-Based Inference for Stochastic Differential Equations

We develop a nonparametric method to infer the drift and diffusion functions of a stochastic differential equation. With this method, we can build predictive models starting with repeated time series and/or high-dimensional longitudinal data. Typical use of the method includes forecasting the future density or distribution of the variable being measured. The key innovation in our method stems from efficient algorithms to evaluate a likelihood function and its gradient. These algorithms do not rely on sampling, instead, they use repeated quadrature and the adjoint method to enable the inference to scale well as the dimensionality of the parameter vector grows. In simulated data tests, when the number of sample paths is large, the method does an excellent job of inferring drift functions close to the ground truth. We show that even when the method does not infer the drift function correctly, it still yields models with good predictive power. Finally, we apply the method to real data on hourly measurements of ground level ozone, showing that it is capable of reasonable results.

[1]  Harish S. Bhat,et al.  Scalable SDE Filtering and Inference with Apache Spark , 2016, BigMine.

[2]  Johann S. Hawe,et al.  Crowdsourced analysis of clinical trial data to predict amyotrophic lateral sclerosis progression , 2014, Nature Biotechnology.

[3]  Dan Cornford,et al.  Gaussian Process Approximations of Stochastic Differential Equations , 2007, Gaussian Processes in Practice.

[4]  Stefano M. Iacus,et al.  Simulation and Inference for Stochastic Differential Equations: With R Examples , 2008 .

[5]  Hans-Georg Muller,et al.  Empirical dynamics for longitudinal data , 2010, 1211.2630.

[6]  Michael I. Jordan,et al.  Learning with Mixtures of Trees , 2001, J. Mach. Learn. Res..

[7]  Christiane Fuchs,et al.  Inference for Diffusion Processes: With Applications in Life Sciences , 2013 .

[8]  Shuo Chen,et al.  Some recent statistical learning methods for longitudinal high‐dimensional data , 2014, Wiley interdisciplinary reviews. Computational statistics.

[9]  Konrad P Kording,et al.  How advances in neural recording affect data analysis , 2011, Nature Neuroscience.

[10]  Jeffrey S. Simonoff,et al.  RE-EM trees: a data mining approach for longitudinal and clustered data , 2011, Machine Learning.

[11]  N. Wiener The Fourier Integral: and certain of its Applications , 1933, Nature.

[12]  Wing-Keung Wong,et al.  Repeated time series analysis of ARIMA-noise models , 1990 .

[13]  Andreas Ruttor,et al.  Approximate Gaussian process inference for the drift function in stochastic differential equations , 2013, NIPS.

[14]  Jane-ling Wang Nonparametric Regression Analysis of Longitudinal Data , 2005 .

[15]  Zhiliang Ying,et al.  Semiparametric and Nonparametric Regression Analysis of Longitudinal Data , 2001 .

[16]  Christiane Fuchs,et al.  Inference for Diffusion Processes , 2013 .

[17]  Helle S rensen Parametric Inference for Diffusion Processes Observed at Discrete Points in Time: A Survey , 2016 .

[18]  H. Bhat,et al.  Density Tracking by Quadrature for Stochastic Differential Equations , 2016, 1610.09572.

[19]  Harish S. Bhat,et al.  Parameter Inference for Stochastic Differential Equations with Density Tracking by Quadrature , 2015 .

[20]  Harish S. Bhat,et al.  Bayesian Inference of Stochastic Pursuit Models from Basketball Tracking Data , 2016 .

[21]  Marcel Urner Simulation And Inference For Stochastic Differential Equations With R Examples , 2016 .