Efficient Inference of Gaussian-Process-Modulated Renewal Processes with Application to Medical Event Data

The episodic, irregular and asynchronous nature of medical data render them difficult substrates for standard machine learning algorithms. We would like to abstract away this difficulty for the class of time-stamped categorical variables (or events) by modeling them as a renewal process and inferring a probability density over non-parametric longitudinal intensity functions that modulate the process. Several methods exist for inferring such a density over intensity functions, but either their constraints prevent their use with our potentially bursty event streams, or their time complexity renders their use intractable on our long-duration observations of high-resolution events, or both. In this paper we present a new efficient and flexible inference method that uses direct numeric integration and smooth interpolation over Gaussian processes. We demonstrate that our direct method is up to twice as accurate and two orders of magnitude more efficient than the best existing method (thinning). Importantly, our direct method can infer intensity functions over the full range of bursty to memoryless to regular events, which thinning and many other methods cannot do. Finally, we apply the method to clinical event data and demonstrate a simple example application facilitated by the abstraction.

[1]  T. Lasko,et al.  Computational Phenotype Discovery Using Unsupervised Feature Learning over Noisy, Sparse, and Irregular Clinical Data , 2013, PloS one.

[2]  J. Møller,et al.  Log Gaussian Cox Processes , 1998 .

[3]  Carl E. Rasmussen,et al.  Bayesian Monte Carlo , 2002, NIPS.

[4]  Ryan P. Adams,et al.  Slice sampling covariance hyperparameters of latent Gaussian models , 2010, NIPS.

[5]  P. Lewis Recent results in the statistical analysis of univariate point processes , 1971 .

[6]  David Page,et al.  Forest-Based Point Process for Event Prediction from Electronic Health Records , 2013, ECML/PKDD.

[7]  John P. Cunningham,et al.  Inferring Neural Firing Rates from Spike Trains Using Gaussian Processes , 2007, NIPS.

[8]  Mark Berman,et al.  Inhomogeneous and modulated gamma processes , 1981 .

[9]  Yee Whye Teh,et al.  Gaussian process modulated renewal processes , 2011, NIPS.

[10]  Puyang Xu,et al.  A Model for Temporal Dependencies in Event Streams , 2011, NIPS.

[11]  V. Rokhlin,et al.  A fast algorithm for the inversion of general Toeplitz matrices , 2004 .

[12]  Ryan P. Adams,et al.  Tractable nonparametric Bayesian inference in Poisson processes with Gaussian process intensities , 2009, ICML '09.

[13]  P. Bosdogianni,et al.  Singular-Value Decomposition (SVD) , 2011, Encyclopedia of Parallel Computing.

[14]  H. Ramlau-Hansen Smoothing Counting Process Intensities by Means of Kernel Functions , 1983 .

[15]  Ankur Parikh,et al.  Conjoint Modeling of Temporal Dependencies in Event Streams , 2012, BMA.