Efficient Learning of Continuous-Time Hidden Markov Models for Disease Progression

The Continuous-Time Hidden Markov Model (CT-HMM) is an attractive approach to modeling disease progression due to its ability to describe noisy observations arriving irregularly in time. However, the lack of an efficient parameter learning algorithm for CT-HMM restricts its use to very small models or requires unrealistic constraints on the state transitions. In this paper, we present the first complete characterization of efficient EM-based learning methods for CT-HMM models. We demonstrate that the learning problem consists of two challenges: the estimation of posterior state probabilities and the computation of end-state conditioned statistics. We solve the first challenge by reformulating the estimation problem in terms of an equivalent discrete time-inhomogeneous hidden Markov model. The second challenge is addressed by adapting three approaches from the continuous time Markov chain literature to the CT-HMM domain. We demonstrate the use of CT-HMMs with more than 100 states to visualize and predict disease progression using a glaucoma dataset and an Alzheimer's disease dataset.

[1]  Daphne Koller,et al.  Expectation Maximization and Complex Duration Distributions for Continuous Time Bayesian Networks , 2005, UAI.

[2]  James M. Rehg,et al.  Longitudinal Modeling of Glaucoma Progression Using 2-Dimensional Continuous-Time Hidden Markov Model , 2013, MICCAI.

[3]  A. Hobolth,et al.  Statistical Applications in Genetics and Molecular Biology Statistical Inference in Evolutionary Models of DNA Sequences via the EM Algorithm , 2011 .

[4]  S. Kingman Glaucoma is second leading cause of blindness globally. , 2004, Bulletin of the World Health Organization.

[5]  M. Bladt,et al.  Statistical inference for discretely observed Markov jump processes , 2005 .

[6]  Xiang Wang,et al.  Unsupervised learning of disease progression models , 2014, KDD.

[7]  A. Fagan,et al.  Decreased cerebrospinal fluid Aβ42 correlates with brain atrophy in cognitively normal elderly , 2009, Annals of neurology.

[8]  N. Bartolomeo,et al.  Progression of liver cirrhosis to HCC: an application of hidden Markov model , 2011, BMC medical research methodology.

[9]  Asger Hobolth,et al.  Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains , 2011, BMC Bioinformatics.

[10]  N. Higham Functions of Matrices: Theory and Computation (Other Titles in Applied Mathematics) , 2008 .

[11]  Philipp Metzner,et al.  Generator estimation of Markov jump processes based on incomplete observations nonequidistant in time. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Nicholas J. Higham,et al.  Functions of matrices - theory and computation , 2008 .

[13]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[14]  G. Wollstein,et al.  Optical coherence tomography longitudinal evaluation of retinal nerve fiber layer thickness in glaucoma. , 2005, Archives of ophthalmology.

[15]  J. Leiva-Murillo,et al.  Visualization and Prediction of Disease Interactions with Continuous-Time Hidden Markov Models , 2011 .

[16]  C. Loan Computing integrals involving the matrix exponential , 1978 .

[17]  Robert N Weinreb,et al.  Combining structural and functional measurements to improve estimates of rates of glaucomatous progression. , 2012, American journal of ophthalmology.

[18]  David R. Cox,et al.  The Theory of Stochastic Processes , 1967, The Mathematical Gazette.

[19]  Lindsey S. Folio,et al.  Retinal nerve fibre layer and visual function loss in glaucoma: the tipping point , 2011, British Journal of Ophthalmology.

[20]  Christof Schütte,et al.  Generator estimation of Markov jump processes , 2007, J. Comput. Phys..

[21]  Christopher H. Jackson,et al.  Multi-State Models for Panel Data: The msm Package for R , 2011 .

[22]  A. Hobolth,et al.  Summary Statistics for Endpoint-Conditioned Continuous-Time Markov Chains , 2011, Journal of Applied Probability.

[23]  Jens C. Pruessner,et al.  Operationalizing protocol differences for EADC-ADNI manual hippocampal segmentation , 2015, Alzheimer's & Dementia.