Inference in Mixed Hidden Markov Models and Applications to Medical Studies

The aim of the present paper is to document the need for adapting the definition of hidden Markov models (HMM) to population studies, as well as for corresponding learning methodologies. In this article, mixed hidden Markov models (MHMM) are introduced through a brief state of the art on hidden Markov models and related applications, especially focusing on disease related problems. Making the main assumption that a given pathology can be considered at different stages, hidden Markov models have for example already been used to study epileptic activity or migraine. Mixed-effects hidden Markov models have been newly introduced in the statistical literature. The notion of mixed hidden Markov models is particularly relevant for modeling medical symptoms, but the data complexity generally requires specific care and the available methodology for MHMM is relatively poor. Our new approach can be briefly described as follows. First, we suggest to estimate the population parameters with the SAEM (Stochastic Approximation EM) algorithm, which has the property to converge quickly. The well-known forward recursions developed for HMM allow to compute easily the complete likelihood at each step of the MCMC procedure used within SAEM. Then, for dealing with the individuals, we suggest to estimate each set of individual parameters with the MAP (Maximum A Posteriori) of the parameter distributions. Finally, the hidden state sequences are decoded using the Viterbi algorithm. Some Monte-Carlo experiments are presented to illustrate the accuracy of our algorithms.

[1]  J. Detilleux,et al.  The analysis of disease biomarker data using a mixed hidden Markov model , 2008 .

[2]  A. Petkau,et al.  Application of hidden Markov models to multiple sclerosis lesion count data , 2005, Statistics in medicine.

[3]  B. Leroux Maximum-likelihood estimation for hidden Markov models , 1992 .

[4]  R. Douc,et al.  CONSISTENCY OF THE MAXIMUM LIKELIHOOD ESTIMATOR FOR GENERAL HIDDEN MARKOV MODELS , 2009, 0912.4480.

[5]  F. Carrat,et al.  Monitoring epidemiologic surveillance data using hidden Markov models. , 1999, Statistics in medicine.

[6]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[7]  Analysis of responses in migraine modelling using hidden Markov models , 2007, Statistics in medicine.

[8]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[9]  L. Baum,et al.  An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology , 1967 .

[10]  J A Frank,et al.  Time series for modelling counts from a relapsing-remitting disease: application to modelling disease activity in multiple sclerosis. , 1994, Statistics in medicine.

[11]  P S Albert,et al.  A two-state Markov mixture model for a time series of epileptic seizure counts. , 1991, Biometrics.

[12]  M. Lipsitch,et al.  The analysis of hospital infection data using hidden Markov models. , 2004, Biostatistics.

[13]  R. Douc,et al.  Asymptotics of the maximum likelihood estimator for general hidden Markov models , 2001 .

[14]  E. Kuhn,et al.  Coupling a stochastic approximation version of EM with an MCMC procedure , 2004 .

[15]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.