LMest: an R package for latent Markov models for categorical longitudinal data

Latent Markov (LM) models represent an important class of models for the analysis of longitudinal data (Bartolucci et al., 2013), especially when the response variables are categorical. These models have great potential for application in the analysis of social, medical, and behavioral data, as well as in other disciplines. We propose the R package LMest, which is tailored to deal with these types of models. In particular, we consider a general framework for extended LM models that includes individual covariates and formulates a mixed approach to account for additional dependence structures in the data. These extensions lead to a very flexible class of models, which allows us to fit different types of longitudinal data. Model parameters are estimated through the expectation-maximization algorithm, based on the forward-backward recursions, which is implemented in the main functions of the package. The package also allows us to perform local and global decoding and to obtain standard errors for the parameter estimates. We illustrate its use and most important features through examples involving applications in health and criminology.
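To make the forward recursion and global decoding mentioned above concrete, the following is a minimal sketch for a basic LM model with one categorical response and no covariates. It is written in plain Python rather than R and is not the LMest implementation. The function names and the argument layout (`piv` for initial probabilities, `Pi` for the transition matrix, `Psi` for conditional response probabilities indexed as category by state) are assumptions made for illustration only.

```python
def forward_likelihood(piv, Pi, Psi, y):
    """Likelihood of one response sequence y via the forward recursion.

    piv[u]    : initial probability of latent state u
    Pi[u][v]  : transition probability from state u to state v
    Psi[c][u] : probability of response category c given latent state u
    (all names are illustrative assumptions, not a package API)
    """
    k = len(piv)
    # alpha[u] = P(y_1, ..., y_t, U_t = u), initialised at t = 1
    alpha = [piv[u] * Psi[y[0]][u] for u in range(k)]
    for t in range(1, len(y)):
        alpha = [
            sum(alpha[u] * Pi[u][v] for u in range(k)) * Psi[y[t]][v]
            for v in range(k)
        ]
    # Summing over the last latent state gives the manifest probability of y
    return sum(alpha)


def global_decoding(piv, Pi, Psi, y):
    """Most likely latent state sequence for y (Viterbi algorithm)."""
    k = len(piv)
    # delta[u] = probability of the best latent path ending in state u
    delta = [piv[u] * Psi[y[0]][u] for u in range(k)]
    backpointers = []
    for t in range(1, len(y)):
        pointers, new_delta = [], []
        for v in range(k):
            # Best predecessor of state v at this occasion
            best = max(range(k), key=lambda u: delta[u] * Pi[u][v])
            pointers.append(best)
            new_delta.append(delta[best] * Pi[best][v] * Psi[y[t]][v])
        backpointers.append(pointers)
        delta = new_delta
    # Trace the best path backwards from the most probable final state
    state = max(range(k), key=lambda u: delta[u])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Toy two-state, two-category example with made-up parameter values
piv = [0.5, 0.5]
Pi = [[0.9, 0.1], [0.1, 0.9]]
Psi = [[0.8, 0.2], [0.2, 0.8]]
y = [0, 0, 1, 1]
lik = forward_likelihood(piv, Pi, Psi, y)
path = global_decoding(piv, Pi, Psi, y)
```

In an EM algorithm the forward quantities are combined with the analogous backward quantities to obtain the posterior probabilities of the latent states required by the E-step. Local decoding takes the most probable state at each occasion separately from those posteriors, whereas global decoding, as sketched here, returns the jointly most probable sequence.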
