Three-step estimation of latent Markov models with covariates

A three-step approach is proposed to estimate latent Markov (LM) models for longitudinal data with and without covariates. The approach is based on a preliminary clustering of sample units on the basis of time-specific responses only, and is particularly useful when a large number of response variables are observed at each time occasion. In such a context, full maximum likelihood estimation, which is typically based on the Expectation-Maximization algorithm, may have some drawbacks, essentially due to the presence of many local maxima of the model likelihood. Moreover, this algorithm may be particularly slow to converge, and may become unstable with complex LM models. The properties of the proposed estimator are illustrated theoretically and by a simulation study in which this estimator is compared with the full likelihood estimator. How reliable standard errors for the three-step parameter estimates are obtained is also shown. The approach is applied to the analysis of a dataset about the health status of elderly people resident in certain Italian nursing homes.

[1]  Marcel Croon,et al.  Estimating Latent Structure Models with Categorical Variables: One-Step Versus Three-Step Estimators , 2004, Political Analysis.

[2]  W. Zucchini,et al.  Hidden Markov Models for Time Series: An Introduction Using R , 2009 .

[3]  Jeroen K. Vermunt,et al.  Latent Class Modeling with Covariates: Two Improved Three-Step Approaches , 2010, Political Analysis.

[4]  Vladimir N Minin,et al.  Fitting and interpreting continuous‐time latent Markov models for panel data , 2013, Statistics in medicine.

[5]  Francesco Bartolucci,et al.  Maximum Likelihood Estimation of an Extended Latent Markov Model for Clustered Binary Panel Data , 2007, Comput. Stat. Data Anal..

[6]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[7]  H. White A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[8]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[9]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[10]  C. Matias,et al.  Identifiability of parameters in latent structure models with many observed variables , 2008, 0809.5032.

[11]  Christopher H. Jackson,et al.  Multi-State Models for Panel Data: The msm Package for R , 2011 .

[12]  Rolf Turner,et al.  Direct maximization of the likelihood of a hidden Markov model , 2008, Comput. Stat. Data Anal..

[13]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[14]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[15]  G. Celeux,et al.  An entropy criterion for assessing the number of clusters in a mixture model , 1996 .

[16]  Ofer Harel,et al.  Partial and latent ignorability in missing-data problems , 2009 .

[17]  Padhraic Smyth,et al.  Model selection for probabilistic clustering using cross-validated likelihood , 2000, Stat. Comput..

[18]  J. Hagenaars,et al.  Applied Latent Class Analysis , 2003 .

[19]  S. T. Buckland,et al.  An Introduction to the Bootstrap. , 1994 .

[20]  Francesco Bartolucci,et al.  Latent Markov Models for Longitudinal Data , 2012 .

[21]  Irene R. R. Lu,et al.  Avoiding and Correcting Bias in Score-Based Latent Variable Regression With Discrete Manifest Items , 2008 .

[22]  U. Senin,et al.  Health care for older people in Italy: The U.L.I.S.S.E. project (Un Link Informatico sui Servizi Sanitari Esistenti per l’anziano — a computerized network on health care services for older people) , 2009, The journal of nutrition, health & aging.

[23]  L. A. Goodman Exploratory latent structure analysis using both identifiable and unidentifiable models , 1974 .

[24]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[25]  G. Molenberghs,et al.  A simple and fast alternative to the EM algorithm for incomplete categorical data and latent class models , 2001 .

[26]  Otis Dudley Duncan,et al.  Panel Analysis: Latent Probability Models for Attitude and Behavior Processes. , 1975 .

[27]  Anton K. Formann Mixture analysis of multivariate categorical data with covariates and missing entries , 2007, Comput. Stat. Data Anal..

[28]  Paul F. Lazarsfeld,et al.  Latent Structure Analysis. , 1969 .

[29]  Jean-Paul Chilès,et al.  Wiley Series in Probability and Statistics , 2012 .

[30]  Jeroen K. Vermunt,et al.  Estimating the Association between Latent Class Membership and External Variables Using Bias-adjusted Three-step Approaches , 2013 .

[31]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[32]  A. Farcomeni,et al.  A Multivariate Extension of the Dynamic Logit Model for Longitudinal Data Based on a Latent Markov Heterogeneity Structure , 2009 .