Causal Latent Markov Model for the Comparison of Multiple Treatments in Observational Longitudinal Studies

We extend to the longitudinal setting a latent class approach that was recently introduced by Lanza, Coffman, and Xu to estimate the causal effect of a treatment. The proposed approach enables an evaluation of multiple treatment effects on subpopulations of individuals from a dynamic perspective, as it relies on a latent Markov (LM) model that is estimated taking into account propensity score weights based on individual pretreatment covariates. These weights enter the likelihood function of the LM model and allow us to balance the groups receiving different treatments. This likelihood function is maximized through a modified version of the traditional expectation–maximization algorithm, while standard errors for the parameter estimates are obtained by a nonparametric bootstrap method. We study in detail the asymptotic properties of the causal effect estimator based on the maximization of this likelihood function, and we illustrate its finite sample properties through a series of simulations showing that the estimator has the expected behavior. As an illustration, we consider an application aimed at assessing the relative effectiveness of certain degree programs on the basis of three ordinal response variables, in which the work path of a graduate is considered as the manifestation of his or her human capital level over time.
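The role of the propensity score weights in the likelihood can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several simplifying assumptions that are not in the abstract: a single categorical response per occasion (the application uses three ordinal responses), treatment-specific initial and transition probabilities, emission probabilities shared across treatments, weights taken as the inverse of the estimated probability of the treatment actually received and normalized to sum to the sample size, and propensity scores that have already been estimated. All function and argument names (`ip_weights`, `lm_loglik_one`, `weighted_loglik`) are hypothetical.

```python
import numpy as np


def ip_weights(propensity, treatment):
    """Inverse-probability weights for multiple treatments (assumed form).

    propensity: (n, n_treatments) estimated P(treatment | covariates).
    treatment:  (n,) integer index of the treatment actually received.
    Returns weights normalized to sum to n.
    """
    n = len(treatment)
    w = 1.0 / propensity[np.arange(n), treatment]
    return w * n / w.sum()


def lm_loglik_one(y, init, trans, emis):
    """Log-likelihood of one response sequence under a latent Markov model.

    Uses the forward recursion with rescaling to avoid underflow.
    y:     (T,) integer response categories.
    init:  (k,) initial probabilities of the latent states.
    trans: (k, k) transition matrix, rows sum to 1.
    emis:  (k, c) conditional response probabilities given the state.
    """
    alpha = init * emis[:, y[0]]
    loglik = 0.0
    for t in range(1, len(y)):
        scale = alpha.sum()
        loglik += np.log(scale)
        alpha = ((alpha / scale) @ trans) * emis[:, y[t]]
    return loglik + np.log(alpha.sum())


def weighted_loglik(Y, treatment, weights, init, trans, emis):
    """Weighted log-likelihood: sum_i w_i * log f(y_i | z_i).

    init:  (n_treatments, k) treatment-specific initial probabilities.
    trans: (n_treatments, k, k) treatment-specific transition matrices.
    """
    total = 0.0
    for y, z, w in zip(Y, treatment, weights):
        total += w * lm_loglik_one(y, init[z], trans[z], emis)
    return total
```

In an EM scheme, the same weights would multiply each individual's posterior expected counts in the M-step; the sketch only evaluates the objective function and leaves out the maximization and the bootstrap.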
