Latent Ignorability and Item Selection for Nursing Home Case-Mix Evaluation

In the social, behavioral, and health sciences it is often of interest to identify latent or unobserved groups in the population with the group membership of the individuals depending on a set of observed variables. In particular, we focus on the field of nursing home assessment in which the response variables typically come from the administration of questionnaires made of categorical items. These types of data may suffer from missing values and the use of lengthy questionnaires may be problematic as a large number of items could have a negative impact on the responses. In such a context, we introduce an extended version of the Latent Class (LC) model aimed at dealing with missing values, by assuming a form of latent ignorability. Moreover, we propose an item selection algorithm, based on the LC model, for finding the smallest subset of items providing an amount of information close to that of the initial set. The proposed approach is illustrated through an application to a dataset collected within an Italian project on the quality-of-life of nursing home patients.

[1]  Francesco Bartolucci,et al.  Item selection by latent class-based methods: an application to nursing home evaluation , 2016, Adv. Data Anal. Classif..

[2]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[3]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[4]  Dimitris Karlis,et al.  Choosing Initial Values for the EM Algorithm for Finite Mixtures , 2003, Comput. Stat. Data Anal..

[5]  B. Muthén,et al.  Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study , 2007 .

[6]  Anton K. Formann Mixture analysis of multivariate categorical data with covariates and missing entries , 2007, Comput. Stat. Data Anal..

[7]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[8]  Linda M. Collins,et al.  Latent class and latent transition analysis , 2009 .

[9]  Paul F. Lazarsfeld,et al.  Latent Structure Analysis. , 1969 .

[10]  Ofer Harel,et al.  Partial and latent ignorability in missing-data problems , 2009 .

[11]  U. Senin,et al.  Health care for older people in Italy: The U.L.I.S.S.E. project (Un Link Informatico sui Servizi Sanitari Esistenti per l’anziano — a computerized network on health care services for older people) , 2009, The journal of nutrition, health & aging.

[12]  Gérard Govaert,et al.  An improvement of the NEC criterion for assessing the number of clusters in a mixture model , 1999, Pattern Recognit. Lett..

[13]  Francesco Bartolucci,et al.  Latent Markov model for longitudinal binary data: An application to the performance evaluation of nursing homes , 2009, 0908.2300.

[14]  J. Hagenaars,et al.  Applied Latent Class Analysis , 2003 .

[15]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[16]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[17]  Christophe Biernacki,et al.  Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models , 2003, Comput. Stat. Data Anal..

[18]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[19]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[20]  J. Copas,et al.  Missing at random, likelihood ignorability and model completeness , 2004, math/0406451.

[21]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[22]  G. Celeux,et al.  An entropy criterion for assessing the number of clusters in a mixture model , 1996 .

[23]  L. Wasserman,et al.  Computing Bayes Factors by Combining Simulation and Asymptotic Approximations , 1997 .

[24]  L. A. Goodman Exploratory latent structure analysis using both identifiable and unidentifiable models , 1974 .

[25]  Nema Dean,et al.  Latent class analysis variable selection , 2010, Annals of the Institute of Statistical Mathematics.

[26]  José G. Dias,et al.  Model Selection for the Binary Latent Class Model: A Monte Carlo Simulation , 2006, Data Science and Classification.

[27]  D. Rubin INFERENCE AND MISSING DATA , 1975 .