Item selection by latent class-based methods: an application to nursing home evaluation

The evaluation of nursing homes is usually based on questionnaires consisting of a large number of polytomous items. In this context, the Latent Class (LC) model is a useful tool for clustering subjects into homogeneous groups that correspond to different degrees of health impairment. It is known that the performance of model-based clustering, and the accuracy with which the number of latent classes is chosen, may be affected by the presence of irrelevant or noise variables. In this paper, we apply an item selection algorithm to real data collected within the ULISSE project on the quality of life of elderly patients hosted in Italian nursing homes. The algorithm, which is closely related to that proposed by Dean and Raftery (2010), aims to find the subset of items that provides the best clustering according to the Bayesian Information Criterion. At the same time, it allows us to select the optimal number of latent classes. Given the complexity of the ULISSE study, we validate the results in two ways: a sensitivity analysis with respect to different specifications of the initial subset of items, and a resampling procedure.
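As an illustration of the BIC-based choice of the number of latent classes mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. It fits an LC model to polytomous items by EM and compares candidate numbers of classes. The function names (`fit_lc`, `bic`, `select_n_classes`, `simulate`), the simulated data, the single random start, and the convention BIC = -2 log L + p log n (lower is better) are all assumptions made for this example; the item selection step itself is not reproduced here.

```python
import numpy as np

def fit_lc(X, n_classes, n_cats, n_iter=200, seed=0):
    """Fit a latent class model by EM (hypothetical helper).
    X is an (n, J) integer array with codes 0..n_cats-1.
    Returns (log-likelihood, class weights, item-response probabilities)."""
    rng = np.random.default_rng(seed)
    n, n_items = X.shape
    onehot = np.eye(n_cats)[X]                       # shape (n, J, C)
    weights = np.full(n_classes, 1.0 / n_classes)
    # Random start: one categorical distribution per (class, item).
    probs = rng.dirichlet(np.ones(n_cats), size=(n_classes, n_items))

    def log_joint(weights, probs):
        # log P(class k) + sum over items of log P(response | class k)
        return np.log(weights) + np.einsum("njc,kjc->nk", onehot, np.log(probs))

    for _ in range(n_iter):
        # E-step: posterior class membership probabilities.
        lj = log_joint(weights, probs)
        post = np.exp(lj - np.logaddexp.reduce(lj, axis=1)[:, None])
        # M-step: lightly smoothed updates keep all parameters positive.
        weights = (post.sum(axis=0) + 1e-6) / (n + n_classes * 1e-6)
        counts = np.einsum("nk,njc->kjc", post, onehot) + 1e-6
        probs = counts / counts.sum(axis=2, keepdims=True)

    loglik = np.logaddexp.reduce(log_joint(weights, probs), axis=1).sum()
    return loglik, weights, probs

def bic(loglik, n, n_classes, n_items, n_cats):
    """BIC in the form -2 log L + p log n (assumed convention, lower is better)."""
    n_params = (n_classes - 1) + n_classes * n_items * (n_cats - 1)
    return -2.0 * loglik + n_params * np.log(n)

def select_n_classes(X, n_cats, candidates=(1, 2, 3)):
    """Return the candidate number of classes with the lowest BIC, plus all scores."""
    n, n_items = X.shape
    scores = {k: bic(fit_lc(X, k, n_cats)[0], n, k, n_items, n_cats)
              for k in candidates}
    return min(scores, key=scores.get), scores

def simulate(n=600, n_items=6, seed=1):
    """Simulated three-category responses from two well-separated classes
    (illustrative data only, unrelated to the ULISSE study)."""
    rng = np.random.default_rng(seed)
    profiles = np.array([[0.8, 0.1, 0.1], [0.1, 0.1, 0.8]])
    z = rng.integers(0, 2, size=n)
    return np.array([rng.choice(3, size=n_items, p=profiles[k]) for k in z])

if __name__ == "__main__":
    X = simulate()
    best_k, scores = select_n_classes(X, n_cats=3)
    print("BIC by number of classes:", scores)
    print("selected number of classes:", best_k)
```

In practice EM is sensitive to starting values, so a real analysis would typically use several random starts per candidate and keep the solution with the highest likelihood; the single start here is only to keep the sketch short.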

[1] J. Vermunt, et al. Latent class cluster analysis, 2002.

[2] Jay Magidson, et al. Latent Class Factor and Cluster Models, Bi-Plots, and Related Graphical Displays, 2001.

[3] B E Fries, et al. Development of the nursing home Resident Assessment Instrument in the USA, 1997, Age and ageing.

[4] L. Ferrucci, et al. Phenotype of frailty: characterization in the women's health and aging studies, 2006, The journals of gerontology. Series A, Biological sciences and medical sciences.

[5] Ofer Harel, et al. Partial and latent ignorability in missing-data problems, 2009.

[6] U. Senin, et al. Health care for older people in Italy: The U.L.I.S.S.E. project (Un Link Informatico sui Servizi Sanitari Esistenti per l’anziano — a computerized network on health care services for older people), 2009, The journal of nutrition, health & aging.

[7] Christophe Biernacki, et al. Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models, 2003, Comput. Stat. Data Anal.

[8] J. Copas, et al. Missing at random, likelihood ignorability and model completeness, 2004, math/0406451.

[9] L. A. Goodman. Exploratory latent structure analysis using both identifiable and unidentifiable models, 1974.

[10] Nema Dean, et al. Latent class analysis variable selection, 2010, Annals of the Institute of Statistical Mathematics.

[11] D. Rubin, et al. Statistical Analysis with Missing Data, 1988.

[12] F. Samejima. Estimation of latent ability using a response pattern of graded scores, 1969.

[13] D. Rubin, et al. Maximum likelihood from incomplete data via the EM algorithm plus discussions on the paper, 1977.

[14] Francesco Bartolucci, et al. A Class of Multidimensional Latent Class IRT Models for Ordinal Polytomous Item Responses, 2012, 1201.4667.

[15] Maria Moran, et al. Syndromes of behavioural and psychological symptoms in mild Alzheimer's disease, 2004, International journal of geriatric psychiatry.

[16] D. Rubin. INFERENCE AND MISSING DATA, 1975.

[17] F. Samejima. Estimation of latent ability using a response pattern of graded scores, 1968.

[18] Adrian E. Raftery, et al. Model-Based Clustering, Discriminant Analysis, and Density Estimation, 2002.

[19] Dimitris Karlis, et al. Choosing Initial Values for the EM Algorithm for Finite Mixtures, 2003, Comput. Stat. Data Anal.

[20] François Béland, et al. Health status transitions in community-living elderly with complex care needs: a latent class approach, 2009, BMC geriatrics.

[21] Neil Henry. Latent structure analysis, 1969.

[22] G. Schwarz. Estimating the Dimension of a Model, 1978.

[23] F. Billari, et al. The Emergence of Lowest‐Low Fertility in Europe During the 1990s, 2002.

[24] Paul F. Lazarsfeld, et al. Latent Structure Analysis, 1969.

[25] Vincenzo Galasso, et al. How Does Ageing Affect the Welfare State, 2007.

[26] Fumiko Samejima, et al. EVALUATION OF MATHEMATICAL MODELS FOR ORDERED POLYCHOTOMOUS RESPONSES, 1996.

[27] Scott L. Zeger, et al. Latent Variable Regression for Multiple Discrete Outcomes, 1997.

[28] Joan Costa-Font, et al. Ageing, health, and health care, 2010.

[29] J. Hagenaars, et al. Applied Latent Class Analysis, 2003.