Evaluating latent class analysis models in qualitative phenotype identification

The paper is aimed to investigate the performance of information criteria in selecting latent class analysis models which are often used in research of phenotype identification. Six information criteria and a sample size adjustment (Psychometrika 52 (1987) 333) are compared under various sample sizes and model dimensionalities. The simulation design is particularly meaningful for phenotypic research in practice. Results show that improvements by the sample size adjustment are considerable. In addition, the sample size and model dimensionality effects are found to be influential in the simulation study.

[1]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[2]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[3]  C. Mitchell Dayton,et al.  Model Selection Information Criteria for Non-Nested Latent Class Models , 1997 .

[4]  D. Haughton On the Choice of a Model to Fit Data from an Exponential Family , 1988 .

[5]  M. Woodroofe On Model Selection and the ARC Sine Laws , 1982 .

[6]  S. Sclove Application of model-selection criteria to some problems in multivariate analysis , 1987 .

[7]  B. Everitt A Monte Carlo Investigation of the Likelihood Ratio Test for Number of Classes in Latent Class Analysis. , 1988, Multivariate behavioral research.

[8]  B. G. Quinn,et al.  The determination of the order of an autoregression , 1979 .

[9]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[10]  Murray Aitkin,et al.  Statistical Modelling of Data on Teaching Styles , 1981 .

[11]  C. Clogg Latent Class Models , 1995 .

[12]  Clifford C. Clogg,et al.  Handbook of statistical modeling for the social and behavioral sciences , 1995 .

[13]  D. Rubin,et al.  Estimation and Hypothesis Testing in Finite Mixture Models , 1985 .

[14]  ScienceDirect Computational statistics & data analysis , 1983 .

[15]  P Bolton,et al.  Latent-class analysis of recurrence risks for complex phenotypes with selection and measurement error: a twin and family history study of autism. , 1995, American journal of human genetics.

[16]  N. Martin,et al.  Zygosity Diagnosis in the Absence of Genotypic Data: An Approach Using Latent Class Analysis , 2003, Twin Research.

[17]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[18]  H. Akaike Factor analysis and AIC , 1987 .

[19]  L. Bierut,et al.  Linkage of an alcoholism-related severity phenotype to chromosome 16. , 1998, Alcoholism, clinical and experimental research.

[20]  L. A. Goodman Exploratory latent structure analysis using both identifiable and unidentifiable models , 1974 .

[21]  P. Sullivan,et al.  The subtypes of major depression in a twin registry. , 2002, Journal of affective disorders.

[22]  H. Bozdogan Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions , 1987 .

[23]  B. Everitt A Monte Carlo Investigation Of The Likelihood Ratio Test For The Number Of Components In A Mixture Of Normal Distributions. , 1981, Multivariate behavioral research.

[24]  P L Fidler,et al.  Goodness-of-Fit Testing for Latent Class Models. , 1993, Multivariate behavioral research.

[25]  G A Satten,et al.  Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. , 2001, American journal of human genetics.

[26]  M. Neale A finite mixture distribution model for data collected from twins. , 2003, Twin research : the official journal of the International Society for Twin Studies.