Likelihood Ratio Tests in Behavioral Genetics: Problems and Solutions

The likelihood ratio test of nested models for family data plays an important role in the assessment of genetic and environmental influences on the variation in traits. The test is routinely based on the assumption that the test statistic follows a chi-square distribution under the null, with the number of restricted parameters as degrees of freedom. However, tests of variance components constrained to be non-negative correspond to tests of parameters on the boundary of the parameter space. In this situation the standard test procedure provides too large p-values and the use of the Akaike Information Criterion (AIC) or the Bayesian Information Criterion (BIC) for model selection is problematic. Focusing on the classical ACE twin model for univariate traits, we adapt existing theory to show that the asymptotic distribution for the likelihood ratio statistic is a mixture of chi-square distributions, and we derive the mixing probabilities. We conclude that when testing the AE or the CE model against the ACE model, the p-values obtained from using the χ2(1 df) as the reference distribution should be halved. When the E model is tested against the ACE model, a mixture of χ2(0 df), χ2(1 df) and χ2(2 df) should be used as the reference distribution, and we provide a simple formula to compute the mixing probabilities. Similar results for tests of the AE, DE and E models against the ADE model are also derived. Failing to use the appropriate reference distribution can lead to invalid conclusions.

[1]  E. Lehmann Testing Statistical Hypotheses , 1960 .

[2]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[3]  K. Liang,et al.  Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard Conditions , 1987 .

[4]  H. Akaike Factor analysis and AIC , 1987 .

[5]  Michael C. Neale,et al.  Methodology for Genetic Studies of Twins and Families , 1992 .

[6]  G. Mcclearn,et al.  A Quantitative Genetic Analysis of Cognitive Abilities during the Second Half of the Life Span , 1992 .

[7]  S. T. Buckland,et al.  An Introduction to the Bootstrap. , 1994 .

[8]  D. Stram,et al.  Variance components testing in the longitudinal mixed effects model. , 1994, Biometrics.

[9]  P. Sham Statistics in human genetics , 1997 .

[10]  E. Oord Estimating effects of latent and measured genotypes in multilevel models. , 2001 .

[11]  Mariza de Andrade,et al.  Comparison of Multivariate Tests for Genetic Linkage , 2001, Human Heredity.

[12]  Pak Chung Sham,et al.  Analytic approaches to twin data using structural equation models , 2002, Briefings Bioinform..

[13]  D. Ruppert,et al.  Likelihood ratio tests in linear mixed models with one variance component , 2003 .

[14]  Sophia Rabe-Hesketh,et al.  Generalized latent variable models: multilevel, longitudinal, and structural equation models , 2004 .

[15]  Nancy L. Pedersen,et al.  Processing Speed and Longitudinal Trajectories of Change for Cognitive Abilities: The Swedish Adoption/Twin Study of Aging , 2004 .

[16]  N. Pedersen,et al.  Quantitative genetic analysis of latent growth curve models of cognitive abilities in adulthood. , 2005, Developmental psychology.