Latent variable models for longitudinal twin data

The likelihood ratio test of nested models for family data plays an important role in the assessment of genetic and environmental influences on the variation in traits. The test is routinely based on the assumption that the test statistic follows a chi-square distribution under the null, with the number of restricted parameters as degrees of freedom. However, tests of variance components constrained to be non-negative correspond to tests of parameters on the boundary of the parameter space. In this situation the standard test procedure provides too large p-values and the use of the Akaike Information Criterion (AIC) or the Bayesian Information Criterion (BIC) for model selection is problematic. Focusing on the classical ACE twin model for univariate traits, we adapt existing theory to show that the asymptotic distribution for the likelihood ratio statistic is a mixture of chi-square distributions, and we derive the mixing probabilities. We conclude that when testing the AE or the CE model against the ACE model, the p-values obtained from using the v2 (1 df) as the reference distribution should be halved. When the E model is tested against the ACE model, a mixture of v2(0 df), v2(1 df) and v2 (2 df) should be used as the reference distribution, and we provide a simple formula to compute the mixing probabilities. Similar results for tests of the AE, DE and E models against the ADE model are also derived. Failing to use the appropriate reference distribution can lead to invalid conclusions.

[1]  Pranab Kumar Sen,et al.  An appraisal of some aspects of statistical inference under inequality constraints , 2002 .

[2]  N M Laird,et al.  Mixture models for the joint distribution of repeated measures and event times. , 1997, Statistics in medicine.

[3]  Michael C. Neale,et al.  Methodology for Genetic Studies of Twins and Families , 1992 .

[4]  P. Lichtenstein,et al.  The Swedish Twin Registry: a unique resource for clinical, epidemiological and genetic studies , 2002, Journal of internal medicine.

[5]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[6]  Nancy L. Pedersen,et al.  Processing Speed and Longitudinal Trajectories of Change for Cognitive Abilities: The Swedish Adoption/Twin Study of Aging , 2004 .

[7]  S. R. Wilson,et al.  Bias in the estimation of heritability from truncated samples of twins , 1982, Behavior genetics.

[8]  H. Akaike Factor analysis and AIC , 1987 .

[9]  D. Rubin,et al.  Estimation in Covariance Components Models , 1981 .

[10]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[11]  L. Zhao,et al.  Estimating equations for parameters in means and covariances of multivariate discrete and continuous responses. , 1991, Biometrics.

[12]  N M Laird,et al.  Missing data in longitudinal studies. , 1988, Statistics in medicine.

[13]  R. Little Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .

[14]  B. Muthén BEYOND SEM: GENERAL LATENT VARIABLE MODELING , 2002 .

[15]  Carole Dufouil,et al.  Analysis of longitudinal studies with death and drop‐out: a case study , 2004, Statistics in medicine.

[16]  K. Kendler,et al.  Bias in correlations from selected samples of relatives: The effects of soft selection , 1989, Behavior genetics.

[17]  Timothy J. Robinson,et al.  Multilevel Analysis: Techniques and Applications , 2002 .

[18]  D. Falconer,et al.  Introduction to Quantitative Genetics. , 1961 .

[19]  M. Peruggia Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach (2nd ed.) , 2003 .

[20]  H. Chernoff On the Distribution of the Likelihood Ratio , 1954 .

[21]  Joseph L Schafer,et al.  Analysis of Incomplete Multivariate Data , 1997 .

[22]  Jean-François Dartigues,et al.  Random Changepoint Model for Joint Modeling of Cognitive Decline and Dementia , 2006, Biometrics.

[23]  D. Stram,et al.  Variance components testing in the longitudinal mixed effects model. , 1994, Biometrics.

[24]  David W. Bacon,et al.  Estimating the transition between two intersecting straight lines , 1971 .

[25]  John J. McArdle,et al.  Mixed-Effects Variance Components Models for Biometric Family Analyses , 2005, Behavior genetics.

[26]  Geert Molenberghs,et al.  Sensitivity Analysis of Continuous Incomplete Longitudinal Outcomes , 2003 .

[27]  L B Sheiner,et al.  Estimating population kinetics. , 1982, Critical reviews in biomedical engineering.

[28]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[29]  G. Molenberghs,et al.  Linear Mixed Models for Longitudinal Data , 2001 .

[30]  D. Harville Maximum Likelihood Approaches to Variance Component Estimation and to Related Problems , 1977 .

[31]  A. Gelfand,et al.  Hierarchical Bayes Models for the Progression of HIV Infection Using Longitudinal CD4 T-Cell Numbers , 1992 .

[32]  K. Lange Central limit theorems of pedigrees , 1978 .

[33]  P. Sen,et al.  Constrained Statistical Inference: Inequality, Order, and Shape Restrictions , 2001 .

[34]  K. Liang,et al.  Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard Conditions , 1987 .

[35]  J. Hodges,et al.  Counting degrees of freedom in hierarchical and other richly-parameterised models , 2001 .

[36]  D. Rubin,et al.  Multiple Imputation for Nonresponse in Surveys , 1989 .

[37]  F. Vaida,et al.  Conditional Akaike information for mixed-effects models , 2005 .

[38]  John Geweke,et al.  Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments , 1991 .

[39]  L. Skovgaard NONLINEAR MODELS FOR REPEATED MEASUREMENT DATA. , 1996 .

[40]  A. Gelman Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper) , 2004 .

[41]  Y. Pawitan In all likelihood : statistical modelling and inference using likelihood , 2002 .

[42]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[43]  J. Palmgren,et al.  The Influence of Mortality on Twin Models of Change: Addressing Missingness Through Multiple Imputation , 2003, Behavior genetics.

[44]  N. Pedersen,et al.  Sources of Influence on Rate of Cognitive Change Over Time in Swedish Twins: An Application of Latent Growth Models , 2002, Experimental aging research.

[45]  Alan Taylor The consequences of selective participation on behavioral-genetic findings: evidence from simulated and real data. , 2004, Twin research : the official journal of the International Society for Twin Studies.

[46]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[47]  Gareth O. Roberts,et al.  Convergence assessment techniques for Markov chain Monte Carlo , 1998, Stat. Comput..

[48]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[49]  J. Mcardle,et al.  Latent variable growth within behavior genetic models , 1986, Behavior genetics.

[50]  Patrick J Heagerty,et al.  Directly parameterized regression conditioning on being alive: analysis of longitudinal data truncated by deaths. , 2005, Biostatistics.

[51]  James L. Arbuckle,et al.  Full Information Estimation in the Presence of Incomplete Data , 1996 .

[52]  H. D. Patterson,et al.  Recovery of inter-block information when block sizes are unequal , 1971 .

[53]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[54]  Sophia Rabe-Hesketh,et al.  Generalized latent variable models: multilevel, longitudinal, and structural equation models , 2004 .

[55]  P. Diggle,et al.  Analysis of Longitudinal Data. , 1997 .

[56]  R. P. McDonald,et al.  Structural Equations with Latent Variables , 1989 .

[57]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  P. McCullagh,et al.  Generalized Linear Models, 2nd Edn. , 1990 .

[59]  K. Lange,et al.  Extensions to pedigree analysis III. Variance components by the scoring method , 1976, Annals of human genetics.

[60]  G. Mcclearn,et al.  A Quantitative Genetic Analysis of Cognitive Abilities during the Second Half of the Life Span , 1992 .

[61]  P. Sham Statistics in human genetics , 1997 .

[62]  Adrian F. M. Smith,et al.  Sampling-Based Approaches to Calculating Marginal Densities , 1990 .