Regression models for twin studies: a critical review.

Twin studies have long been recognized for their value in learning about the aetiology of disease and specifically for their potential for separating genetic effects from environmental effects. The recent upsurge of interest in life-course epidemiology and the study of developmental influences on later health has provided a new impetus to study twins as a source of unique insights. Twins are of special interest because they provide naturally matched pairs where the confounding effects of a large number of potentially causal factors (such as maternal nutrition or gestation length) may be removed by comparisons between twins who share them. The traditional tool of epidemiological 'risk factor analysis' is the regression model, but it is not straightforward to transfer standard regression methods to twin data, because the analysis needs to reflect the paired structure of the data, which induces correlation between twins. This paper reviews the use of more specialized regression methods for twin data, based on generalized least squares or linear mixed models, and explains the relationship between these methods and the commonly used approach of analysing within-twin-pair difference values. Methods and issues of interpretation are illustrated using an example from a recent study of the association between birth weight and cord blood erythropoietin. We focus on the analysis of continuous outcome measures but review additional complexities that arise with binary outcomes. We recommend the use of a general model that includes separate regression coefficients for within-twin-pair and between-pair effects, and provide guidelines for the interpretation of estimates obtained under this model.

[1]  Bernard Rosner,et al.  Birth weight and risk of cardiovascular disease in a cohort of women followed up since 1976 , 1997, BMJ.

[2]  J B Carlin,et al.  Analysis of binary outcomes in longitudinal studies using weighted estimating equations and discrete-time survival methods: prevalence and incidence of smoking in an adolescent cohort. , 1999, Statistics in medicine.

[3]  P. Lichtenstein,et al.  Low birthweight and Type 2 diabetes: a study on 11 162 Swedish twins. , 2004, International journal of epidemiology.

[4]  J. Carlin,et al.  Using Bivariate Models to Understand between‐ and within‐Cluster Regression Coefficients, with Application to Twin Data , 2006, Biometrics.

[5]  T. Cole,et al.  Fetal origins of adult disease—the hypothesis revisited , 1999, BMJ.

[6]  Sander Greenland,et al.  An overview of relations among causal modelling methods. , 2002, International journal of epidemiology.

[7]  A Gelman,et al.  A case study on the choice, interpretation and checking of multilevel models for longitudinal binary outcomes. , 2001, Biostatistics.

[8]  K. Christensen,et al.  Do genetic factors contribute to the association between birth weight and blood pressure? , 2001, Journal of epidemiology and community health.

[9]  B. D. De Stavola,et al.  Separating within and between effects in family studies: an application to the study of blood pressure in children , 2004, Statistics in medicine.

[10]  J. Kalbfleisch,et al.  Between- and within-cluster covariate effects in the analysis of clustered data. , 1998, Biometrics.

[11]  F. Rasmussen,et al.  Fetal growth and systolic blood pressure in young adulthood: the Swedish Young Male Twins Study. , 2002, Paediatric and perinatal epidemiology.

[12]  G. Mcneill,et al.  The role of genetic and environmental factors in the association between birthweight and blood pressure: evidence from meta-analysis of twin studies. , 2004, International journal of epidemiology.

[13]  M. Lesperance,et al.  Estimation efficiency in a binary mixed-effects model setting , 1996 .

[14]  J. Hanley,et al.  Statistical analysis of correlated data using generalized estimating equations: an orientation. , 2003, American journal of epidemiology.

[15]  J M Neuhaus,et al.  Statistical methods for longitudinal and clustered designs with binary responses , 1992, Statistical methods in medical research.

[16]  A. Scott,et al.  The Effect of Two-Stage Sampling on Ordinary Least Squares Methods , 1982 .

[17]  F B Hu,et al.  Comparison of population-averaged and subject-specific approaches for analyzing repeated binary outcomes. , 1998, American journal of epidemiology.

[18]  Tim J Cole Modeling postnatal exposures and their interactions with birth size. , 2004, The Journal of nutrition.

[19]  J. Hopper,et al.  Association of birth weight and current body size to blood pressure in female twins. , 2001, Twin research : the official journal of the International Society for Twin Studies.

[20]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[21]  Geraldine McNeill,et al.  Blood pressure in relation to birth weight in twins and singleton controls matched for gestational age. , 2003, American journal of epidemiology.

[22]  Michael K Parides,et al.  Separation of individual‐level and cluster‐level covariate effects in regression analysis of correlated data , 2003, Statistics in medicine.

[23]  T. Dwyer,et al.  Within pair association between birth weight and blood pressure at age 8 in twins from a cohort study , 1999, BMJ.

[24]  R. Berk Regression Analysis: A Constructive Critique , 2003 .

[25]  D. Leon,et al.  The foetal origins of adult disease: interpreting the evidence from twin studies. , 2001, Twin research : the official journal of the International Society for Twin Studies.

[26]  T. Dwyer,et al.  Twins and fetal origins hypothesis: within-pair analyses , 2002, The Lancet.

[27]  E. Seeman,et al.  The bone density of female twins discordant for tobacco use. , 1994, The New England journal of medicine.

[28]  A. Thapar,et al.  Methodology for Genetic Studies of Twins and Families , 1993 .

[29]  M. Klebanoff,et al.  Differences in birth weight and blood pressure at age 7 years among twins. , 2001, American journal of epidemiology.

[30]  P. McKeigue,et al.  Reduced fetal growth rate and increased risk of death from ischaemic heart disease: cohort study of 15 000 Swedish men and women born 1915-29 , 1998, BMJ.

[31]  J. Carlin,et al.  Association between Erythropoietin in Cord Blood of Twins and Size at Birth: Does It Relate to Gestational Factors or to Factors during Labor or Delivery? , 2005, Pediatric Research.