Robustness of inference on measured covariates to misspecification of genetic random effects in family studies

Family studies to identify disease‐related genes frequently collect only families with multiple cases. It is often desirable to determine if risk factors that are known to influence disease risk in the general population also play a role in the study families. If so, these factors should be incorporated into the genetic analysis to control for confounding. Pfeiffer et al. [2001 Biometrika 88: 933–948] proposed a variance components or random effects model to account for common familial effects and for different genetic correlations among family members. After adjusting for ascertainment, they found maximum likelihood estimates of the measured exposure effects. Although it is appealing that this model accounts for genetic correlations as well as for the ascertainment of families, in order to perform an analysis one needs to specify the distribution of random genetic effects. The current work investigates the robustness of the proposed model with respect to various misspecifications of genetic random effects in simulations. When the true underlying genetic mechanism is polygenic with a small dominant component, or Mendelian with low allele frequency and penetrance, the effects of misspecification on the estimation of fixed effects in the model are negligible. The model is applied to data from a family study on nasopharyngeal carcinoma in Taiwan. Genet Epidemiol 24:14–23, 2003. © 2003 Wiley‐Liss, Inc.

[1]  C. J. Chen,et al.  Occupational exposure to wood, formaldehyde, and solvents and risk of nasopharyngeal carcinoma. , 2001, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[2]  D. Thomas,et al.  Bias and efficiency in family-based gene-characterization studies: conditional, prospective, retrospective, and joint likelihoods. , 2000, American journal of human genetics.

[3]  Regina C. Elandt-Johnson,et al.  Probability models and statistical methods in genetics , 1972 .

[4]  G. Bonney,et al.  Regressive logistic models for familial disease and other binary traits. , 1986, Biometrics.

[5]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[6]  T A Louis,et al.  Random effects models with non-parametric priors. , 1992, Statistics in medicine.

[7]  G. Verbeke,et al.  The effect of misspecifying the random-effects distribution in linear mixed models for longitudinal data , 1997 .

[8]  B. Muthén,et al.  Finite Mixture Modeling with Mixture Outcomes Using the EM Algorithm , 1999, Biometrics.

[9]  Mitchell H. Gail,et al.  Inference for covariates that accounts for ascertainment and random genetic effects in family studies , 2001 .

[10]  P. Heagerty,et al.  Misspecified maximum likelihood estimates and generalised linear mixed models , 2001 .

[11]  S. Stacey,et al.  Antibodies to Epstein‐Barr virus thymidine kinase: A characteristic marker for the serological detection of nasopharyngeal carcinoma , 2001, International journal of cancer.

[12]  J. Kalbfleisch,et al.  Between- and within-cluster covariate effects in the analysis of clustered data. , 1998, Biometrics.

[13]  J. Kalbfleisch,et al.  The effects of mixture distribution misspecification when fitting mixed-effects logistic models , 1992 .

[14]  T J O'Neill,et al.  Truncated logistic regression. , 1995, Biometrics.

[15]  D. Bishop BRCA1 and BRCA2 and breast cancer incidence: a review. , 1999, Annals of oncology : official journal of the European Society for Medical Oncology.

[16]  P. Levine,et al.  Etiology of nasopharyngeal carcinoma: a review. , 1993, Epidemiologic reviews.

[17]  Marie Davidian,et al.  Consequences of misspecifying assumptions in nonlinear mixed effects models , 2000 .