Inference for covariates that accounts for ascertainment and random genetic effects in family studies

SUMMARY Family studies to identify disease-related genes often collect families with multiple cases. If environmental exposures or other measured covariates are also important, they should be incorporated into these genetic analyses to control for confounding and increase statistical power. We propose a two-level mixed effects model that allows us to estimate environmental effects while accounting for varying genetic correlations among family members and adjusting for ascertainment by conditioning on the number of cases in the family. We describe a conditional maximum likelihood analysis based on this model. When genetic effects are negligible, this conditional likelihood reduces to standard conditional logistic regression. We show that the simpler conditional logistic regression typically yields biased estimators of exposure effects, and we describe conditions under which the conditional logistic approach has little or no bias.

[1]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[2]  Chien-Jen Chen,et al.  Cigarette smoking, alcohol consumption and risk of nasopharyngeal carcinoma in Taiwan , 2004, Cancer Causes & Control.

[3]  D. R. Cox,et al.  The analysis of binary data , 1971 .

[4]  J. Neuhaus Estimation efficiency and tests of covariate effects with clustered binary data. , 1993, Biometrics.

[5]  C. Amos Robust variance-components approach for assessing genetic linkage in pedigrees. , 1994, American journal of human genetics.

[6]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[7]  H. White Maximum Likelihood Estimation of Misspecified Models , 1982 .

[8]  J. Witte,et al.  Asymptotic bias and efficiency in case-control studies of candidate genes and gene-environment interactions: basic family designs. , 1999, American journal of epidemiology.

[9]  M. Gail,et al.  Biased estimates of treatment effect in randomized experiments with nonlinear regressions and omitted covariates , 1984 .

[10]  P. Diggle,et al.  Analysis of Longitudinal Data. , 1997 .

[11]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[12]  J. Houwing-Duistermaat,et al.  Incorporation of family history in logistic regression models. , 1998, Statistics in Medicine.

[13]  P. Levine,et al.  Etiology of nasopharyngeal carcinoma: a review. , 1993, Epidemiologic reviews.

[14]  R. Elston,et al.  Ascertainment adjustment: where does it take us? , 2000, American journal of human genetics.

[15]  J. Kalbfleisch,et al.  A Comparison of Cluster-Specific and Population-Averaged Approaches for Analyzing Correlated Binary Data , 1991 .