Robust discovery of genetic associations incorporating gene-environment interaction and independence.

This article considers the detection and evaluation of genetic effects incorporating gene-environment interaction and independence. Whereas ordinary logistic regression cannot exploit the assumption of gene-environment independence, the proposed approach makes explicit use of the independence assumption to improve estimation efficiency. This method, which uses both cases and controls, fits a constrained retrospective regression in which the genetic variant plays the role of the response variable, and the disease indicator and the environmental exposure are the independent variables. The regression model constrains the association of the environmental exposure with the genetic variant among the controls to be null, thus explicitly encoding the gene-environment independence assumption, which yields substantial gain in accuracy in the evaluation of genetic effects. The proposed retrospective regression approach has several advantages. It is easy to implement with standard software, and it readily accounts for multiple environmental exposures of a polytomous or of a continuous nature, while easily incorporating extraneous covariates. Unlike the profile likelihood approach of Chatterjee and Carroll (Biometrika. 2005;92:399-418), the proposed method does not require a model for the association of a polytomous or continuous exposure with the disease outcome, and, therefore, it is agnostic to the functional form of such a model and completely robust to its possible misspecification.

[1]  Jack A. Taylor,et al.  Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies. , 1994, Statistics in medicine.

[2]  P S Albert,et al.  Limitations of the case-only design for identifying gene-environment interactions. , 2001, American journal of epidemiology.

[3]  R. Pyke,et al.  Logistic disease incidence models and case-control studies , 1979 .

[4]  S Wacholder,et al.  Parity, oral contraceptives, and the risk of ovarian cancer among carriers and noncarriers of a BRCA1 or BRCA2 mutation. , 2001, The New England journal of medicine.

[5]  C R Weinberg,et al.  Designing and analysing case-control studies to exploit independence of genotype and exposure. , 1997, Statistics in medicine.

[6]  Peter Kraft,et al.  Exploiting Gene-Environment Interaction to Detect Genetic Associations , 2007, Human Heredity.

[7]  M. Hernán,et al.  Case‐only gene‐environment interaction studies: when does association imply mechanistic interaction? , 2010, Genetic epidemiology.

[8]  N. Breslow,et al.  Statistics in Epidemiology : The Case-Control Study , 2008 .

[9]  A. Morris,et al.  American Journal of Epidemiology Practice of Epidemiology a Comparison of Sample Size and Power in Case-only Association Studies of Gene-environment Interaction , 2022 .

[10]  Peter Kraft,et al.  Using principal components of genetic variation for robust and powerful detection of gene-gene interactions in case-control and case-only studies. , 2010, American journal of human genetics.

[11]  Nilanjan Chatterjee,et al.  Semiparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies , 2005 .

[12]  Eric J Tchetgen Tchetgen,et al.  On the robustness of tests of genetic associations incorporating gene-environment interaction when the environmental exposure is misspecified. , 2011, Epidemiology.