Semiparametric tests for sufficient cause interaction

Summary.  A sufficient cause interaction between two exposures signals the presence of individuals for whom the outcome would occur only under certain values of the two exposures. When the outcome is dichotomous and all exposures are categorical, then, under certain no confounding assumptions, empirical conditions for sufficient cause interactions can be constructed on the basis of the sign of linear contrasts of conditional outcome probabilities between differently exposed subgroups, given confounders. It is argued that logistic regression models are unsatisfactory for evaluating such contrasts, and that Bernoulli regression models with linear link are prone to misspecification. We therefore develop semiparametric tests for sufficient cause interactions under models which postulate probability contrasts in terms of a finite dimensional parameter, but which are otherwise unspecified. Estimation is often not feasible in these models because it would require non‐parametric estimation of auxiliary conditional expectations given high dimensional variables. We therefore develop ‘multiply robust tests’ under a union model which assumes that at least one of several working submodels holds. In the special case of a randomized experiment or a family‐based genetic study in which the joint exposure distribution is known by design or Mendelian inheritance, the procedure leads to asymptotically distribution‐free tests of the null hypothesis of no sufficient cause interaction.

[1]  J. Robins,et al.  Marginal Structural Models for Sufficient Cause Interactions , 2010 .

[2]  Alexander van Geen,et al.  Modification of Risk of Arsenic-Induced Skin Lesions by Sunlight Exposure, Smoking, and Occupational Exposures in Bangladesh , 2006, Epidemiology.

[3]  P. Bickel Efficient and Adaptive Estimation for Semiparametric Models , 1993 .

[4]  H. Cordell Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. , 2002, Human molecular genetics.

[5]  D P Byar,et al.  Tests for homogeneity of effect in an epidemiologic investigation. , 1977, American journal of epidemiology.

[6]  James M Robins,et al.  The Identification of Synergism in the Sufficient-Component-Cause Framework , 2007, Epidemiology.

[7]  Tyler J VanderWeele,et al.  Statistical Applications in Genetics and Molecular Biology Epistatic Interactions , 2011 .

[8]  Mark J van der Laan,et al.  Empirical Efficiency Maximization: Improved Locally Efficient Covariate Adjustment in Randomized Experiments and Survival Analysis , 2008, The international journal of biostatistics.

[9]  F. Parvez,et al.  Health Effects of Arsenic Longitudinal Study (HEALS): Description of a multidisciplinary epidemiologic investigation , 2006, Journal of Exposure Science and Environmental Epidemiology.

[10]  W. Gauderman,et al.  Gene-environment interaction in genome-wide association studies. , 2008, American journal of epidemiology.

[11]  Arnab Maity,et al.  Testing in semiparametric models with interaction, with applications to gene-environment interactions. , 2009, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[12]  Tyler J VanderWeele,et al.  Sufficient Cause Interactions and Statistical Interactions , 2009, Epidemiology.

[13]  J. Robins,et al.  Comment: Performance of Double-Robust Estimators When “Inverse Probability” Weights Are Highly Variable , 2007, 0804.2965.

[14]  Sander Greenland,et al.  Model-based estimation of relative risks and other epidemiologic measures in studies of common outcomes and in case-control studies. , 2004, American journal of epidemiology.

[15]  James M. Robins,et al.  Empirical and counterfactual conditions for sufficient cause interactions , 2008 .

[16]  G. Chamberlain Asymptotic efficiency in estimation with conditional moment restrictions , 1987 .

[17]  M. Davidian,et al.  Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data , 2009, Biometrika.

[18]  Tyler J. VanderWeele,et al.  Empirical tests for compositional epistasis , 2010, Nature Reviews Genetics.

[19]  Christoph Lange,et al.  Testing and Estimating Gene–Environment Interactions in Family‐Based Association Studies , 2008, Biometrics.

[20]  Peter J. Bickel,et al.  INFERENCE FOR SEMIPARAMETRIC MODELS: SOME QUESTIONS AND AN ANSWER , 2001 .

[21]  T. Richardson,et al.  GENERAL THEORY FOR INTERACTIONS IN SUFFICIENT CAUSE MODELS WITH DICHOTOMOUS EXPOSURES. , 2013, Annals of statistics.

[22]  Sander Greenland,et al.  Interactions in Epidemiology: Relevance, Identification, and Estimation , 2009, Epidemiology.

[23]  Zhiqiang Tan Comment: Improved Local Efficiency and Double Robustness , 2008, The international journal of biostatistics.

[24]  Zhiqiang Tan,et al.  A Distributional Approach for Causal Inference Using Propensity Scores , 2006 .

[25]  P. Phillips Epistasis — the essential role of gene interactions in the structure and evolution of genetic systems , 2008, Nature Reviews Genetics.

[26]  S Greenland,et al.  Basic problems in interaction assessment. , 1993, Environmental health perspectives.

[27]  James M. Robins,et al.  Multiply Robust Inference for Statistical Interactions , 2008, Journal of the American Statistical Association.

[28]  James Robins,et al.  The Semiparametric Case‐Only Estimator , 2010, Biometrics.

[29]  S. Vansteelandt,et al.  On model selection and model misspecification in causal inference , 2012, Statistical methods in medical research.

[30]  Tyler J Vanderweele,et al.  Sufficient cause interactions for categorical and ordinal exposures with three levels. , 2010, Biometrika.

[31]  Anders Skrondal,et al.  Interaction as departure from additivity in case-control studies: a cautionary note. , 2003, American journal of epidemiology.

[32]  J S Koopman,et al.  Interaction between discrete causes. , 1981, American journal of epidemiology.