Data-driven integration of epidemiological and toxicological data to select candidate interacting genes and environmental factors in association with disease

Motivation: Complex diseases, such as Type 2 Diabetes Mellitus (T2D), result from the interplay of environmental and genetic factors. However, most studies investigate either the genetics or the environment, and few study their possible interaction in the context of disease. One key challenge in documenting interactions between genes and environment is choosing which genetic and environmental factors to test jointly. Here, we attempt to address this challenge through a data-driven integration of epidemiological and toxicological studies. Specifically, we derive lists of candidate interacting genetic and environmental factors by integrating findings from genome-wide and environment-wide association studies. Next, we search for evidence of toxicological relationships between these genetic and environmental factors that may have an etiological role in the disease. We illustrate our method by selecting candidate interacting factors for T2D. Contact: abutte@stanford.edu
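
The selection step described above can be sketched as a simple filter. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `candidate_pairs`, the input layout, and all gene and factor identifiers are hypothetical placeholders. It assumes that toxicological relationships are available as (environmental factor, gene) pairs, and it keeps only those pairs in which the gene appears among genome-wide association findings and the factor appears among environment-wide association findings.

```python
def candidate_pairs(gwas_genes, ewas_factors, tox_links):
    """Select candidate interacting (gene, factor) pairs.

    gwas_genes   -- genes implicated by genome-wide association findings
    ewas_factors -- factors implicated by environment-wide association findings
    tox_links    -- iterable of (factor, gene) toxicological relationships
    All three inputs are hypothetical placeholders for illustration.
    """
    genes = set(gwas_genes)
    factors = set(ewas_factors)
    # Keep a toxicological link only if both ends are disease-associated.
    return sorted(
        (gene, factor)
        for factor, gene in tox_links
        if gene in genes and factor in factors
    )


# Toy inputs; the identifiers are made up and carry no biological meaning.
gwas_genes = ["GENE_A", "GENE_B", "GENE_C"]
ewas_factors = ["factor_1", "factor_2"]
tox_links = {
    ("factor_1", "GENE_A"),  # both ends disease-associated: kept
    ("factor_1", "GENE_X"),  # gene not among GWAS findings: dropped
    ("factor_3", "GENE_B"),  # factor not among EWAS findings: dropped
    ("factor_2", "GENE_C"),  # both ends disease-associated: kept
}

pairs = candidate_pairs(gwas_genes, ewas_factors, tox_links)
print(pairs)
```

The pairs returned by such a filter are candidates for joint testing only; a documented toxicological relationship does not by itself establish a gene-environment interaction in disease.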
