Improving strategies for detecting genetic patterns of disease susceptibility in association studies

The analysis of gene interactions and epistatic patterns of susceptibility is especially important for investigating complex diseases such as cancer characterized by the joint action of several genes. This work is motivated by a case-control study of bladder cancer, aimed at evaluating the role of both genetic and environmental factors in bladder carcinogenesis. In particular, the analysis of the inflammation pathway is of interest, for which information on a total of 282 SNPs in 108 genes involved in the inflammatory response is available. Detecting and interpreting interactions with such a large number of polymorphisms is a great challenge from both the statistical and the computational perspectives. In this paper we propose a two-stage strategy for identifying relevant interactions: (1) the use of a synergy measure among interacting genes and (2) the use of the model-based multifactor dimensionality reduction method (MB-MDR), a model-based version of the MDR method, which allows adjustment for confounders.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  Kristel Van Steen,et al.  MB-MDR: Model-Based Multifactor Dimensionality Reduction for detecting interactions in high-dimensional genomic data , 2008 .

[3]  Aidong Zhang,et al.  Information-theoretic metrics for visualizing gene-environment interactions. , 2007, American journal of human genetics.

[4]  P. Corning “The synergism hypothesis”: On the concept of synergy and its role in the evolution of complex systems , 1998 .

[5]  Ingo Ruczinski,et al.  Exploring interactions in high-dimensional genomic data: an overview of logic regression, with applications , 2004 .

[6]  Margaret R Karagas,et al.  Concordance of multiple analytical approaches demonstrates a complex relationship between DNA repair gene SNPs, smoking and bladder cancer susceptibility. , 2006, Carcinogenesis.

[7]  Taesung Park,et al.  Odds ratio based multifactor-dimensionality reduction method for detecting gene – gene interactions , 2006 .

[8]  Ivan Bratko,et al.  Quantifying and Visualizing Attribute Interactions , 2003, ArXiv.

[9]  William J. McGill Multivariate information transmission , 1954, Trans. IRE Prof. Group Inf. Theory.

[10]  Christoph Lange,et al.  Genomic screening and replication using the same data set in family-based association testing , 2005, Nature Genetics.

[11]  D. Anastassiou Computational analysis of the synergy among multiple interacting genes , 2007, Molecular systems biology.

[12]  Jun Zhu,et al.  A generalized combinatorial approach for detecting gene-by-gene and gene-by-environment interactions with application to nicotine dependence. , 2007, American journal of human genetics.

[13]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[14]  N E Day,et al.  Multi-factor dimensionality reduction applied to a large prospective investigation on gene-gene and gene-environment interactions. , 2006, Carcinogenesis.

[15]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[16]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[17]  David M. Miller,et al.  Computational inference of the molecular logic for synaptic connectivity in C. elegans , 2006, ISMB.

[18]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[19]  Todd Holden,et al.  A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility. , 2006, Journal of theoretical biology.

[20]  Ivan Bratko,et al.  Attribute Interactions in Medical Data Analysis , 2003, AIME.

[21]  Jason H. Moore,et al.  Tuning ReliefF for Genome-Wide Genetic Analysis , 2007, EvoBIO.

[22]  Burton H. Singer,et al.  Recursive partitioning in the health sciences , 1999 .

[23]  Scott M. Williams,et al.  Traversing the conceptual divide between biological and statistical epistasis: systems biology and a more modern synthesis. , 2005, BioEssays : news and reviews in molecular, cellular and developmental biology.

[24]  Xifeng Wu,et al.  High-Order Interactions among Genetic Variants in DNA Base Excision Repair Pathway Genes and Smoking in Bladder Cancer Susceptibility , 2007, Cancer Epidemiology Biomarkers & Prevention.

[25]  Te Sun Han,et al.  Multiple Mutual Informations and Multiple Interactions in Frequency Data , 1980, Inf. Control..

[26]  Jason H. Moore,et al.  Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions , 2003, Bioinform..

[27]  J. Gu,et al.  High-order interactions among genetic polymorphisms in nucleotide excision repair pathway genes and smoking in modulating bladder cancer risk. , 2007, Carcinogenesis.

[28]  Paolo Vineis,et al.  DNA Repair Polymorphisms Modify Bladder Cancer Risk: A Multi-factor Analytic Strategy , 2007, Human Heredity.