Information fusion of CNVs and SNPs on gene-gene interactions for molecular subtypes of lymphoma

Although genome-wide association studies report many disease-associated loci involved in pathogenesis, current identified variants only explain a little part of the heritability underlying complex diseases. To explore the other missing part of heritability, data-mining methods are proposed and developed to detect disease-associated interactions between variants. Recently, some studies have revealed the linkage disequilibrium between chromosome structure variations and disease-associated loci. We are motivated to employ a fusion approach that incorporates the information of copy number variations (CNVs) for identifying interactions between single nucleotide polymorphisms (SNPs). The CNV profiles are first used for clustering analysis of disease subtypes, and then the SNP-SNP interactions are examined by the multifactor dimensionality reduction (MDR) method. We applied the fusion approach in analyzing 214 lymphoma cases. The results showed that the interactions identified by the fusion approach were more significantly associated with lymphoma than those identified only by MDR without incorporating CNV information. Therefore, we conclude that information fusion of CNVs and SNPs provides a proper strategy for detecting gene-gene interactions in disease association studies.

[1]  Andreas Ziegler,et al.  On safari to Random Jungle: a fast implementation of Random Forests for high-dimensional data , 2010, Bioinform..

[2]  D. Goldstein Common genetic variation and human traits. , 2009, The New England journal of medicine.

[3]  Jun S. Liu,et al.  Bayesian inference of epistatic interactions in case-control studies , 2007, Nature Genetics.

[4]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[5]  Andreas Ziegler,et al.  On safari to Random Jungle: a fast implementation of Random Forests for high-dimensional data , 2010, Bioinform..

[6]  S. Ogawa,et al.  Frequent inactivation of A20 in B-cell lymphomas , 2009, Nature.

[7]  Nancy J. Cox,et al.  A Study of CNVs As Trait-Associated Polymorphisms and As Expression Quantitative Trait Loci , 2011, PLoS genetics.

[8]  Vicenc Torra,et al.  Information Fusion in Data Mining , 2003 .

[9]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[10]  I. Pe’er,et al.  Ultrafast genome-wide scan for SNP–SNP interactions in common complex disease , 2012, Genome research.

[11]  Qiang Yang,et al.  Predictive rule inference for epistatic interaction detection in genome-wide association studies , 2010, Bioinform..

[12]  Shyh-Huei Chen,et al.  A support vector machine approach for detecting gene‐gene interaction , 2008, Genetic epidemiology.

[13]  B. Fridley Bayesian variable and model selection methods for genetic association studies , 2009, Genetic epidemiology.

[14]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[15]  Allen D. Delaney,et al.  High resolution analysis of follicular lymphoma genomes reveals somatic recurrent sites of copy‐neutral loss of heterozygosity and copy number alterations that target single genes , 2010, Genes, chromosomes & cancer.

[16]  M. Xiong,et al.  Test for interaction between two unlinked loci. , 2006, American journal of human genetics.

[17]  Qiang Yang,et al.  BOOST: A fast approach to detecting gene-gene interactions in genome-wide case-control studies , 2010, American journal of human genetics.

[18]  Hsueh-Wei Chang,et al.  Computational analysis of simulated SNP interactions between 26 growth factor-related genes in a breast cancer association study. , 2011, Omics : a journal of integrative biology.

[19]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[20]  S. Brunak,et al.  Genome-Wide Assessment of the Association of Rare and Common Copy Number Variations to Testicular Germ Cell Cancer , 2013, Front. Endocrin..

[21]  J. Knight Resolving the variable genome and epigenome in human disease , 2012, Journal of internal medicine.

[23]  Christian Gieger,et al.  Six new loci associated with body mass index highlight a neuronal influence on body weight regulation , 2009, Nature Genetics.

[24]  A. Floren,et al.  ' " ' " ' " . " ' " " " " " ' " ' " " " " " : ' " 1 , 2001 .

[25]  Judy H Cho,et al.  Deletion polymorphism upstream of IRGM associated with altered IRGM expression and Crohn's disease , 2008, Nature Genetics.

[26]  Stephen J. Chanock,et al.  Polymorphism Interaction Analysis (PIA): a method for investigating complex gene-gene interactions , 2008, BMC Bioinformatics.

[27]  Karsten M. Borgwardt,et al.  EPIBLASTER-fast exhaustive two-locus epistasis detection strategy using graphical processing units , 2011, European Journal of Human Genetics.

[28]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[29]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[30]  Kevin P. Murphy,et al.  Model-based clustering of array CGH data , 2009, Bioinform..

[31]  B. Maher Personal genomes: The case of the missing heritability , 2008, Nature.