Multivariate Quantitative Multifactor Dimensionality Reduction for Detecting Gene-Gene Interactions

Objectives: To determine gene-gene interactions and missing heritability of complex diseases is a challenging topic in genome-wide association studies. The multifactor dimensionality reduction (MDR) method is one of the most commonly used methods for identifying gene-gene interactions with dichotomous phenotypes. For quantitative phenotypes, the generalized MDR or quantitative MDR (QMDR) methods have been proposed. These methods are known as univariate methods because they consider only one phenotype. To date, there are few methods for analyzing multiple phenotypes. Methods: To address this problem, we propose a multivariate QMDR method (Multi-QMDR) for multivariate correlated phenotypes. We summarize the multivariate phenotypes into a univariate score by dimensional reduction analysis, and then classify the samples accordingly into high-risk and low-risk groups. We use different ways of summarizing mainly based on the principal components. Multi-QMDR is model-free and easy to implement. Results: Multi-QMDR is applied to lipid-related traits. The properties of Multi- QMDR were investigated through simulation studies. Empirical studies show that Multi-QMDR outperforms existing univariate and multivariate methods at identifying causal interactions. Conclusions: The Multi-QMDR approach improves the performance of QMDR when multiple quantitative phenotypes are available.

[1]  Stephen G. Young,et al.  New wrinkles in lipoprotein lipase biology , 2012, Current opinion in lipidology.

[2]  Jun Zhu,et al.  A generalized combinatorial approach for detecting gene-by-gene and gene-by-environment interactions with application to nicotine dependence. , 2007, American journal of human genetics.

[3]  Angeline S. Andrew,et al.  A novel survival multifactor dimensionality reduction method for detecting gene–gene interactions with application to bladder cancer prognosis , 2010, Human Genetics.

[4]  Jun Zhu,et al.  A combinatorial approach to detecting gene-gene and gene-environment interactions in family studies. , 2008, American journal of human genetics.

[5]  Taesung Park,et al.  Log-linear model-based multifactor dimensionality reduction method to detect gene-gene interactions , 2007, Bioinform..

[6]  Scott M. Williams,et al.  A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction , 2007, Genetic epidemiology.

[7]  Marylyn D. Ritchie,et al.  A General Framework for Formal Tests of Interaction after Exhaustive Search Methods with Applications to MDR and MDR-PDT , 2010, PloS one.

[8]  Jason H. Moore,et al.  GAMETES: a fast, direct algorithm for generating pure, strict, epistatic models with random architectures , 2012, BioData Mining.

[9]  Taesung Park,et al.  A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits , 2009, Nature Genetics.

[10]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[11]  Hiroyuki Kamiguchi Asymmetric Ca2+ signaling and membrane dynamics mediate growth cone guidance , 2007, Neuroscience Research.

[12]  Taesung Park,et al.  Large-scale genome-wide association studies in east Asians identify new genetic loci influencing metabolic traits , 2011, Nature Genetics.

[13]  M. L. Calle,et al.  Model‐Based Multifactor Dimensionality Reduction for detecting epistasis in case–control data in the presence of noise , 2011, Annals of human genetics.

[14]  R. Jennrich,et al.  Unbalanced repeated-measures models with structured covariance matrices. , 1986, Biometrics.

[15]  Taesung Park,et al.  A novel method to identify high order gene-gene interactions in genome-wide association studies: Gene-based MDR , 2012, BMC Bioinformatics.

[16]  Scott M. Williams,et al.  A Simple and Computationally Efficient Approach to Multifactor Dimensionality Reduction Analysis of Gene-Gene Interactions for Quantitative Traits , 2013, PloS one.

[17]  Seungyeoun Lee,et al.  Gene–gene interaction analysis for the survival phenotype based on the Cox model , 2012, Bioinform..

[18]  John C. Chambers,et al.  A Replication Study of GWAS-Derived Lipid Genes in Asian Indians: The Chromosomal Region 11q23.3 Harbors Loci Contributing to Triglycerides , 2012, PloS one.

[19]  Taesung Park,et al.  New evaluation measures for multifactor dimensionality reduction classifiers in gene-gene interaction analysis , 2009, Bioinform..

[20]  M. L. Calle,et al.  FAM-MDR: A Flexible Family-Based Multifactor Dimensionality Reduction Technique to Detect Epistasis Using Related Individuals , 2010, PloS one.

[21]  Jan Albert Kuivenhoven,et al.  The value of HDL genetics , 2008, Current opinion in lipidology.

[22]  Jason H. Moore,et al.  Missing heritability and strategies for finding the underlying causes of complex disease , 2010, Nature Reviews Genetics.

[23]  Tom R. Gaunt,et al.  Gene-centric association signals for lipids and apolipoproteins identified via the HumanCVD BeadChip. , 2009, American journal of human genetics.

[24]  Wei Wang,et al.  Association of the variants in the BUD13-ZNF259 genes and the risk of hyperlipidaemia , 2014, Journal of cellular and molecular medicine.

[25]  Taesung Park,et al.  Identification of multiple gene-gene interactions for ordinal phenotypes , 2013, BMC Medical Genomics.

[26]  Ku Chee Seng,et al.  The success of the genome-wide association approach: a brief story of a long struggle , 2008, European Journal of Human Genetics.

[27]  Sam T. Roweis,et al.  EM Algorithms for PCA and SPCA , 1997, NIPS.

[28]  Taesung Park,et al.  Multivariate generalized multifactor dimensionality reduction to detect gene-gene interactions , 2013, BMC Systems Biology.

[29]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[30]  Mary K. Wojczynski,et al.  Genetic analysis of long-lived families reveals novel variants influencing high density-lipoprotein cholesterol , 2014, Front. Genet..

[31]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[32]  Taesung Park,et al.  Odds ratio based multifactor-dimensionality reduction method for detecting gene – gene interactions , 2006 .

[33]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .