Multifactor dimendionality reduction analysis for gene-gene interaction of multiple binary traits

Recent advances in genotyping technology have facilitated the use of genome-wide association studies (GWAS) to successfully identify genetic variants that are associated with common complex traits. Following the successes in identification of single variants, joint identification including gene-gene interaction has been studied vigorously and produced many novel results. However, most genome-wide association studies have been conducted by focusing on one trait of interest for identifying genetic variants associated with common complex traits. Since many complex diseases having severe influences on the public health are pleiotropic, simple univariate analysis focusing on a single trait does not well detect full genetic architecture of complex diseases. For example, hyperlipidemia is diagnosed by four multiple traits: Total cholesterol (Tchl), High density lipoprotein (HDL) cholesterol, Low density lipoprotein (LDL), and cholesterol and Triglycerides (TG). Surprisingly, however, only few studies handle multiple traits simultaneously so far. Therefore, in order to improve power and reflect biological association more expansively, we investigate a multivariate approach which considers multiple traits simultaneously. Especially for the gene-gene interaction analysis for the multiple traits, we extend original multifactor dimensionality reduction (MDR) to handle multiple traits. We then demonstrate its superiority to univariate analysis through simulation studies. We confirm that the multivariate approach provides more stable and precise accuracy measures compared to univariate analysis. We applied the multivariate MDR approach to a GWA dataset of 8,842 Korean individuals and detected genetic variants associated with hypertension traits using systolic blood pressure (SBP) and Diastolic blood pressure (DBP).

[1]  Jun Zhu,et al.  A generalized combinatorial approach for detecting gene-by-gene and gene-by-environment interactions with application to nicotine dependence. , 2007, American journal of human genetics.

[2]  Taesung Park,et al.  Multivariate generalized multifactor dimensionality reduction to detect gene-gene interactions , 2013, BMC Systems Biology.

[3]  Jason H. Moore,et al.  Ideal discrimination of discrete clinical endpoints using multilocus genotypes , 2004, Silico Biol..

[4]  J L Haines,et al.  Multifactor dimensionality reduction reveals gene–gene interactions associated with multiple sclerosis susceptibility in African Americans , 2006, Genes and Immunity.

[5]  A. G. Heidema,et al.  The challenge for genetic epidemiologists: how to analyze large numbers of SNPs in relation to complex diseases , 2006, BMC Genetics.

[6]  Scott M. Williams,et al.  New strategies for identifying gene-gene interactions in hypertension , 2002, Annals of medicine.

[7]  Christian Gieger,et al.  Genetic Variants in Novel Pathways Influence Blood Pressure and Cardiovascular Disease Risk , 2011, Nature.

[8]  Charles Rotimi,et al.  A Genome-Wide Association Study of Hypertension and Blood Pressure in African Americans , 2009, PLoS genetics.

[9]  Taesung Park,et al.  A novel method to identify high order gene-gene interactions in genome-wide association studies: Gene-based MDR , 2012, BMC Bioinformatics.

[10]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[11]  Jason H. Moore,et al.  Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions , 2003, Bioinform..

[12]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[13]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[14]  J. H. Moore,et al.  Multifactor-dimensionality reduction shows a two-locus interaction associated with Type 2 diabetes mellitus , 2004, Diabetologia.

[15]  W. Oetting,et al.  Power of multifactor dimensionality reduction and penalized logistic regression for detecting gene-gene Interaction in a case-control study , 2009, BMC Medical Genetics.

[16]  Mee Young Park,et al.  Penalized logistic regression for detecting gene interactions. , 2008, Biostatistics.

[17]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[18]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[19]  Taesung Park,et al.  A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits , 2009, Nature Genetics.

[20]  Peter Kraft,et al.  Exploiting Gene-Environment Interaction to Detect Genetic Associations , 2007, Human Heredity.

[21]  Jason H. Moore,et al.  Power of multifactor dimensionality reduction for detecting gene‐gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity , 2003, Genetic epidemiology.