An effective approach to identify gene-gene interactions for complex quantitative traits using generalized fuzzy accuracy

Multifactor dimensionality reduction (MDR) is originally proposed to identify gene-gene and gene-environment interactions associated with binary traits. Some efforts have been made to extend it to quantitative traits (QTs) and ordinal traits. However these methods are still not computationally efficient or effective. In this paper, we propose Fuzzy Quantitative trait based Ordinal MDR (QOMDR) to strengthen identification of gene-gene interactions associated with a quantitative trait by first transforming it to an ordinal trait and then using a fuzzy balance accuracy measure based on generalized member function of fuzzy sets to select best sets of SNPs as having strong association with the trait. Experimental results on two real datasets show that our algorithm has better consistency and classification accuracy in identifying gene-gene interactions associated with QTs.

[1]  H. Cordell Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. , 2002, Human molecular genetics.

[2]  Taesung Park,et al.  Identification of multiple gene-gene interactions for ordinal phenotypes , 2013, BMC Medical Genomics.

[3]  Ingo Ruczinski,et al.  Identifying interacting SNPs using Monte Carlo logic regression , 2005, Genetic epidemiology.

[4]  Marylyn D. Ritchie,et al.  GPNN: Power studies and applications of a neural network method for detecting gene-gene interactions in studies of human disease , 2006 .

[5]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[6]  Marylyn D Ritchie,et al.  Comparison of approaches for machine‐learning optimization of neural networks for detecting gene‐gene interactions in genetic epidemiology , 2008, Genetic epidemiology.

[7]  Scott M. Williams,et al.  A Simple and Computationally Efficient Approach to Multifactor Dimensionality Reduction Analysis of Gene-Gene Interactions for Quantitative Traits , 2013, PloS one.

[8]  K. Lunetta,et al.  Identifying SNPs predictive of phenotype using random forests , 2005, Genetic epidemiology.

[9]  Shuanglin Zhang,et al.  An ensemble learning approach jointly modeling main and interaction effects in genetic association studies , 2008, Genetic epidemiology.

[10]  Benjamin M. Neale,et al.  Genome-wide association studies in ADHD , 2009, Human Genetics.

[11]  Ingo Wegener,et al.  Detecting high-order interactions of single nucleotide polymorphisms using genetic programming , 2007, Bioinform..

[12]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[13]  Alan Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[14]  Mee Young Park,et al.  Penalized logistic regression for detecting gene interactions. , 2008, Biostatistics.

[15]  C. Sing,et al.  A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation. , 2001, Genome research.

[16]  Shyh-Huei Chen,et al.  A support vector machine approach for detecting gene‐gene interaction , 2008, Genetic epidemiology.

[17]  David V Conti,et al.  A testing framework for identifying susceptibility genes in the presence of epistasis. , 2006, American journal of human genetics.

[18]  Heping Zhang,et al.  Use of classification trees for association studies , 2000, Genetic epidemiology.

[19]  Todd Holden,et al.  A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility. , 2006, Journal of theoretical biology.

[20]  Edward J. Perkins,et al.  Comparison of transcriptional responses in liver tissue and primary hepatocyte cell cultures after exposure to hexahydro-1, 3, 5-trinitro-1, 3, 5-triazine , 2006, BMC Bioinformatics.

[21]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[22]  Jason H Moore,et al.  Computational analysis of gene-gene interactions using multifactor dimensionality reduction , 2004, Expert review of molecular diagnostics.

[23]  Jun Zhu,et al.  A generalized combinatorial approach for detecting gene-by-gene and gene-by-environment interactions with application to nicotine dependence. , 2007, American journal of human genetics.