Detection of epistatic effects with logic regression and a classical linear regression model

Abstract To locate multiple interacting quantitative trait loci (QTL) influencing a trait of interest within experimental populations, usually methods as the Cockerham’s model are applied. Within this framework, interactions are understood as the part of the joined effect of several genes which cannot be explained as the sum of their additive effects. However, if a change in the phenotype (as disease) is caused by Boolean combinations of genotypes of several QTLs, this Cockerham’s approach is often not capable to identify them properly. To detect such interactions more efficiently, we propose a logic regression framework. Even though with the logic regression approach a larger number of models has to be considered (requiring more stringent multiple testing correction) the efficient representation of higher order logic interactions in logic regression models leads to a significant increase of power to detect such interactions as compared to a Cockerham’s approach. The increase in power is demonstrated analytically for a simple two-way interaction model and illustrated in more complex settings with simulation study and real data analysis.

[1]  H. Deng,et al.  Bayesian mapping of quantitative trait loci for multiple complex traits with the use of variance components. , 2007, American journal of human genetics.

[2]  Z. Zeng Precision mapping of quantitative trait loci. , 1994, Genetics.

[3]  Holger Schwender,et al.  Importance Measures for Epistatic Interactions in Case‐Parent Trios , 2011, Annals of human genetics.

[4]  R. Jansen,et al.  University of Groningen High Resolution of Quantitative Traits Into Multiple Loci via Interval Mapping , 2022 .

[5]  J. Ott,et al.  Neural network analysis of complex traits , 1997, Genetic epidemiology.

[6]  C. Haley,et al.  A simple regression method for mapping quantitative trait loci in line crosses using flanking markers , 1992, Heredity.

[7]  G. Churchill,et al.  New quantitative trait loci that contribute to cholesterol gallstone formation detected in an intercross of CAST/Ei and 129S1/SvImJ inbred mice. , 2003, Physiological genomics.

[8]  Carolin Strobl,et al.  Statistical Applications in Genetics and Molecular Biology Multiple Testing for SNP-SNP Interactions , 2007 .

[9]  Ingo Ruczinski,et al.  Logic Regression — Methods and Software , 2003 .

[10]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[11]  G. Churchill,et al.  A statistical framework for quantitative trait mapping. , 2001, Genetics.

[12]  D. Clayton Prediction and Interaction in Complex Disease Genetics: Experience in Type 1 Diabetes , 2009, PLoS genetics.

[13]  Ingo Ruczinski,et al.  Identifying interacting SNPs using Monte Carlo logic regression , 2005, Genetic epidemiology.

[14]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[15]  Multiple-Interval Mapping for Quantitative Trait Loci With a Spike in the Trait Distribution , 2009, Genetics.

[16]  R. Ball,et al.  Bayesian methods for quantitative trait loci mapping based on model selection: approximate analysis using the Bayesian information criterion. , 2001, Genetics.

[17]  Claudia Czado,et al.  Locating Multiple Interacting Quantitative Trait Loci with the Zero-Inflated Generalized Poisson Regression , 2010, Statistical applications in genetics and molecular biology.

[18]  Zehua Chen,et al.  Mixture Generalized Linear Models for Multiple Interval Mapping of Quantitative Trait Loci in Experimental Crosses , 2009, Biometrics.

[19]  Kerrie L. Mengersen,et al.  Methods for Identifying SNP Interactions: A Review on Variations of Logic Regression, Random Forest and Bayesian Logistic Regression , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[20]  Katja Ickstadt,et al.  Comparing Logic Regression Based Methods for Identifying SNP Interactions , 2007, BIRD.

[21]  Chris S. Haley,et al.  Epistasis: too often neglected in complex trait studies? , 2004, Nature Reviews Genetics.

[22]  Andreas Baierl,et al.  On Locating Multiple Interacting Quantitative Trait Loci in Intercross Designs , 2006, Genetics.

[23]  L. Penrose,et al.  THE CORRELATION BETWEEN RELATIVES ON THE SUPPOSITION OF MENDELIAN INHERITANCE , 2022 .

[24]  N. Yi,et al.  Bayesian mapping of quantitative trait loci for complex binary traits. , 2000, Genetics.

[25]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[26]  Holger Schwender,et al.  Identification of SNP interactions using logic regression. , 2008, Biostatistics.

[27]  J. Ghosh,et al.  Extending the Modified Bayesian Information Criterion (mBIC) to Dense Markers and Multiple Interval Mapping , 2008, Biometrics.

[28]  H. Cordell Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. , 2002, Human molecular genetics.

[29]  Hao Wu,et al.  R/qtl: QTL Mapping in Experimental Crosses , 2003, Bioinform..

[30]  W. Bateson Mendel's Principles of Heredity , 1910, Nature.

[31]  E. Lander,et al.  Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. , 1989, Genetics.

[32]  Arjun K. Gupta The Theory of Linear Models and Multivariate Analysis , 1981 .

[33]  C. Cockerham,et al.  An Extension of the Concept of Partitioning Hereditary Variance for Analysis of Covariances among Relatives When Epistasis Is Present. , 1954, Genetics.

[34]  Ingo Ruczinski,et al.  Exploring interactions in high-dimensional genomic data: an overview of logic regression, with applications , 2004 .

[35]  Hao Wu,et al.  R/qtlbim: QTL with Bayesian Interval Mapping in experimental crosses , 2007, Bioinform..

[36]  W. Atchley,et al.  Mapping quantitative trait loci for complex binary diseases using line crosses. , 1996, Genetics.

[37]  D Siegmund,et al.  Statistical methods for mapping quantitative trait loci from a dense set of markers. , 1999, Genetics.

[38]  Z. Zeng,et al.  Multiple interval mapping for quantitative trait loci. , 1999, Genetics.

[39]  Z. Zeng,et al.  Modeling epistasis of quantitative trait loci using Cockerham's model. , 2002, Genetics.

[40]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[41]  R. Doerge Multifactorial genetics: Mapping and analysis of quantitative trait loci in experimental populations , 2002, Nature Reviews Genetics.