Empirical Bayesian LASSO-logistic regression for multiple binary trait locus mapping

BackgroundComplex binary traits are influenced by many factors including the main effects of many quantitative trait loci (QTLs), the epistatic effects involving more than one QTLs, environmental effects and the effects of gene-environment interactions. Although a number of QTL mapping methods for binary traits have been developed, there still lacks an efficient and powerful method that can handle both main and epistatic effects of a relatively large number of possible QTLs.ResultsIn this paper, we use a Bayesian logistic regression model as the QTL model for binary traits that includes both main and epistatic effects. Our logistic regression model employs hierarchical priors for regression coefficients similar to the ones used in the Bayesian LASSO linear model for multiple QTL mapping for continuous traits. We develop efficient empirical Bayesian algorithms to infer the logistic regression model. Our simulation study shows that our algorithms can easily handle a QTL model with a large number of main and epistatic effects on a personal computer, and outperform five other methods examined including the LASSO, HyperLasso, BhGLM, RVM and the single-QTL mapping method based on logistic regression in terms of power of detection and false positive rate. The utility of our algorithms is also demonstrated through analysis of a real data set. A software package implementing the empirical Bayesian algorithms in this paper is freely available upon request.ConclusionsThe EBLASSO logistic regression method can handle a large number of effects possibly including the main and epistatic QTL effects, environmental effects and the effects of gene-environment interactions. It will be a very useful tool for multiple QTLs mapping for complex binary traits.

[1]  R W Doerge,et al.  Model Selection in Binary Trait Locus Mapping , 2005, Genetics.

[2]  C. Hackett,et al.  Genetic mapping of quantitative trait loci for traits with ordinal distributions. , 1995, Biometrics.

[3]  Shizhong Xu,et al.  Fast empirical Bayesian LASSO for multiple quantitative trait locus mapping , 2011, BMC Bioinformatics.

[4]  H. Cordell,et al.  SNP Selection in Genome-Wide and Candidate Gene Studies via Penalized Logistic Regression , 2010, Genetic epidemiology.

[5]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[6]  Shizhong Xu Estimating polygenic effects using markers of the entire genome. , 2003, Genetics.

[7]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[8]  Shizhong Xu,et al.  Bayesian Shrinkage Estimation of Quantitative Trait Loci Parameters , 2005, Genetics.

[9]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[10]  Nengjun Yi,et al.  An Efficient Bayesian Model Selection Approach for Interacting Quantitative Trait Loci Models With Many Effects , 2007, Genetics.

[11]  Jianqing Fan,et al.  Sure independence screening in generalized linear models with NP-dimensionality , 2009, The Annals of Statistics.

[12]  Shizhong Xu,et al.  Joint Mapping of Quantitative Trait Loci for Multiple Binary Characters , 2005, Genetics.

[13]  Subburaman Mohan,et al.  Analysis of gene expression in the wound repair/regeneration process , 2001, Mammalian Genome.

[14]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[15]  David J. C. MacKay,et al.  The Evidence Framework Applied to Classification Networks , 1992, Neural Computation.

[16]  Wim E Crusio AN INTRODUCTION TO QUANTITATIVE GENETICS , 1998 .

[17]  Zehua Chen,et al.  Mixture Generalized Linear Models for Multiple Interval Mapping of Quantitative Trait Loci in Experimental Crosses , 2009, Biometrics.

[18]  B T Kunimoto,et al.  Growth factors in wound healing: the next great innovation? , 1999, Ostomy/wound management.

[19]  Fei Zou,et al.  Bayesian Multiple Quantitative Trait Loci Mapping for Complex Traits Using Markers of the Entire Genome , 2007, Genetics.

[20]  Nengjun Yi,et al.  An EM algorithm for mapping binary disease loci: application to fibrosarcoma in a four-way cross mouse family. , 2003, Genetical research.

[21]  Jianqing Fan,et al.  Sure independence screening for ultrahigh dimensional feature space , 2006, math/0612857.

[22]  W. Atchley,et al.  Mapping quantitative trait loci for complex binary diseases using line crosses. , 1996, Genetics.

[23]  Trevor J. Hastie,et al.  Genome-wide association analysis by lasso penalized logistic regression , 2009, Bioinform..

[24]  Shizhong Xu,et al.  An expectation–maximization algorithm for the Lasso estimation of quantitative trait locus effects , 2010, Heredity.

[25]  Bradley P. Carlin,et al.  Bayesian Methods for Data Analysis , 2008 .

[26]  Michael E. Tipping,et al.  Fast Marginal Likelihood Maximisation for Sparse Bayesian Models , 2003 .

[27]  Zhao-Bang Zeng,et al.  Multiple-Interval Mapping for Ordinal Traits , 2006, Genetics.

[28]  Shizhong Xu,et al.  An EM algorithm for mapping quantitative resistance loci , 2005, Heredity.

[29]  Nengjun Yi,et al.  Hierarchical Generalized Linear Models for Multiple Quantitative Trait Locus Mapping , 2009, Genetics.

[30]  C. Hoggart,et al.  Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies , 2008, PLoS genetics.

[31]  Shizhong Xu,et al.  An Empirical Bayes Method for Estimating Epistatic Effects of Quantitative Trait Loci , 2007, Biometrics.

[32]  N. Yi,et al.  Bayesian mapping of quantitative trait loci for complex binary traits. , 2000, Genetics.

[33]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevan e Ve tor Ma hine , 2001 .

[34]  Nengjun Yi,et al.  Mapping quantitative trait loci with epistatic effects. , 2002, Genetical research.

[35]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[36]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[37]  J. Griffin,et al.  BAYESIAN HYPER‐LASSOS WITH NON‐CONVEX PENALIZATION , 2011 .

[38]  M. Sillanpää,et al.  Bayesian mapping of genotype × expression interactions in quantitative and qualitative traits , 2006, Heredity.

[39]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[40]  C. Cockerham,et al.  An Extension of the Concept of Partitioning Hereditary Variance for Analysis of Covariances among Relatives When Epistasis Is Present. , 1954, Genetics.

[41]  W Y Zhang,et al.  Discussion on `Sure independence screening for ultra-high dimensional feature space' by Fan, J and Lv, J. , 2008 .

[42]  Nengjun Yi,et al.  Mapping Multiple Quantitative Trait Loci for Ordinal Traits , 2004, Behavior genetics.

[43]  D. Falconer,et al.  Introduction to Quantitative Genetics. , 1961 .

[44]  Jiahan Li,et al.  Bayesian functional mapping of dynamic quantitative traits , 2011, Theoretical and Applied Genetics.

[45]  C. Haley,et al.  A simple regression method for mapping quantitative trait loci in line crosses using flanking markers , 1992, Heredity.

[46]  S. Mohan,et al.  Identification of wound healing/regeneration quantitative trait loci (QTL) at multiple time points that explain seventy percent of variance in (MRL/MpJ and SJL/J) mice F2 population. , 2001, Genome research.

[47]  Z. Q. John Lu,et al.  Bayesian methods for data analysis, third edition , 2010 .

[48]  R. N. Curnow,et al.  Estimating the locations and the sizes of the effects of quantitative trait loci using flanking markers , 1992, Theoretical and Applied Genetics.

[49]  Rongling Wu,et al.  Statistical Genetics of Quantitative Traits: Linkage, Maps and QTL , 2007 .

[50]  Shizhong Xu,et al.  Mapping quantitative trait loci for ordered categorical traits in four-way crosses , 1998, Heredity.

[51]  Zhaohai Li,et al.  A Logistic Regression Mixture Model for Interval Mapping of Genetic Trait Loci Affecting Binary Phenotypes , 2006, Genetics.

[52]  N. Yi,et al.  Bayesian LASSO for Quantitative Trait Loci Mapping , 2008, Genetics.

[53]  Kunimoto Bt Growth factors in wound healing: the next great innovation? , 1999 .

[54]  Nengjun Yi,et al.  Bayesian Mapping of Genomewide Interacting Quantitative Trait Loci for Ordinal Traits , 2007, Genetics.