An Empirical Bayes Method for Estimating Epistatic Effects of Quantitative Trait Loci

The genetic variance of a quantitative trait is often controlled by the segregation of multiple interacting loci. Linear model regression analysis is usually applied to estimating and testing effects of these quantitative trait loci (QTL). Including all the main effects and the effects of interaction (epistatic effects), the dimension of the linear model can be extremely high. Variable selection via stepwise regression or stochastic search variable selection (SSVS) is the common procedure for epistatic effect QTL analysis. These methods are computationally intensive, yet they may not be optimal. The LASSO (least absolute shrinkage and selection operator) method is computationally more efficient than the above methods. As a result, it has been widely used in regression analysis for large models. However, LASSO has never been applied to genetic mapping for epistatic QTL, where the number of model effects is typically many times larger than the sample size. In this study, we developed an empirical Bayes method (E-BAYES) to map epistatic QTL under the mixed model framework. We also tested the feasibility of using LASSO to estimate epistatic effects, examined the fully Bayesian SSVS, and reevaluated the penalized likelihood (PENAL) methods in mapping epistatic QTL. Simulation studies showed that all the above methods performed satisfactorily well. However, E-BAYES appears to outperform all other methods in terms of minimizing the mean-squared error (MSE) with relatively short computing time. Application of the new method to real data was demonstrated using a barley dataset.

[1]  C. R. Henderson ESTIMATION OF VARIANCE AND COVARIANCE COMPONENTS , 1953 .

[2]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[3]  Patrick M. Hayes,et al.  Regions of the genome that affect agronomic performance in two-row barley , 1996 .

[4]  H. Chipman,et al.  Bayesian CART Model Search , 1998 .

[5]  H. D. Patterson,et al.  Recovery of inter-block information when block sizes are unequal , 1971 .

[6]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[7]  D. Harville Maximum Likelihood Approaches to Variance Component Estimation and to Related Problems , 1977 .

[8]  M. Yuan,et al.  Efficient Empirical Bayes Variable Selection and Estimation in Linear Models , 2005 .

[9]  Daniel S. Bernstein,et al.  Rational Matrix Functions and Rank-1 Updates , 2000, SIAM J. Matrix Anal. Appl..

[10]  A. Gelman Analysis of variance: Why it is more important than ever? , 2005, math/0504499.

[11]  J. Ghosh,et al.  Modifying the Schwarz Bayesian Information Criterion to Locate Multiple Interacting Quantitative Trait Loci , 2004, Genetics.

[12]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo conver-gence diagnostics: a comparative review , 1996 .

[13]  Nengjun Yi,et al.  Bayesian model choice and search strategies for mapping interacting quantitative trait Loci. , 2003, Genetics.

[14]  Cajo J F ter Braak,et al.  Extending Xu's Bayesian Model for Estimating Polygenic Effects Using Markers of the Entire Genome , 2005, Genetics.

[15]  G. Casella,et al.  The Effect of Improper Priors on Gibbs Sampling in Hierarchical Linear Mixed Models , 1996 .

[16]  Dean Phillips Foster,et al.  Calibration and empirical Bayes variable selection , 2000 .

[17]  Nengjun Yi,et al.  Stochastic search variable selection for identifying multiple quantitative trait loci. , 2003, Genetics.

[18]  S. Xu,et al.  A penalized maximum likelihood method for estimating epistatic effects of QTL , 2005, Heredity.

[19]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[20]  Shizhong Xu,et al.  Bayesian Shrinkage Estimation of Quantitative Trait Loci Parameters , 2005, Genetics.

[21]  G. Churchill,et al.  A statistical framework for quantitative trait mapping. , 2001, Genetics.

[22]  Shizhong Xu Estimating polygenic effects using markers of the entire genome. , 2003, Genetics.