Bayesian methods for quantitative trait loci mapping based on model selection: approximate analysis using the Bayesian information criterion.

We describe an approximate method for the analysis of quantitative trait loci (QTL) based on model selection from multiple regression models with trait values regressed on marker genotypes, using a modification of the easily calculated Bayesian information criterion to estimate the posterior probability of models with various subsets of markers as variables. The BIC-delta criterion, with the parameter delta increasing the penalty for additional variables in a model, is further modified to incorporate prior information, and missing values are handled by multiple imputation. Marginal probabilities for model sizes are calculated, and the posterior probability of nonzero model size is interpreted as the posterior probability of existence of a QTL linked to one or more markers. The method is demonstrated on analysis of associations between wood density and markers on two linkage groups in Pinus radiata. Selection bias, which is the bias that results from using the same data to both select the variables in a model and estimate the coefficients, is shown to be a problem for commonly used non-Bayesian methods for QTL mapping, which do not average over alternative possible models that are consistent with the data.

[1]  C. Schön,et al.  Bias and Sampling Error of the Estimated Proportion of Genotypic Variance Explained by Quantitative Trait Loci Determined From Experimental Data in Maize Using Cross Validation and Validation With Independent Samples. , 2000, Genetics.

[2]  M. Sillanpää,et al.  Bayesian mapping of multiple quantitative trait loci from incomplete outbred offspring data. , 1999, Genetics.

[3]  David A. Stephens,et al.  BAYESIAN ANALYSIS OF QUANTITATIVE TRAIT LOCUS DATA USING REVERSIBLE JUMP MARKOV CHAIN MONTE CARLO , 1998 .

[4]  A. Melchinger,et al.  Quantitative trait locus (QTL) mapping using different testers and independent population samples in maize reveals low power of QTL detection and large bias in estimates of QTL effects. , 1998, Genetics.

[5]  Allan R. Wilks,et al.  The new S language: a programming environment for data analysis and graphics , 1988 .

[6]  D. Madigan,et al.  Bayesian Model Averaging for Linear Regression Models , 1997 .

[7]  E. Lander,et al.  Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. , 1989, Genetics.

[8]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[9]  M. Sillanpää,et al.  Bayesian mapping of multiple quantitative trait loci from incomplete inbred line cross data. , 1998, Genetics.

[10]  J. Friedman,et al.  A Statistical View of Some Chemometrics Regression Tools , 1993 .

[11]  R. Kohn,et al.  Nonparametric regression using Bayesian variable selection , 1996 .

[12]  P. Wilcox,et al.  Multiple-marker mapping of wood density loci in an outbred pedigree of radiata pine , 2000, Theoretical and Applied Genetics.

[13]  Rebecca W. Doerge,et al.  Statistical issues in the search for genes affecting quantitative traits in experimental populations , 1997 .

[14]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[15]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[16]  I. Hoeschele,et al.  Advances in statistical methods to map quantitative trait loci in outbred populations. , 1997, Genetics.

[17]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[18]  D. Rubin,et al.  Multiple Imputation for Interval Estimation from Simple Random Samples with Ignorable Nonresponse , 1986 .

[19]  James O. Berger,et al.  Statistical Analysis and the Illusion of Objectivity , 1988 .

[20]  S. Heath Markov chain Monte Carlo segregation and linkage analysis for oligogenic models. , 1997, American journal of human genetics.

[21]  H. Young,et al.  Statistical analysis relating analytical and consumer panel assessments of kiwifruit flavour compounds in a model juice base , 1998 .

[22]  M A Newton,et al.  A bayesian approach to detect quantitative trait loci using Markov chain Monte Carlo. , 1996, Genetics.

[23]  C. Haley,et al.  A simple regression method for mapping quantitative trait loci in line crosses using flanking markers , 1992, Heredity.

[24]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[25]  A. Paterson,et al.  Molecular dissection of quantitative traits: progress and prospects. , 1995, Genome research.

[26]  P M Visscher,et al.  Confidence intervals in QTL mapping by bootstrapping. , 1996, Genetics.