QTL Mapping Using a Memetic Algorithm with Modifications of BIC as Fitness Function

The problem of locating quantitative trait loci (QTL) for experimental populations can be approached by multiple regression analysis. In this context variable selection using a modification of the Bayesian Information Criterion (mBIC) has been well established in the past. In this article a memetic algorithm (MA) is introduced to find the model which minimizes the selection criterion. Apart from mBIC also a second modification (mBIC2) is considered, which has the property of controlling the false discovery rate. Given the Bayesian nature of our selection criteria, we are not only interested in finding the best model, but also in computing marker posterior probabilities using all models visited by MA. In a simulation study MA (with mBIC and mBIC2) is compared with a parallel genetic algorithm (PGA) which has been previously suggested for QTL mapping. It turns out that MA in combination with mBIC2 performs best, where determining QTL positions based on marker posterior probabilities yields even better results than using the best model selected by MA. Finally we consider a real data set from the literature and show that MA can also be extended to multiple interval mapping, which potentially increases the precision with which the exact location of QTLs can be estimated.

[1]  Mu Zhu,et al.  Darwinian Evolution in Parallel Universes: A Parallel Genetic Algorithm for Variable Selection , 2006, Technometrics.

[2]  Jiahua Chen,et al.  Extended Bayesian information criteria for model selection with large model spaces , 2008 .

[3]  Andreas Baierl,et al.  On Locating Multiple Interacting Quantitative Trait Loci in Intercross Designs , 2006, Genetics.

[4]  Malgorzata Bogdan,et al.  Modified versions of the Bayesian Information Criterion for sparse Generalized Linear Models , 2011, Comput. Stat. Data Anal..

[5]  Malgorzata Bogdan,et al.  Modified versions of Bayesian Information Criterion for genome-wide association studies , 2012, Comput. Stat. Data Anal..

[6]  Ying Zhou Multiple Interval Mapping for Quantitative Trait Loci via EM Algorithm in the Presence of Crossover Interference , 2010 .

[7]  Rebecca W. Doerge,et al.  Statistical issues in the search for genes affecting quantitative traits in experimental populations , 1997 .

[8]  Pablo Moscato,et al.  On Evolution, Search, Optimization, Genetic Algorithms and Martial Arts : Towards Memetic Algorithms , 1989 .

[9]  Rongling Wu,et al.  Statistical Genetics of Quantitative Traits: Linkage, Maps and QTL , 2007 .

[10]  M. Lund,et al.  Detection and modelling of time-dependent QTL in animal populations , 2008, Genetics Selection Evolution.

[11]  J. Ghosh,et al.  Modifying the Schwarz Bayesian Information Criterion to Locate Multiple Interacting Quantitative Trait Loci , 2004, Genetics.

[12]  J. Namkung,et al.  Identification of expression quantitative trait loci by the interaction analysis using genetic algorithm , 2007, BMC proceedings.

[13]  A. Nishi,et al.  QTL analysis of measures of mouse home-cage activity using B6/MSM consomic strains , 2010, Mammalian Genome.

[14]  Malgorzata Bogdan,et al.  Asymptotic Bayes optimality under sparsity for generally distributed effect sizes under the alternative , 2010, 1005.4753.

[15]  Andreas Baierl,et al.  Locating Multiple Interacting Quantitative Trait Loci Using Rank-Based Model Selection , 2007, Genetics.

[16]  Z. Zeng,et al.  Multiple interval mapping for quantitative trait loci. , 1999, Genetics.

[17]  E. Lander,et al.  Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. , 1989, Genetics.

[18]  Yuehua Cui,et al.  A Systems Biology Approach for Identifying Novel Pathway Regulators in eQTL Mapping , 2010, Journal of biopharmaceutical statistics.

[19]  Siuli Mukhopadhyay,et al.  Variable selection method for quantitative trait analysis based on parallel genetic algorithm , 2010, Annals of human genetics.

[20]  Jayanta K. Ghosh,et al.  Selecting explanatory variables with the modified version of the Bayesian information criterion , 2008, Qual. Reliab. Eng. Int..

[21]  Andreas Baierl,et al.  Locating multiple interacting quantitative trait loci using robust model selection , 2007, Comput. Stat. Data Anal..

[22]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[23]  Z B Zeng,et al.  Theoretical basis for separation of multiple linked gene effects in mapping quantitative trait loci. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[24]  H Kishino,et al.  Detection of closely linked multiple quantitative trait loci using a genetic algorithm. , 2001, Genetics.

[25]  C H Kao,et al.  On the differences between maximum likelihood and regression interval mapping in the analysis of quantitative trait loci. , 2000, Genetics.

[26]  Karl W. Broman,et al.  A model selection approach for the identification of quantitative trait loci in experimental crosses , 2002 .

[27]  R. Wu,et al.  Functional mapping — how to map and study the genetic architecture of dynamic complex traits , 2006, Nature Reviews Genetics.

[28]  G. Hadjipavlou,et al.  Age-dependent quantitative trait loci affecting growth traits in Scottish Blackface sheep. , 2009, Animal genetics.

[29]  C. Haley,et al.  A simple regression method for mapping quantitative trait loci in line crosses using flanking markers , 1992, Heredity.

[30]  Rongling Wu,et al.  Score Statistics for Mapping Quantitative Trait Loci , 2009, Statistical applications in genetics and molecular biology.

[31]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[32]  J. Ghosh,et al.  Extending the Modified Bayesian Information Criterion (mBIC) to Dense Markers and Multiple Interval Mapping , 2008, Biometrics.

[33]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[34]  K. Broman,et al.  A Guide to QTL Mapping with R/qtl , 2009 .

[35]  Claudia Czado,et al.  Locating Multiple Interacting Quantitative Trait Loci with the Zero-Inflated Generalized Poisson Regression , 2010, Statistical applications in genetics and molecular biology.

[36]  Hao Wu,et al.  R/qtlbim: QTL with Bayesian Interval Mapping in experimental crosses , 2007, Bioinform..

[37]  R. Jansen,et al.  Interval mapping of multiple quantitative trait loci. , 1993, Genetics.

[38]  Ina Hoeschele,et al.  Nonparametric Bayesian Variable Selection With Applications to Multiple Quantitative Trait Loci Mapping With Epistasis and Gene–Environment Interaction , 2010, Genetics.

[39]  B. Kinghorn,et al.  The use of a genetic algorithm for simultaneous mapping of multiple interacting quantitative trait loci. , 2000, Genetics.

[40]  E. Lander,et al.  Complete multipoint sib-pair analysis of qualitative and quantitative traits. , 1995, American journal of human genetics.

[41]  Z B Zeng,et al.  Genetic architecture of a morphological shape difference between two Drosophila species. , 2000, Genetics.