jModelTest: phylogenetic model averaging.

jModelTest is a new program for the statistical selection of models of nucleotide substitution based on "Phyml" (Guindon and Gascuel 2003. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 52:696-704.). It implements 5 different selection strategies, including "hierarchical and dynamical likelihood ratio tests," the "Akaike information criterion," the "Bayesian information criterion," and a "decision-theoretic performance-based" approach. This program also calculates the relative importance and model-averaged estimates of substitution parameters, including a model-averaged estimate of the phylogeny. jModelTest is written in Java and runs under Mac OSX, Windows, and Unix systems with a Java Runtime Environment installed. The program, including documentation, can be freely downloaded from the software section at http://darwin.uvigo.es.

[1]  H. Munro,et al.  Mammalian protein metabolism , 1964 .

[2]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[3]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[4]  N. Sugiura Further analysts of the data by akaike' s information criterion and the finite corrections , 1978 .

[5]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[6]  S. Jeffery Evolution of Protein Molecules , 1979 .

[7]  M. Kimura Estimation of evolutionary distances between homologous nucleotide sequences. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[8]  S. Tavaré Some probabilistic and statistical problems in the analysis of DNA sequences , 1986 .

[9]  Robert M. Miura,et al.  Some mathematical questions in biology : DNA sequence analysis , 1986 .

[10]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[11]  M. Nei,et al.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. , 1993, Molecular biology and evolution.

[12]  D. Swofford,et al.  Evolution of the Mitochondrial Cytochrome Oxidase II Gene in Collembola , 1997, Journal of Molecular Evolution.

[13]  O Gascuel,et al.  BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data. , 1997, Molecular biology and evolution.

[14]  C. W. Kilpatrick,et al.  Phylogeography and molecular systematics of the Peromyscus aztecus species group (Rodentia: Muridae) inferred using parsimony and likelihood. , 1997, Systematic biology.

[15]  David Posada,et al.  MODELTEST: testing the model of DNA substitution , 1998, Bioinform..

[16]  David R. Anderson,et al.  Model Selection and Multimodel Inference , 2003 .

[17]  David Posada,et al.  The Effect of Branch Length Variation on the Selection of Models of Molecular Evolution , 2001, Journal of Molecular Evolution.

[18]  K. Crandall,et al.  Selecting the best-fit model of nucleotide substitution. , 2001, Systematic biology.

[19]  C. Cunningham,et al.  The effects of nucleotide substitution model assumptions on estimates of nonparametric bootstrap support. , 2002, Molecular biology and evolution.

[20]  T. Buckley,et al.  Model misspecification and probabilistic tests of topology: evidence from empirical data sets. , 2002, Systematic biology.

[21]  W. Pearson,et al.  Current Protocols in Bioinformatics , 2002 .

[22]  D. Posada Using MODELTEST and PAUP* to Select a Model of Nucleotide Substitution , 2003, Current protocols in bioinformatics.

[23]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[24]  Zaid Abdo,et al.  Performance-based selection of likelihood models for phylogeny estimation. , 2003, Systematic biology.

[25]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[26]  Ramakant Sharma,et al.  Phylogeny Estimation and Hypothesis Testing using Maximum Likelihood , 2003 .

[27]  Emily C. Moriarty,et al.  The importance of proper model assumption in bayesian phylogenetics. , 2004, Systematic biology.

[28]  M. P. Cummings PHYLIP (Phylogeny Inference Package) , 2004 .

[29]  D. Pol Empirical problems of the hierarchical likelihood ratio test for model selection. , 2004, Systematic biology.

[30]  Jerald B. Johnson,et al.  Model selection in ecology and evolution. , 2004, Trends in ecology & evolution.

[31]  A. Zharkikh Estimation of evolutionary distances between nucleotide sequences , 1994, Journal of Molecular Evolution.

[32]  D. Posada,et al.  Model selection and model averaging in phylogenetics: advantages of akaike information criterion and bayesian approaches over likelihood ratio tests. , 2004, Systematic biology.

[33]  H. Kishino,et al.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA , 2005, Journal of Molecular Evolution.

[34]  James E. Byers,et al.  MODEL SELECTION IN PHYLOGENETICS , 2005 .

[35]  Thomas J Naughton,et al.  Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified , 2006, BMC Evolutionary Biology.

[36]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[37]  Zaid Abdo,et al.  Accounting for uncertainty in the tree topology has little effect on the decision-theoretic approach to model selection in phylogeny estimation. , 2005, Molecular biology and evolution.

[38]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.

[39]  Michael E Alfaro,et al.  Comparative performance of Bayesian and AIC-based measures of phylogenetic model uncertainty. , 2006, Systematic biology.

[40]  Michael A. Thomas,et al.  Model use in phylogenetics: nine key questions. , 2007, Trends in ecology & evolution.