Molecular dating of phylogenies by likelihood methods: a comparison of models and a new information criterion.

Dating the divergence in a phylogenetic tree is a fundamental step in evolutionary analysis. Some extensions and improvements of the penalised likelihood method originally presented by Sanderson are introduced. The improvements are the introduction of alternative models, including one with non-correlated rates of molecular substitution ("relaxed" model), a completely reworked fitting algorithm that considers the high-dimensionality of the optimisation problem, and the development of a new information criterion for model selection in the presence of a penalised term. It is also shown that the strict clock model is a special case of the present approach. An extensive simulation study was conducted to assess the statistical performance of these improvements. Overall, the different estimators studied here appeared as unbiased though their variance varied depending on the fitted and the simulated models and on the number of calibration points. The strict clock model gave good estimates of branch lengths even in the presence of heterogeneous substitution rates. The correlated model gave the best estimates of substitution rates whatever the model used to simulate the data. These results, which are certainly the first from an extensive simulation study of a molecular dating method, call for more comparison with alternative methods, as well as further work on the developments introduced here.

[1]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[2]  Robert Lanfear,et al.  Improved characterisation of among-lineage rate variation in cetacean mitogenomes using codon-partitioned relaxed clocks , 2010, Mitochondrial DNA.

[3]  S. Magallón Using fossils to break long branches in molecular dating: a comparison of relaxed clocks applied to the origin of angiosperms. , 2010, Systematic biology.

[4]  S. Tavaré,et al.  Dating primate divergences through an integrated analysis of palaeontological and molecular data. , 2011, Systematic biology.

[5]  Ziheng Yang,et al.  Approximate likelihood calculation on a phylogeny for Bayesian estimation of divergence times. , 2011, Molecular biology and evolution.

[6]  D. Bryant,et al.  A general comparison of relaxed molecular clock models. , 2007, Molecular biology and evolution.

[7]  S. Ho,et al.  Relaxed Phylogenetics and Dating with Confidence , 2006, PLoS biology.

[8]  P H Harvey,et al.  Tempo and mode of evolution revealed from molecular phylogenies. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Rosemary M. McCloskey,et al.  Analyses of Phylogenetics and Evolution , 2015 .

[10]  Ziheng Yang,et al.  A heuristic rate smoothing procedure for maximum likelihood estimation of species divergence times , 2004 .

[11]  T. Garland Rate Tests for Phenotypic Evolution Using Phylogenetically Independent Contrasts , 1992, The American Naturalist.

[12]  A. Rambaut,et al.  BEAST: Bayesian evolutionary analysis by sampling trees , 2007, BMC Evolutionary Biology.

[13]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[14]  A. Baker,et al.  A mitogenomic timescale for birds detects variable phylogenetic rates of molecular evolution and refutes the standard molecular clock. , 2006, Molecular biology and evolution.

[15]  Ziheng Yang Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: Approximate methods , 1994, Journal of Molecular Evolution.

[16]  Barbara R. Holland,et al.  Analysis of Phylogenetics and Evolution with R , 2007 .

[17]  M. Sanderson Estimating absolute rates of molecular evolution and divergence times: a penalized likelihood approach. , 2002, Molecular biology and evolution.

[18]  M. Sanderson,et al.  Relaxed molecular clocks, the bias-variance trade-off, and the quality of phylogenetic inference. , 2010, Systematic biology.

[19]  G. Kitagawa,et al.  Information Criteria and Statistical Modeling , 2007 .

[20]  T. Britton,et al.  Phylogenetic dating with confidence intervals using mean path lengths. , 2002, Molecular phylogenetics and evolution.

[21]  Itay Mayrose,et al.  A likelihood method for detecting trait-dependent shifts in the rate of molecular evolution. , 2011, Molecular biology and evolution.

[22]  Nicolas Lartillot,et al.  PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating , 2009, Bioinform..

[23]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[24]  Michael J Sanderson,et al.  Penalized likelihood phylogenetic inference: bridging the parsimony-likelihood gap. , 2008, Systematic biology.

[25]  S. Guindon,et al.  Bayesian estimation of divergence times from large sequence alignments. , 2010, Molecular biology and evolution.

[26]  Xuhua Xia,et al.  A distance-based least-square method for dating speciation events. , 2011, Molecular phylogenetics and evolution.

[27]  C. Thébaud,et al.  Molecular phylogeography reveals island colonization history and diversification of western Indian Ocean sunbirds (Nectarinia: Nectariniidae). , 2003, Molecular phylogenetics and evolution.

[28]  Michael J. Sanderson,et al.  A Nonparametric Approach to Estimating Divergence Times in the Absence of Rate Constancy , 1997 .

[29]  Effrey,et al.  Divergence Time and Evolutionary Rate Estimation with Multilocus Data , 2002 .

[30]  H. Kishino,et al.  Estimating the rate of evolution of the rate of molecular evolution. , 1998, Molecular biology and evolution.

[31]  N. Lartillot,et al.  A phylogenetic model for investigating correlated evolution of substitution rates and continuous phenotypic characters. , 2011, Molecular biology and evolution.

[32]  Michael J. Sanderson,et al.  R8s: Inferring Absolute Rates of Molecular Evolution, Divergence times in the Absence of a Molecular Clock , 2003, Bioinform..

[33]  Ziheng Yang,et al.  Rate variation and estimation of divergence times using strict and relaxed clocks , 2011, BMC Evolutionary Biology.

[34]  Ziheng Yang,et al.  Bayesian estimation of species divergence times under a molecular clock using multiple fossil calibrations with soft bounds. , 2006, Molecular biology and evolution.

[35]  Bobby Schnabel,et al.  A modular system of algorithms for unconstrained minimization , 1985, TOMS.

[36]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..