RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies

Motivation: Phylogenies are increasingly used in all fields of medical and biological research. Moreover, because of the next-generation sequencing revolution, datasets used for conducting phylogenetic analyses grow at an unprecedented pace. RAxML (Randomized Axelerated Maximum Likelihood) is a popular program for phylogenetic analyses of large datasets under maximum likelihood. Since the last RAxML paper in 2006, it has been continuously maintained and extended to accommodate the increasingly growing input datasets and to serve the needs of the user community. Results: I present some of the most notable new features and extensions of RAxML, such as a substantial extension of substitution models and supported data types, the introduction of SSE3, AVX and AVX2 vector intrinsics, techniques for reducing the memory requirements of the code and a plethora of operations for conducting post-analyses on sets of trees. In addition, an up-to-date 50-page user manual covering all new RAxML options is available. Availability and implementation: The code is available under GNU GPL at https://github.com/stamatak/standard-RAxML. Contact: alexandros.stamatakis@h-its.org Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Alexandros Stamatakis,et al.  Novel Parallelization Schemes for Large-Scale Likelihood-based Phylogenetic Inference , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[2]  Alexandros Stamatakis,et al.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models , 2006, Bioinform..

[3]  Alexandros Stamatakis,et al.  Accuracy of morphology-based phylogenetic fossil placement under Maximum Likelihood , 2010, ACS/IEEE International Conference on Computer Systems and Applications - AICCSA 2010.

[4]  Minh Anh Nguyen,et al.  Ultrafast Approximation for Phylogenetic Bootstrap , 2013, Molecular biology and evolution.

[5]  A. Stamatakis,et al.  Automated Plausibility Analysis of Large Phylogenies , 2015 .

[6]  J. Rougemont,et al.  A rapid bootstrap algorithm for the RAxML Web servers. , 2008, Systematic biology.

[7]  P. Lewis A likelihood approach to estimating phylogeny from discrete morphological character data. , 2001, Systematic biology.

[8]  Alexandros Stamatakis,et al.  Uncovering Hidden Phylogenetic Consensus in Large Data Sets , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[9]  Antonis Rokas,et al.  Inferring ancient divergences requires genes with strong phylogenetic signals , 2013, Nature.

[10]  Denis Krompass,et al.  Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood , 2011, Systematic biology.

[11]  Alexandros Stamatakis,et al.  Hybrid MPI/Pthreads parallelization of the RAxML phylogenetics code , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW).

[12]  Alexandros Stamatakis,et al.  Parallelized phylogenetic post-analysis on multi-core architectures , 2010, J. Comput. Sci..

[13]  Alexandros Stamatakis,et al.  How Many Bootstrap Replicates Are Necessary? , 2009, RECOMB.

[14]  Olivier Gascuel,et al.  Modeling protein evolution with several amino acid replacement matrices depending on site rates. , 2012, Molecular biology and evolution.

[15]  Alexandros Stamatakis,et al.  Algorithms, data structures, and numerics for likelihood-based phylogenetic inference of huge trees , 2011, BMC Bioinformatics.

[16]  O. Gascuel,et al.  New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. , 2010, Systematic biology.