Computational Phylogenetics: An Introduction to Designing Methods for Phylogeny Estimation

A comprehensive account of both basic and advanced material in phylogeny estimation, focusing on computational and statistical issues. No background in biology or computer science is assumed, and there is minimal use of mathematical formulas, meaning that students from many disciplines, including biology, computer science, statistics, and applied mathematics, will find the text accessible. The mathematical and statistical foundations of phylogeny estimation are presented rigorously, following which more advanced material is covered. This includes substantial chapters on multi-locus phylogeny estimation, supertree methods, Markov models of sequence evolution, multiple sequence alignment techniques, and designing methods for large-scale phylogeny estimation. The author provides key analytical techniques to prove theoretical properties about methods, as well as addressing performance in practice for methods for estimating trees. Research problems requiring novel computational methods are also presented, so that graduate students and researchers from varying disciplines will be able to enter the broad and exciting field of computational phylogenetics.

[1]  David Fernández-Baca,et al.  Robinson-Foulds Supertrees , 2010, Algorithms for Molecular Biology.

[2]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[3]  Elena Rivas,et al.  Parameterizing sequence alignment with an explicit evolutionary model , 2015, BMC Bioinformatics.

[4]  Luay Nakhleh,et al.  Phylogenetic networks , 2004 .

[5]  Elchanan Mossel,et al.  Optimal phylogenetic reconstruction , 2005, STOC '06.

[6]  P. Erdös,et al.  Local Quartet Splits of a Binary Tree Infer All Quartet Splits Via One Dyadic Inference Rule , 1996, Comput. Artif. Intell..

[7]  Wen-Hsiung Li,et al.  Fundamentals of molecular evolution , 1990 .

[8]  J. Neyman MOLECULAR STUDIES OF EVOLUTION: A SOURCE OF NOVEL STATISTICAL PROBLEMS* , 1971 .

[9]  Tandy J. Warnow,et al.  Sequence-Length Requirements for Phylogenetic Methods , 2002, WABI.

[10]  David Fernández-Baca,et al.  iGTP: A software package for large-scale gene tree parsimony analysis , 2010, BMC Bioinformatics.

[11]  L. Baum,et al.  An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[12]  Cynthia A. Phillips,et al.  The Asymmetric Median Tree - A New Model for Building Consensus Trees , 1996, Discret. Appl. Math..

[13]  Constantinos Daskalakis,et al.  Alignment-Free Phylogenetic Reconstruction: Sample Complexity via a Branching Process Analysis , 2011, ArXiv.

[14]  Lior Pachter,et al.  Combinatorics of least-squares trees , 2008, Proceedings of the National Academy of Sciences.

[15]  Tao Jiang,et al.  MSOAR: A High-Throughput Ortholog Assignment System Based on Genome Rearrangement , 2007, J. Comput. Biol..

[16]  Sheng Guo,et al.  Large-scale simulation of RNA macroevolution by an energy-dependent fitness model , 2009, 0912.2326.

[17]  Sagi Snir,et al.  Triplet MaxCut: a new toolkit for rooted supertree , 2016 .

[18]  Magnus Bordewich,et al.  An algorithm for reconstructing ultrametric tree-child networks from inter-taxa distances , 2016, Discret. Appl. Math..

[19]  David Haussler,et al.  Cactus: Algorithms for genome multiple sequence alignment. , 2011, Genome research.

[20]  Stephen J. Willson,et al.  Reconstruction of Certain Phylogenetic Networks from Their Tree-Average Distances , 2013, Bulletin of Mathematical Biology.

[21]  Charles Semple,et al.  A supertree method for rooted trees , 2000, Discret. Appl. Math..

[22]  B. Efron,et al.  Bootstrap confidence levels for phylogenetic trees. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[23]  N. Grishin,et al.  MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information , 2006, Nucleic acids research.

[24]  David Fernández-Baca,et al.  Mrf Supertrees , 2004 .

[25]  D. Sankoff,et al.  Gene Order Breakpoint Evidence in Animal Mitochondrial Phylogeny , 1999, Journal of Molecular Evolution.

[26]  Daniel Doerr,et al.  Orthology Detection Combining Clustering and Synteny for Very Large Datasets , 2014, PloS one.

[27]  O. Gascuel,et al.  Une approche phylo-HMM pour la recherche de s ´ equences , 2008 .

[28]  Guilherme Oliveira,et al.  Assessing the efficiency of multiple sequence alignment programs , 2014, Algorithms for Molecular Biology.

[29]  Mihai Pop,et al.  TIPP: taxonomic identification and phylogenetic profiling , 2014, Bioinform..

[30]  W. Fitch Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology , 1971 .

[31]  David Sankoff,et al.  Multiple Genome Rearrangement and Breakpoint Phylogeny , 1998, J. Comput. Biol..

[32]  S. Holmes,et al.  Bootstrapping Phylogenetic Trees: Theory and Methods , 2003 .

[33]  Xin Deng,et al.  MSACompro: protein multiple sequence alignment using predicted secondary structure, solvent accessibility, and residue-residue contacts , 2011, BMC Bioinformatics.

[34]  Daniel H. Huson,et al.  Solving Large Scale Phylogenetic Problems using DCM2 , 1999, ISMB.

[35]  D. Higgins,et al.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.

[36]  N. Sugiura Further analysts of the data by akaike' s information criterion and the finite corrections , 1978 .

[37]  A. Löytynoja,et al.  Co-estimation of Phylogeny-aware Alignment and Phylogenetic Tree , 2016, bioRxiv.

[38]  Sébastien Roch,et al.  A short proof that phylogenetic tree reconstruction by maximum likelihood is hard , 2005, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[39]  S. Karlin,et al.  Evolutionary Processes and Theory , 1986 .

[40]  N. Rosenberg,et al.  Discordance of Species Trees with Their Most Likely Gene Trees , 2006, PLoS genetics.

[41]  David James Russell,et al.  Multiple sequence alignment methods , 2006 .

[42]  Tandy J. Warnow,et al.  A few logs suffice to build (almost) all trees (I) , 1999, Random Struct. Algorithms.

[43]  M. Fujita,et al.  Introgression and phenotypic assimilation in Zimmerius flycatchers (Tyrannidae): population genetic and phylogenetic inferences from genome-wide SNPs. , 2014, Systematic biology.

[44]  Md. Shamsuzzoha Bayzid,et al.  Whole-genome analyses resolve early branches in the tree of life of modern birds , 2014, Science.

[45]  Mark T. Holder,et al.  The Posterior and the Prior in Bayesian Phylogenetics , 2006 .

[46]  Benedict Paten,et al.  Sequence progressive alignment, a framework for practical large-scale probabilistic consistency alignment , 2009, Bioinform..

[47]  Katherine St. John,et al.  Hamiltonian Walks of Phylogenetic Treespaces , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[48]  Vincent Moulton,et al.  Consensus Networks: A Method for Visualising Incompatibilities in Collections of Trees , 2003, WABI.

[49]  John M. Mellor-Crummey,et al.  PRec-I-DCM3: A Parallel Framework for Fast and Accurate Large Scale Phylogeny Reconstruction , 2005, 11th International Conference on Parallel and Distributed Systems (ICPADS'05).

[50]  Gonzalo Giribet,et al.  Dynamic homology and phylogenetic systematics: a unified approach using POY , 2006 .

[51]  Luay Nakhleh,et al.  Species Tree Inference by Minimizing Deep Coalescences , 2009, PLoS Comput. Biol..

[52]  Ziheng Yang,et al.  INDELible: A Flexible Simulator of Biological Sequence Evolution , 2009, Molecular biology and evolution.

[53]  A. Graybeal,et al.  Is it better to add taxa or characters to a difficult phylogenetic problem? , 1998, Systematic biology.

[54]  I. Holmes,et al.  A "Long Indel" model for evolutionary sequence alignment. , 2003, Molecular biology and evolution.

[55]  John D. Kececioglu,et al.  Aligning alignments exactly , 2004, RECOMB.

[56]  Y. Pauplin Direct Calculation of a Tree Length Using a Distance Matrix , 2000, Journal of Molecular Evolution.

[57]  O. Gascuel,et al.  Quartet-based phylogenetic inference: improvements and limits. , 2001, Molecular biology and evolution.

[58]  D. Haussler,et al.  Protein modeling using hidden Markov models: analysis of globins , 1993, [1993] Proceedings of the Twenty-sixth Hawaii International Conference on System Sciences.

[59]  Paramvir S. Dehal,et al.  FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments , 2010, PloS one.

[60]  J. Hein,et al.  Statistical alignment: computational properties, homology testing and goodness-of-fit. , 2000, Journal of molecular biology.

[61]  Olivier Poch,et al.  BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs , 1999, Bioinform..

[62]  Tandy J. Warnow,et al.  Designing fast converging phylogenetic methods , 2001, ISMB.

[63]  Yu Lin,et al.  A Metric for Phylogenetic Trees Based on Matching , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[64]  Bin Qian,et al.  Detecting distant homologs using phylogenetic tree‐based HMMs , 2003, Proteins.

[65]  David Fernández-Baca,et al.  A Polynomial-Time Algorithm for the Perfect Phylogeny Problem when the Number of Character States is Fixed , 1993, FOCS.

[66]  Cédric Notredame,et al.  Upcoming challenges for multiple sequence alignment methods in the high-throughput era , 2009, Bioinform..

[67]  Travis C. Glenn,et al.  A Phylogeny of Birds Based on Over 1,500 Loci Collected by Target Enrichment and High-Throughput Sequencing , 2012, PloS one.

[68]  István Miklós Algorithm for statistical alignment of two sequences derived from a Poisson sequence length distribution , 2003, Discret. Appl. Math..

[69]  Bryan Kolaczkowski,et al.  Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous , 2004, Nature.

[70]  Samuel Fiorini,et al.  Approximating the balanced minimum evolution problem , 2012, Oper. Res. Lett..

[71]  Oliver Eulenstein,et al.  DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony , 2008, Bioinform..

[72]  M. Nei,et al.  Theoretical foundation of the minimum-evolution method of phylogenetic inference. , 1993, Molecular biology and evolution.

[73]  E. Braun,et al.  Ancestral range reconstruction of Galliformes: the effects of topology and taxon sampling , 2017 .

[74]  O Gascuel,et al.  Strengths and limitations of the minimum evolution principle. , 2001, Systematic biology.

[75]  D. Haussler,et al.  Human-mouse alignments with BLASTZ. , 2003, Genome research.

[76]  W. Doolittle,et al.  Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability. , 2003, Molecular biology and evolution.

[77]  John D. Kececioglu,et al.  Aligning Protein Sequences with Predicted Secondary Structure , 2010, J. Comput. Biol..

[78]  Michael D. Hendy,et al.  A Framework for the Quantitative Study of Evolutionary Trees , 1989 .

[79]  Matthieu Muffato,et al.  Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference , 2015, Systematic biology.

[80]  M. Steel The complexity of reconstructing trees from qualitative characters and subtrees , 1992 .

[81]  Colin N. Dewey,et al.  BUCKy: Gene tree/species tree reconciliation with Bayesian concordance analysis , 2010, Bioinform..

[82]  Claudia R. Solís-Lemus,et al.  Inferring Phylogenetic Networks with Maximum Pseudolikelihood under Incomplete Lineage Sorting , 2015, PLoS genetics.

[83]  S. Tavaré Some probabilistic and statistical problems in the analysis of DNA sequences , 1986 .

[84]  Denis Krompass,et al.  Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood , 2011, Systematic biology.

[85]  David Posada,et al.  Phylogenomics for Systematic Biology. , 2016, Systematic biology.

[86]  Jian Ma,et al.  PSAR-Align: improving multiple sequence alignment using probabilistic sampling , 2014, Bioinform..

[87]  M. Donoghue,et al.  Mega-phylogeny approach for comparative biology: an alternative to supertree and supermatrix approaches , 2009, BMC Evolutionary Biology.

[88]  Franziska Wulf,et al.  Mathematical Population Genetics , 2016 .

[89]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[90]  Otto Optiz,et al.  Conceptual and Numerical Analysis of Data , 1989 .

[91]  M. Steel,et al.  Computing the Distribution of a Tree Metric , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[92]  Daniel H. Huson,et al.  Phylogenetic Networks - Concepts, Algorithms and Applications , 2011 .

[93]  J. Felsenstein,et al.  An evolutionary model for maximum likelihood alignment of DNA sequences , 1991, Journal of Molecular Evolution.

[94]  Bin Ma,et al.  Alignment between Two Multiple Alignments , 2003, CPM.

[95]  A. Tversky,et al.  Additive similarity trees , 1977 .

[96]  Martin Vingron,et al.  Weighted sequence graphs: boosting iterated dynamic programming using locally suboptimal solutions , 2003, Discret. Appl. Math..

[97]  V Moulton,et al.  Likelihood analysis of phylogenetic networks using directed graphical models. , 2000, Molecular biology and evolution.

[98]  T. Tuller,et al.  Inferring phylogenetic networks by the maximum parsimony criterion: a case study. , 2006, Molecular biology and evolution.

[99]  Golan Yona,et al.  Within the twilight zone: a sensitive profile-profile comparison tool based on information theory. , 2002, Journal of molecular biology.

[100]  Tandy J. Warnow,et al.  Estimating true evolutionary distances between genomes , 2001, STOC '01.

[101]  David Bryant,et al.  Inferring species trees directly from biallelic genetic markers: bypassing gene trees in a full coalescent analysis. , 2009, Molecular biology and evolution.

[102]  David Fernández-Baca,et al.  Fast Local Search for Unrooted Robinson-Foulds Supertrees , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[103]  A. Drummond,et al.  Bayesian Inference of Species Trees from Multilocus Data , 2009, Molecular biology and evolution.

[104]  M. Wilkinson Common Cladistic Information and its Consensus Representation: Reduced Adams and Reduced Cladistic Consensus Trees and Profiles , 1994 .

[105]  Jimin Pei,et al.  PCMA: fast and accurate multiple sequence alignment based on profile consistency , 2003, Bioinform..

[106]  Laura Kubatko,et al.  Identifiability of the unrooted species tree topology under the coalescent model with time-reversible substitution processes, site-specific rate variation, and invariable sites. , 2014, Journal of theoretical biology.

[107]  D. Morrison,et al.  Using data-display networks for exploratory data analysis in phylogenetic studies. , 2010, Molecular biology and evolution.

[108]  Louis J. Billera,et al.  Geometry of the Space of Phylogenetic Trees , 2001, Adv. Appl. Math..

[109]  Gabriel Cardona,et al.  On Nakhleh's Metric for Reduced Phylogenetic Networks , 2009, TCBB.

[110]  S. Poe Evaluation of the strategy of long-branch subdivision to improve the accuracy of phylogenetic methods. , 2003, Systematic biology.

[111]  Jijun Tang,et al.  Reconstructing phylogenies from gene-content and gene-order data , 2007, Mathematics of Evolution and Phylogeny.

[112]  K. Kidd,et al.  Phylogenetic analysis: concepts and methods. , 1971, American journal of human genetics.

[113]  Iain M. Wallace,et al.  M-Coffee: combining multiple sequence alignment methods with T-Coffee , 2006, Nucleic acids research.

[114]  Jonathan P. Bollback,et al.  Bayesian Inference of Phylogeny and Its Impact on Evolutionary Biology , 2001, Science.

[115]  Krister M. Swenson,et al.  Gene tree correction for reconciliation and species tree inference , 2012, Algorithms for Molecular Biology.

[116]  Inna Dubchak,et al.  Multiple whole-genome alignments without a reference organism. , 2009, Genome research.

[117]  Stephen Jay Gould,et al.  The shape of evolution: a comparison of real and random clades , 1977, Paleobiology.

[118]  Ian Holmes,et al.  Evolutionary HMMs: a Bayesian approach to multiple alignment , 2001, Bioinform..

[119]  A. von Haeseler,et al.  IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies , 2014, Molecular biology and evolution.

[120]  Ziheng Yang,et al.  Molecular Evolution: A Statistical Approach , 2014 .

[121]  Olivier Gascuel,et al.  Mathematics of Evolution and Phylogeny , 2005 .

[122]  Elena Rivas,et al.  Probabilistic Phylogenetic Inference with Insertions and Deletions , 2008, PLoS Comput. Biol..

[123]  Bengt Oxelman,et al.  Statistical inference of allopolyploid species networks in the presence of incomplete lineage sorting. , 2012, Systematic biology.

[124]  N. Perna,et al.  progressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and Rearrangement , 2010, PloS one.

[125]  Sophie S Abby,et al.  Phylogenetic modeling of lateral gene transfer reconstructs the pattern and relative timing of speciations , 2012, Proceedings of the National Academy of Sciences.

[126]  Rezwana Reaz,et al.  Accurate Phylogenetic Tree Reconstruction from Quartets: A Heuristic Approach , 2014, PloS one.

[127]  Tandy J. Warnow,et al.  The Performance of Phylogenetic Methods on Trees of Bounded Diameter , 2001, WABI.

[128]  Alexandros Stamatakis,et al.  Does the choice of nucleotide substitution models matter topologically? , 2016, BMC Bioinformatics.

[129]  D. Morrison Next generation sequencing and phylogenetic networks , 2014 .

[130]  Desmond G. Higgins,et al.  Making automated multiple alignments of very large numbers of protein sequences , 2013, Bioinform..

[131]  A. Rokas,et al.  Evolutionary Genomics , 2012, Methods in Molecular Biology.

[132]  Risa Kawaguchi,et al.  Split-alignment of genomes finds orthologies more accurately , 2015, Genome Biology.

[133]  N. Goldman,et al.  A codon-based model of nucleotide substitution for protein-coding DNA sequences. , 1994, Molecular biology and evolution.

[134]  Irmgard Krisai-Greilhuber,et al.  Iteratively Refined Guide Trees Help Improving Alignment and Phylogenetic Inference in the Mushroom Family Bolbitiaceae , 2013, PloS one.

[135]  Oliver Eulenstein,et al.  The shape of supertrees to come: tree shape related properties of fourteen supertree methods. , 2005, Systematic biology.

[136]  F. Blattner,et al.  Mauve: multiple alignment of conserved genomic sequence with rearrangements. , 2004, Genome research.

[137]  David Haussler,et al.  Using Dirichlet Mixture Priors to Derive Hidden Markov Models for Protein Families , 1993, ISMB.

[138]  K. Crandall,et al.  Selecting the best-fit model of nucleotide substitution. , 2001, Systematic biology.

[139]  Andrew R Francis,et al.  Tree-like reticulation networks--when do tree-like distances also support reticulate evolution? , 2015, Mathematical biosciences.

[140]  Kimmen Sjölander,et al.  COACH : profile-profile alignment of protein families using hidden Markov models , 2003 .

[141]  S F Altschul,et al.  Generalized affine gap costs for protein sequence alignment , 1998, Proteins.

[142]  Daniel P. Miranker,et al.  An accurate scalable template-based alignment algorithm , 2012, 2012 IEEE International Conference on Bioinformatics and Biomedicine.

[143]  D. Sankoff Minimal Mutation Trees of Sequences , 1975 .

[144]  István Miklós,et al.  StatAlign: an extendable software package for joint Bayesian estimation of alignments and evolutionary trees , 2008, Bioinform..

[145]  A. Mooers,et al.  Signatures of random and selective mass extinctions in phylogenetic tree balance. , 2002, Systematic biology.

[146]  Jerrold I. Davis,et al.  Phylogeny and subfamilial classification of the grasses (Poaceae) , 2001 .

[147]  Kimmen Sjölander,et al.  The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification , 2013, Nucleic Acids Res..

[148]  Toni Gabaldón,et al.  Measuring guide-tree dependency of inferred gaps in progressive aligners , 2013, Bioinform..

[149]  Sarah C. Ayling,et al.  Novel methodology for construction and pruning of quasi-median networks , 2008, BMC Bioinformatics.

[150]  M. Gouy,et al.  Genome-scale coestimation of species and gene trees , 2013, Genome research.

[151]  Vineet Bafna,et al.  Genome Rearrangements and Sorting by Reversals , 1996, SIAM J. Comput..

[152]  Dennis R. Livesay,et al.  Probalign: multiple sequence alignment using partition function posterior probabilities , 2006, Bioinform..

[153]  Richard R. Hudson,et al.  TESTING THE CONSTANT‐RATE NEUTRAL ALLELE MODEL WITH PROTEIN SEQUENCE DATA , 1983, Evolution; international journal of organic evolution.

[154]  Lloyd Allison,et al.  Minimum message length encoding, evolutionary trees and multiple-alignment , 1992, Proceedings of the Twenty-Fifth Hawaii International Conference on System Sciences.

[155]  Olivier Gascuel,et al.  A ‘Stochastic Safety Radius’ for Distance-Based Tree Reconstruction , 2014, Algorithmica.

[156]  Enno Ohlebusch,et al.  Efficient multiple genome alignment , 2002, ISMB.

[157]  David Bryant,et al.  A classification of consensus methods for phylogenetics , 2001, Bioconsensus.

[158]  David Haussler,et al.  Combining Phylogenetic and Hidden Markov Models in Biosequence Analysis , 2004, J. Comput. Biol..

[159]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.

[160]  Tandy J. Warnow,et al.  A simulation study comparing supertree and combined analysis methods using SMIDGen , 2009, Algorithms for Molecular Biology.

[161]  Ramón Doallo,et al.  CircadiOmics: integrating circadian genomics, transcriptomics, proteomics and metabolomics , 2012, Nature Methods.

[162]  D. Maddison The rapidly changing landscape of insect phylogenetics. , 2016, Current opinion in insect science.

[163]  Yann Ponty,et al.  ecceTERA: comprehensive gene tree-species tree reconciliation using parsimony , 2016, Bioinform..

[164]  Nina Amenta,et al.  Case study: visualizing sets of evolutionary trees , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[165]  G. Mitchison A Probabilistic Treatment of Phylogeny and Sequence Alignment , 1999, Journal of Molecular Evolution.

[166]  István Miklós,et al.  Efficient representation of uncertainty in multiple sequence alignments using directed acyclic graphs , 2015, BMC Bioinformatics.

[167]  Sagi Snir,et al.  Maximum likelihood of phylogenetic networks , 2006, Bioinform..

[168]  Michael G. Nute,et al.  Scaling statistical multiple sequence alignment to large datasets , 2016, BMC Genomics.

[169]  David L. Swofford,et al.  Are Guinea Pigs Rodents? The Importance of Adequate Models in Molecular Phylogenetics , 1997, Journal of Mammalian Evolution.

[170]  Mark D. Wilkinson,et al.  A view of supertree methods , 2001, Bioconsensus.

[171]  M. O. Dayhoff,et al.  22 A Model of Evolutionary Change in Proteins , 1978 .

[172]  Jordan V Smith,et al.  Ratite nonmonophyly: independent evidence from 40 novel Loci. , 2013, Systematic biology.

[173]  O. Gascuel On the optimization principle in phylogenetic analysis and the minimum-evolution criterion. , 2000, Molecular biology and evolution.

[174]  Q. Feng,et al.  On better heuristic for Euclidean Steiner minimum trees , 1991, [1991] Proceedings 32nd Annual Symposium of Foundations of Computer Science.

[175]  John Gatesy,et al.  Phylogenetic analysis at deep timescales: unreliable gene trees, bypassed hidden support, and the coalescence/concatalescence conundrum. , 2014, Molecular phylogenetics and evolution.

[176]  M. Syvanen,et al.  Cross-species gene transfer; implications for a new theory of evolution. , 1985, Journal of theoretical biology.

[177]  Kevin J. Liu,et al.  Maximum likelihood inference of reticulate evolutionary histories , 2014, Proceedings of the National Academy of Sciences.

[178]  Tandy J. Warnow,et al.  Towards the Development of Computational Tools for Evaluating Phylogenetic Network Reconstruction Methods , 2002, Pacific Symposium on Biocomputing.

[179]  David Sankoff,et al.  Locating the vertices of a steiner tree in an arbitrary metric space , 1975, Math. Program..

[180]  Michael I. Jordan,et al.  Evolutionary inference via the Poisson Indel Process , 2012, Proceedings of the National Academy of Sciences.

[181]  B. Peyton,et al.  An Introduction to Chordal Graphs and Clique Trees , 1993 .

[182]  R. Durrett Probability Models for DNA Sequence Evolution , 2002 .

[183]  D. Hillis,et al.  Taxon sampling and the accuracy of phylogenetic analyses , 2008 .

[184]  John A Rhodes,et al.  Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent , 2009, Journal of mathematical biology.

[185]  Ziheng Yang,et al.  Bayes estimation of species divergence times and ancestral population sizes using DNA sequences from multiple loci. , 2003, Genetics.

[186]  T. Dobzhansky,et al.  Inversions in the Third Chromosome of Wild Races of Drosophila Pseudoobscura, and Their Use in the Study of the History of the Species. , 1936, Proceedings of the National Academy of Sciences of the United States of America.

[187]  David Haussler,et al.  Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology , 1996, Comput. Appl. Biosci..

[188]  István Miklós,et al.  Statistical Alignment: Recent Progress, New Applications, and Challenges , 2005 .

[189]  John R. Davidson,et al.  SATCHMO-JS: a webserver for simultaneous protein multiple sequence alignment and phylogenetic tree construction , 2010, Nucleic Acids Res..

[190]  Sharon R Grossman,et al.  Detecting natural selection in genomic data. , 2013, Annual review of genetics.

[191]  Alexey M. Kozlov,et al.  ExaML version 3: a tool for phylogenomic analyses on supercomputers , 2015, Bioinform..

[192]  Frédéric Delsuc,et al.  Visualizing conflicting evolutionary hypotheses in large collections of trees: using consensus networks to study the origins of placentals and hexapods. , 2005, Systematic biology.

[193]  D. Kendall,et al.  Mathematics in the Archaeological and Historical Sciences , 1971, The Mathematical Gazette.

[194]  Mikkel Thorup,et al.  On the approximability of numerical taxonomy (fitting distances by tree metrics) , 1996, SODA '96.

[195]  Oliver Eulenstein,et al.  Algorithms for Genome-Scale Phylogenetics Using Gene Tree Parsimony , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[196]  M. Gerstein,et al.  What is a gene, post-ENCODE? History and updated definition. , 2007, Genome research.

[197]  Satish Rao,et al.  Quartets MaxCut: A Divide and Conquer Quartets Algorithm , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[198]  Susan Holmes,et al.  Phylogenies: An Overview , 1997 .

[199]  Mark A. Ragan,et al.  The MRP Method , 2004 .

[200]  O. Gascuel,et al.  Theoretical foundation of the balanced minimum evolution method of phylogenetic inference and its relationship to weighted least-squares tree fitting. , 2003, Molecular biology and evolution.

[201]  Mark Wilkinson,et al.  Majority-rule supertrees. , 2007, Systematic biology.

[202]  Laura Kubatko,et al.  Estimating species trees : practical and theoretical aspects , 2010 .

[203]  J. Felsenstein,et al.  Inching toward reality: An improved likelihood model of sequence evolution , 2004, Journal of Molecular Evolution.

[204]  N. Grishin,et al.  COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance. , 2003, Journal of molecular biology.

[205]  R. Gray,et al.  Language-tree divergence times support the Anatolian theory of Indo-European origin , 2003, Nature.

[206]  K. Sjölander,et al.  FlowerPower: clustering proteins into domain architecture classes for phylogenomic inference of protein function , 2007, BMC evolutionary biology.

[207]  Maria Anisimova,et al.  Investigating protein-coding sequence evolution with probabilistic codon substitution models. , 2009, Molecular biology and evolution.

[208]  Kiyoshi Ezawa General continuous-time Markov model of sequence evolution via insertions/deletions: are alignment probabilities factorable? , 2016, BMC Bioinformatics.

[209]  L. Nakhleh,et al.  A Metric on the Space of Reduced Phylogenetic Networks , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[210]  W. Fitch,et al.  Construction of phylogenetic trees. , 1967, Science.

[211]  Tu Minh Phuong,et al.  Multiple alignment of protein sequences with repeats and rearrangements , 2006, Nucleic acids research.

[212]  Blanchette,et al.  Breakpoint Phylogenies. , 1997, Genome informatics. Workshop on Genome Informatics.

[213]  S. Roch Toward Extracting All Phylogenetic Information from Matrices of Evolutionary Distances , 2010, Science.

[214]  Ian Holmes,et al.  Dynamic programming alignment accuracy , 1998, RECOMB '98.

[215]  Bernard M. E. Moret,et al.  Phylogenetic Inference , 2011, Encyclopedia of Parallel Computing.

[216]  Peter Buneman,et al.  A characterisation of rigid circuit graphs , 1974, Discret. Math..

[217]  R. A. Fisher,et al.  On the dominance ratio , 1990 .

[218]  B. Faircloth,et al.  The evolution of peafowl and other taxa with ocelli (eyespots): a phylogenomic approach , 2014, Proceedings of the Royal Society B: Biological Sciences.

[219]  Hans-Jürgen Bandelt,et al.  A Relational Approach to Split Decomposition , 1993 .

[220]  Luay Nakhleh,et al.  Coalescent histories on phylogenetic networks and detection of hybridization despite incomplete lineage sorting. , 2011, Systematic biology.

[221]  W. Ewens,et al.  The chromosome inversion problem , 1982 .

[222]  D. Haussler,et al.  Hidden Markov models in computational biology. Applications to protein modeling. , 1993, Journal of molecular biology.

[223]  M. Tixier-Boichard,et al.  Genetic analysis of local Vietnamese chickens provides evidence of gene flow from wild to domestic populations , 2009, BMC Genetics.

[224]  Otto Opitz,et al.  Information and Classification , 1993 .

[225]  Chuong B. Do,et al.  Access the most recent version at doi: 10.1101/gr.926603 References , 2003 .

[226]  Tandy Warnow,et al.  Constructing phylogenetic trees efficiently using compatibility criteria , 1993 .

[227]  Sébastien Roch,et al.  Sequence Length Requirement of Distance-Based Phylogeny Reconstruction: Breaking the Polynomial Barrier , 2008, 2008 49th Annual IEEE Symposium on Foundations of Computer Science.

[228]  David Fernández-Baca,et al.  Flipping: A supertree construction method , 2001, Bioconsensus.

[229]  Adrian M. Altenhoff,et al.  Standardized benchmarking in the quest for orthologs , 2016, Nature Methods.

[230]  G. Dirac On rigid circuit graphs , 1961 .

[231]  D. Aldous Stochastic Analysis: The Continuum random tree II: an overview , 1991 .

[232]  E. Birney,et al.  Enredo and Pecan: genome-wide mammalian consistency-based multiple alignment with paralogs. , 2008, Genome research.

[233]  David A. Bader,et al.  A Linear-Time Algorithm for Computing Inversion Distance between Signed Permutations with an Experimental Study , 2001, WADS.

[234]  J. Farris A Probability Model for Inferring Evolutionary Trees , 1973 .

[235]  Tandy J. Warnow,et al.  PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences , 2015, J. Comput. Biol..

[236]  Tandy Warnow,et al.  SuperFine: fast and accurate supertree estimation. , 2012, Systematic biology.

[237]  Tandy Warnow,et al.  Treelength Optimization for Phylogeny Estimation , 2012, PloS one.

[238]  Paul Schliekelman,et al.  Statistical Methods in Bioinformatics: An Introduction , 2001 .

[239]  Tandy J. Warnow,et al.  Algorithms for MDC-Based Multi-Locus Phylogeny Inference: Beyond Rooted Binary Gene Trees on Single Alleles , 2011, J. Comput. Biol..

[240]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[241]  Sagi Snir,et al.  Recovering the Tree-Like Trend of Evolution Despite Extensive Lateral Genetic Transfer: A Probabilistic Analysis , 2012, RECOMB.

[242]  Patricia A. McLenachan,et al.  A Statistical Approach for Distinguishing Hybridization and Incomplete Lineage Sorting , 2009, The American Naturalist.

[243]  R. Graham,et al.  The steiner problem in phylogeny is NP-complete , 1982 .

[244]  J. Degnan Anomalous unrooted gene trees. , 2013, Systematic biology.

[245]  Tandy J. Warnow,et al.  Ultra-large alignments using phylogeny-aware profiles , 2015, Genome Biology.

[246]  P. Goloboff Analyzing Large Data Sets in Reasonable Times: Solutions for Composite Optima , 1999, Cladistics : the international journal of the Willi Hennig Society.

[247]  L. Nakhleh,et al.  Computational approaches to species phylogeny inference and gene tree reconciliation. , 2013, Trends in ecology & evolution.

[248]  Vincent Moulton,et al.  Using consensus networks to visualize contradictory evidence for species phylogeny. , 2004, Molecular biology and evolution.

[249]  Daniel H. Huson,et al.  Disk-Covering, a Fast-Converging Method for Phylogenetic Tree Reconstruction , 1999, J. Comput. Biol..

[250]  Yun S. Song,et al.  An Efficient Algorithm for Statistical Multiple Alignment on Arbitrary Phylogenetic Trees , 2003, J. Comput. Biol..

[251]  H Philippe,et al.  How many nucleotides are required to resolve a phylogenetic problem? The use of a new statistical method applicable to available sequences. , 1994, Molecular phylogenetics and evolution.

[252]  M. Ragan,et al.  Inferring phylogenies of evolving sequences without multiple sequence alignment , 2014, Scientific Reports.

[253]  D. Maddison The discovery and importance of multiple islands of most , 1991 .

[254]  Dannie Durand,et al.  Inferring duplications, losses, transfers and incomplete lineage sorting with nonbinary species trees , 2012, Bioinform..

[255]  E. Myers,et al.  Sequence comparison with concave weighting functions. , 1988, Bulletin of mathematical biology.

[256]  R. Doolittle,et al.  Progressive sequence alignment as a prerequisitetto correct phylogenetic trees , 2007, Journal of Molecular Evolution.

[257]  M Steel,et al.  Links between maximum likelihood and maximum parsimony under a simple model of site substitution. , 1997, Bulletin of mathematical biology.

[258]  Tandy Warnow,et al.  Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer , 2015, bioRxiv.

[259]  Barbara R. Holland,et al.  Imputing supertrees and supernetworks from quartets. , 2007 .

[260]  S. Jones,et al.  Principles of protein-protein interactions. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[261]  Elchanan Mossel,et al.  Incomplete Lineage Sorting: Consistent Phylogeny Estimation from Multiple Loci , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[262]  Alexandros Stamatakis,et al.  Aligning short reads to reference alignments and trees , 2011, Bioinform..

[263]  Tandy J. Warnow,et al.  FastRFS: fast and accurate Robinson-Foulds Supertrees using constrained exact optimization , 2016, Bioinform..

[264]  Yun Yu,et al.  A Distance-Based Method for Inferring Phylogenetic Networks in the Presence of Incomplete Lineage Sorting , 2015, ISBRA.

[265]  K. Crandall,et al.  Phylogeny Estimation and Hypothesis Testing Using Maximum Likelihood , 1997 .

[266]  Satish Rao,et al.  A tight bound on approximating arbitrary metrics by tree metrics , 2003, STOC '03.

[267]  Jacob A. Esselstyn,et al.  The Challenges of Resolving a Rapid, Recent Radiation: Empirical and Simulated Phylogenomics of Philippine Shrews. , 2015, Systematic biology.

[268]  Laura Kubatko,et al.  An Invariants-based Method for Efficient Identification of Hybrid Species From Large-scale Genomic Data , 2015 .

[269]  D. Higgins,et al.  See Blockindiscussions, Blockinstats, Blockinand Blockinauthor Blockinprofiles Blockinfor Blockinthis Blockinpublication Clustal: Blockina Blockinpackage Blockinfor Blockinperforming Multiple Blockinsequence Blockinalignment Blockinon Blockina Minicomputer Article Blockin Blockinin Blockin , 2022 .

[270]  Ian Holmes,et al.  Estimating Empirical Codon Hidden Markov Models , 2012, Molecular biology and evolution.

[271]  M. Nei,et al.  Relationships between gene trees and species trees. , 1988, Molecular biology and evolution.

[272]  Richa Agarwala,et al.  COBALT: constraint-based alignment tool for multiple protein sequences , 2007, Bioinform..

[273]  Jotun Hein,et al.  Statistical Methods in Bioinformatics: An Introduction , 2002 .

[274]  Bernard B. Suh,et al.  Reconstructing contiguous regions of an ancestral genome. , 2006, Genome research.

[275]  Mikkel Thorup,et al.  Fast comparison of evolutionary trees , 1994, SODA '94.

[276]  D. Posada,et al.  Model selection and model averaging in phylogenetics: advantages of akaike information criterion and bayesian approaches over likelihood ratio tests. , 2004, Systematic biology.

[277]  Paul Malliavin,et al.  Stochastic Analysis , 1997, Nature.

[278]  B. Faircloth,et al.  Analysis of a Rapid Evolutionary Radiation Using Ultraconserved Elements: Evidence for a Bias in Some Multispecies Coalescent Methods. , 2016, Systematic biology.

[279]  Ramón Doallo,et al.  ProtTest-HPC: Fast Selection of Best-Fit Models of Protein Evolution , 2010, Euro-Par Workshops.

[280]  Tandy J. Warnow,et al.  SEPP: SATe -Enabled Phylogenetic Placement , 2011, Pacific Symposium on Biocomputing.

[281]  Sampath Kannan,et al.  A fast algorithm for the computation and enumeration of perfect phylogenies when the number of character states is fixed , 1995, SODA '95.

[282]  K. Nixon The Parsimony Ratchet, a New Method for Rapid Parsimony Analysis , 1999 .

[283]  D. Mindell,et al.  The tree of life: metaphor, model, and heuristic device. , 2013, Systematic biology.

[284]  D. Pearl,et al.  Estimating species phylogenies using coalescence times among sequences. , 2009, Systematic biology.

[285]  Richard M. Karp,et al.  Reducibility Among Combinatorial Problems , 1972, 50 Years of Integer Programming.

[286]  Lynn Kuo,et al.  Bayesian Phylogenetics : Methods, Algorithms, and Applications , 2014 .

[287]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[288]  Gary Benson,et al.  Algorithms in Bioinformatics , 2002, Lecture Notes in Computer Science.

[289]  Noah A. Rosenberg,et al.  iGLASS: An Improvement to the GLASS Method for Estimating Species Trees from Gene Trees , 2012, J. Comput. Biol..

[290]  Axel Janke,et al.  Mammalian Evolution May not Be Strictly Bifurcating , 2010, Molecular biology and evolution.

[291]  Anne Berry,et al.  An Introduction to Clique Minimal Separator Decomposition , 2010, Algorithms.

[292]  David Sankoff,et al.  Analysis of gene order evolution beyond single-copy genes. , 2012, Methods in molecular biology.

[293]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[294]  H. Vogt Leçons sur la résolution algébrique des équations , 2015 .

[295]  Tandy J. Warnow,et al.  An analytical upper bound on the number of loci required for all splits of a species tree to appear in a set of gene trees , 2016, BMC Bioinformatics.

[296]  Olivier Gascuel,et al.  Fast and accurate branch lengths estimation for phylogenomic trees , 2015, BMC Bioinformatics.

[297]  T. Warnow,et al.  A STOCHASTIC MODEL OF LANGUAGE EVOLUTION THAT INCORPORATES HOMOPLASY AND BORROWING , 2005 .

[298]  Sagi Snir,et al.  Recovering the Tree-Like Trend of Evolution Despite Extensive Lateral Genetic Transfer: A Probabilistic Analysis , 2012, RECOMB.

[299]  A. Löytynoja,et al.  Phylogeny-Aware Gap Placement Prevents Errors in Sequence Alignment and Evolutionary Analysis , 2008, Science.

[300]  D. Mindell Fundamentals of molecular evolution , 1991 .

[301]  David Bryant,et al.  On the Uniqueness of the Selection Criterion in Neighbor-Joining , 2005, J. Classif..

[302]  T. Dobzhansky Nothing in Biology Makes Sense Except in the Light of Evolution , 1973 .

[303]  David Fernández-Baca,et al.  The Perfect Phylogeny Problem , 2001 .

[304]  Vineet Bafna,et al.  Sorting by Transpositions , 1998, SIAM J. Discret. Math..

[305]  João Luís Sobral,et al.  Parallel SuperFine - A tool for fast and accurate supertree estimation: Features and limitations , 2017, Future Gener. Comput. Syst..

[306]  Tandy J. Warnow,et al.  ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes , 2015, Bioinform..

[307]  Tao Jiang,et al.  On the Complexity of Comparing Evolutionary Trees , 1996, Discret. Appl. Math..

[308]  Kevin Atteson,et al.  The Performance of Neighbor-Joining Methods of Phylogenetic Reconstruction , 1999, Algorithmica.

[309]  M. Holder,et al.  Phylogeny estimation: traditional and Bayesian approaches , 2003, Nature Reviews Genetics.

[310]  Vincent Moulton,et al.  TriLoNet: Piecing Together Small Networks to Reconstruct Reticulate Evolutionary Histories. , 2016, Molecular biology and evolution.

[311]  Ramón Doallo,et al.  ProtTest 3: fast selection of best-fit models of protein evolution , 2011, Bioinform..

[312]  Takeya Shigezumi,et al.  Robustness of Greedy Type Minimum Evolution Algorithms , 2006, International Conference on Computational Science.

[313]  L. Kubatko,et al.  Inconsistency of phylogenetic estimates from concatenated data under coalescence. , 2007, Systematic biology.

[314]  Tandy J. Warnow,et al.  The Accuracy of Fast Phylogenetic Methods for Large Datasets , 2001, Pacific Symposium on Biocomputing.

[315]  Olivier Gascuel,et al.  FastME 2.0: A Comprehensive, Accurate, and Fast Distance-Based Phylogeny Inference Program , 2015, Molecular biology and evolution.

[316]  M. Nei,et al.  Molecular Evolution and Phylogenetics , 2000 .

[317]  Hervé Philippe,et al.  Evaluation of the models handling heterotachy in phylogenetic inference , 2007, BMC Evolutionary Biology.

[318]  W. Maddison Gene Trees in Species Trees , 1997 .

[319]  Alfred V. Aho,et al.  Inferring a Tree from Lowest Common Ancestors with an Application to the Optimization of Relational Expressions , 1981, SIAM J. Comput..

[320]  T. M. Nye Trees of trees: an approach to comparing multiple alternative phylogenies. , 2008, Systematic biology.

[321]  Tandy J. Warnow,et al.  Finding a Maximum Compatible Tree for a Bounded Number of Trees with Bounded Degree Is Solvable in Polynomial Time , 2001, WABI.

[322]  M. Rosenberg,et al.  Alignment and topological accuracy of the direct optimization approach via POY and traditional phylogenetics via ClustalW + PAUP*. , 2007, Systematic biology.

[323]  Lusheng Wang,et al.  Improved Approximation Algorithms for Tree Alignment , 1996, J. Algorithms.

[324]  Guillaume Fertin,et al.  Sorting by Transpositions Is Difficult , 2010, SIAM J. Discret. Math..

[325]  J. Felsenstein,et al.  A Hidden Markov Model approach to variation among sites in rate of evolution. , 1996, Molecular biology and evolution.

[326]  J. Hartigan MINIMUM MUTATION FITS TO A GIVEN TREE , 1973 .

[327]  Reed A. Cartwright,et al.  Logarithmic gap costs decrease alignment accuracy , 2006, BMC Bioinformatics.

[328]  B. Rannala,et al.  Taxon sampling and the accuracy of large phylogenies. , 1998, Systematic biology.

[329]  Elchanan Mossel,et al.  Distance-based Species Tree Estimation: Information-Theoretic Trade-off between Number of Loci and Sequence Length under the Coalescent , 2015, APPROX-RANDOM.

[330]  D. Bryant Building trees, hunting for trees, and comparing trees : theory and methods in phylogenetic analysis , 1997 .

[331]  Robert E. Tarjan,et al.  Algorithmic Aspects of Vertex Elimination on Graphs , 1976, SIAM J. Comput..

[332]  Charles Semple,et al.  Determining phylogenetic networks from inter-taxa distances , 2016, Journal of mathematical biology.

[333]  Michael Kaufmann,et al.  DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment , 2008, Algorithms for Molecular Biology.

[334]  Noah A Rosenberg,et al.  The probability of topological concordance of gene trees and species trees. , 2002, Theoretical population biology.

[335]  Tandy J. Warnow,et al.  Reconstructing Reticulate Evolution in SpeciesTheory and Practice , 2005, J. Comput. Biol..

[336]  Kimmen Sjölander,et al.  A comparison of scoring functions for protein sequence profile alignment , 2004, Bioinform..

[337]  Ruriko Yoshida,et al.  Optimality of the Neighbor Joining Algorithm and Faces of the Balanced Minimum Evolution Polytope , 2010, Bulletin of mathematical biology.

[338]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[339]  David Bryant,et al.  Properties of consensus methods for inferring species trees from gene trees. , 2008, Systematic biology.

[340]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[341]  Benjamin D. Redelings,et al.  BAli-Phy: simultaneous Bayesian inference of alignment and phylogeny , 2006, Bioinform..

[342]  Fred R. McMorris,et al.  Triangulating vertex colored graphs , 1994, SODA '93.

[343]  B. Boussau,et al.  Efficient Exploration of the Space of Reconciled Gene Trees , 2013, Systematic biology.

[344]  Colin N. Dewey,et al.  Aligning multiple whole genomes with Mercator and MAVID. , 2007, Methods in molecular biology.

[345]  Joseph T. Chang,et al.  Full reconstruction of Markov models on evolutionary trees: identifiability and consistency. , 1996, Mathematical biosciences.

[346]  Todd Wareham,et al.  A Simplified Proof of the NP- and MAX SNP-Hardness of Multiple Sequence Tree Alignment , 1995, J. Comput. Biol..

[347]  A. Lemmon,et al.  High-Throughput Genomic Data in Systematics and Phylogenetics , 2013 .

[348]  M. Steel Recovering a tree from the leaf colourations it generates under a Markov model , 1994 .

[349]  M. Nei,et al.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. , 1993, Molecular biology and evolution.

[350]  Mike Steel,et al.  Maximum likelihood supertrees. , 2007, Systematic biology.

[351]  Tao Jiang,et al.  A Polynomial Time Approximation Scheme for Inferring Evolutionary Trees from Quartet Topologies and Its Application , 2001, SIAM J. Comput..

[352]  Tandy J. Warnow,et al.  The Effect of the Guide Tree on Multiple Sequence Alignments and Subsequent Phylogenetic Analysis , 2008, Pacific Symposium on Biocomputing.

[353]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[354]  Kazutaka Katoh,et al.  Application of the MAFFT sequence alignment program to large data—reexamination of the usefulness of chained guide trees , 2016, Bioinform..

[355]  D. Rose Triangulated graphs and the elimination process , 1970 .

[356]  Vincent Berry,et al.  Models, algorithms and programs for phylogeny reconciliation , 2011, Briefings Bioinform..

[357]  Carsten Kemena,et al.  Using the T-Coffee package to build multiple sequence alignments of protein, RNA, DNA sequences and 3D structures , 2011, Nature Protocols.

[358]  Mark P. Simmons,et al.  Resolution of a concatenation/coalescence kerfuffle: partitioned coalescence support and a robust family‐level tree for Mammalia , 2017, Cladistics : the international journal of the Willi Hennig Society.

[359]  Dan Gusfield,et al.  Efficient algorithms for inferring evolutionary trees , 1991, Networks.

[360]  Vincent Moulton,et al.  The minimum evolution problem is hard: a link between tree inference and graph clustering problems , 2016, Bioinform..

[361]  Arndt von Haeseler,et al.  Shortest triplet clustering: reconstructing large phylogenies using representative sets , 2005, BMC Bioinformatics.

[362]  G. Yule,et al.  A Mathematical Theory of Evolution, Based on the Conclusions of Dr. J. C. Willis, F.R.S. , 1925 .

[363]  F. McMorris,et al.  The median procedure for n-trees , 1986 .

[364]  D. Morrison Multiple sequence alignment for phylogenetic purposes , 2006 .

[365]  T. Warnow,et al.  Unidentifiable divergence times in rates-across-sites models , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[366]  W. H. Day Computational complexity of inferring phylogenies from dissimilarity matrices. , 1987, Bulletin of mathematical biology.

[367]  Burkhard Morgenstern,et al.  DIALIGN: finding local similarities by multiple sequence alignment , 1998, Bioinform..

[368]  D Gusfield,et al.  Efficient methods for multiple sequence alignment with guaranteed error bounds , 1993, Bulletin of mathematical biology.

[369]  David Posada,et al.  MODELTEST: testing the model of DNA substitution , 1998, Bioinform..

[370]  A. Halpern,et al.  Weighted neighbor joining: a likelihood-based approach to distance-based phylogeny reconstruction. , 2000, Molecular biology and evolution.

[371]  John P. Huelsenbeck,et al.  MrBayes 3: Bayesian phylogenetic inference under mixed models , 2003, Bioinform..

[372]  D. Hillis Inferring complex phytogenies , 1996, Nature.

[373]  Derrick J. Zwickl,et al.  Increased taxon sampling is advantageous for phylogenetic inference. , 2002, Systematic biology.

[374]  M Steel,et al.  Properties of phylogenetic trees generated by Yule-type speciation models. , 2001, Mathematical biosciences.

[375]  Simon Whelan,et al.  Class of multiple sequence alignment algorithm affects genomic analysis. , 2013, Molecular biology and evolution.

[376]  Laura Salter Kubatko,et al.  STEM: species tree estimation using maximum likelihood for gene trees under coalescence , 2009, Bioinform..

[377]  John D. Kececioglu,et al.  Multiple alignment by aligning alignments , 2007, ISMB/ECCB.

[378]  Linda A. Raubeson,et al.  Chloroplast DNA Evidence on the Ancient Evolutionary Split in Vascular Land Plants , 1992, Science.

[379]  Olivier Gascuel,et al.  Reconstructing evolution : new mathematical and computational advances , 2007 .

[380]  Kevin J. Liu,et al.  RAxML and FastTree: Comparing Two Methods for Large-Scale Maximum Likelihood Phylogeny Estimation , 2011, PloS one.

[381]  Yu Lin,et al.  Rearrangements in Phylogenetic Inference: Compare, Model, or Encode? , 2013, Models and Algorithms for Genome Evolution.

[382]  Tandy J. Warnow,et al.  DACTAL: divide-and-conquer trees (almost) without alignments , 2012, Bioinform..

[383]  Maxime Crochemore,et al.  Proceedings of the 14th annual conference on Combinatorial pattern matching , 2003 .

[384]  Anne Berry,et al.  A simple algorithm to generate the minimal separators and the maximal cliques of a chordal graph , 2011, Inf. Process. Lett..

[385]  D. Posada,et al.  A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction , 2014, Systematic biology.

[386]  G. Sposito,et al.  Graph theory and theoretical physics , 1969 .

[387]  Peter W. Collingridge,et al.  MergeAlign: improving multiple sequence alignment performance by dynamic reconstruction of consensus multiple sequence alignments , 2012, BMC Bioinformatics.

[388]  Jens Lagergren,et al.  Species Tree Inference Using a Mixture Model. , 2015, Molecular biology and evolution.

[389]  Alexandros Stamatakis,et al.  Algorithms, data structures, and numerics for likelihood-based phylogenetic inference of huge trees , 2011, BMC Bioinformatics.

[390]  Satish Rao,et al.  Short Quartet Puzzling: A New Quartet-Based Phylogeny Reconstruction Algorithm , 2008, J. Comput. Biol..

[391]  Albert J. Vilella,et al.  Accurate extension of multiple sequence alignments using a phylogeny-aware graph algorithm , 2012, Bioinform..

[392]  Tandy J. Warnow,et al.  Better Hill-Climbing Searches for Parsimony , 2003, WABI.

[393]  Russell F. Doolittle,et al.  “Homology” in proteins and nucleic acids: A terminology muddle and a way out of it , 1987, Cell.

[394]  Tandy J. Warnow,et al.  Reconstructing Chromosomal Evolution , 2006, SIAM J. Comput..

[395]  Olivier Gascuel,et al.  Concerning the NJ algorithm and its unweighted version, UNJ , 1996, Mathematical Hierarchies and Biology.

[396]  Alessandra Carbone,et al.  Improvement in Protein Domain Identification Is Reached by Breaking Consensus, with the Agreement of Many Profiles and Domain Co-occurrence , 2016, PLoS Comput. Biol..

[397]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[398]  Yun Yu,et al.  A maximum pseudo-likelihood approach for phylogenetic networks , 2015, BMC Genomics.

[399]  Serita M. Nelesen,et al.  SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees. , 2012, Systematic biology.

[400]  B. Faircloth,et al.  Tectonic collision and uplift of Wallacea triggered the global songbird radiation , 2016, Nature Communications.

[401]  Robert C. Edgar,et al.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity , 2004, BMC Bioinformatics.

[402]  Sean R Eddy,et al.  A new generation of homology search tools based on probabilistic inference. , 2009, Genome informatics. International Conference on Genome Informatics.

[403]  John W. Tukey,et al.  Exploratory data analysis , 1977, Addison-Wesley series in behavioral science : quantitative methods.

[404]  Osamu Gotoh,et al.  Further improvement in methods of group-to-group sequence alignment with generalized profile operations , 1994, Comput. Appl. Biosci..

[405]  A. D. Gordon Consensus supertrees: The synthesis of rooted trees containing overlapping sets of labeled leaves , 1986 .

[406]  Seán G. Brady,et al.  Phylogenomic Insights into the Evolution of Stinging Wasps and the Origins of Ants and Bees , 2017, Current Biology.

[407]  Hanns-Georg Leimer,et al.  Optimal decomposition by clique separators , 1993, Discret. Math..

[408]  Derrick J. Zwickl,et al.  Increased taxon sampling greatly reduces phylogenetic error. , 2002, Systematic biology.

[409]  W. Ewens Mathematical Population Genetics : I. Theoretical Introduction , 2004 .

[410]  F. McMorris,et al.  Mathematical Hierarchies and Biology , 1997 .

[411]  L. Muggia,et al.  Resolving evolutionary relationships in lichen-forming fungi using diverse phylogenomic datasets and analytical approaches , 2016, Scientific Reports.

[412]  Z. Yang,et al.  A space-time process model for the evolution of DNA sequences. , 1995, Genetics.

[413]  Benjamin J. Raphael,et al.  A novel method for multiple alignment of sequences with repeated and shuffled elements. , 2004, Genome research.

[414]  D. Haussler,et al.  Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[415]  Tandy J. Warnow,et al.  Performance study of phylogenetic methods: (unweighted) quartet methods and neighbor-joining , 2001, SODA '01.

[416]  M. Wilkinson More on Reduced Consensus Methods , 1995 .

[417]  Ron Shamir,et al.  The median problems for breakpoints are NP-complete , 1998, Electron. Colloquium Comput. Complex..

[418]  Alexandros Stamatakis,et al.  Understanding Angiosperm Diversification Using Small and Large Phylogenetic Trees 1 , 2022 .

[419]  István Miklós,et al.  Bayesian coestimation of phylogeny and sequence alignment , 2005, BMC Bioinformatics.

[420]  J. Wakeley Coalescent Theory: An Introduction , 2008 .

[421]  David Sankoff,et al.  A tree · a window · a hill; generalization of nearest-neighbor interchange in phylogenetic optimization , 1994 .

[422]  Sean R. Eddy,et al.  Infernal 1.0: inference of RNA alignments , 2009, Bioinform..

[423]  Tandy J. Warnow,et al.  Inferring Optimal Species Trees Under Gene Duplication and Loss , 2013, Pacific Symposium on Biocomputing.

[424]  Tandy Warnow,et al.  ASTRID: Accurate Species TRees from Internode Distances , 2015 .

[425]  Tandy Warnow,et al.  BBCA: Improving the scalability of *BEAST using random binning , 2014, BMC Genomics.

[426]  P. Waddell,et al.  Rapid Evaluation of Least-Squares and Minimum-Evolution Criteria on Phylogenetic Trees , 1998 .

[427]  Tao Jiang,et al.  On the Complexity of Multiple Sequence Alignment , 1994, J. Comput. Biol..

[428]  Mike Steel,et al.  The Maximum Likelihood Point for a Phylogenetic Tree is Not Unique , 1994 .

[429]  David M. Raup,et al.  Mathematical models of cladogenesis , 1985, Paleobiology.

[430]  P. Buneman A Note on the Metric Properties of Trees , 1974 .

[431]  J. Felsenstein CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP , 1985, Evolution; international journal of organic evolution.

[432]  A. Kupczok Split-based computation of majority-rule supertrees , 2011, BMC Evolutionary Biology.

[433]  Lior Pachter,et al.  Fast Statistical Alignment , 2009, PLoS Comput. Biol..

[434]  J. Plotkin,et al.  Inferring the Dynamics of Diversification: A Coalescent Approach , 2010, PLoS biology.

[435]  Eric P. Nawrocki,et al.  Structural rna homology search and alignment using covariance models , 2009 .

[436]  Jijun Tang,et al.  Scaling up accurate phylogenetic reconstruction from gene-order data , 2003, ISMB.

[437]  David Sankoff,et al.  Exact and approximation algorithms for sorting by reversals, with application to genome rearrangement , 1995, Algorithmica.

[438]  Tao Jiang,et al.  A More Efficient Approximation Scheme for Tree Alignment , 2000, SIAM J. Comput..

[439]  Yun Yu,et al.  In the light of deep coalescence: revisiting trees within networks , 2016, BMC Bioinformatics.

[440]  Tandy J. Warnow,et al.  Rec-I-DCM3: A Fast Algorithmic Technique for Reconstructing Large Phylogenetic Trees , 2004, IEEE Computer Society Computational Systems Bioinformatics Conference.

[441]  E. Pennisi Shaking up the Tree of Life. , 2016, Science.

[442]  Feng Lin,et al.  Reconstruction of large phylogenetic trees: A parallel approach , 2005, Comput. Biol. Chem..

[443]  Tao Jiang,et al.  Recovering branches on the tree of life: an approximation algorithm , 1999, SODA '99.

[444]  Jeremy M. Brown,et al.  TreeScaper: Visualizing and Extracting Phylogenetic Signal from Sets of Trees. , 2016, Molecular biology and evolution.

[445]  Bin Ma,et al.  A new quartet approach for reconstructing phylogenetic trees: quartet joining method , 2007, J. Comb. Optim..

[446]  David Fernández-Baca,et al.  Simple Algorithms for Perfect Phylogeny and Triangulating Colored Graphs , 1996, Int. J. Found. Comput. Sci..

[447]  Masato Ishikawa,et al.  Comprehensive study on iterative algorithms of multiple sequence alignment , 1995, Comput. Appl. Biosci..

[448]  MARTIN VINGRON,et al.  Towards Integration of Multiple Alignment and Phylogenetic Tree Construction , 1997, J. Comput. Biol..

[449]  C. Kiefer,et al.  Supernetwork identifies multiple events of plastid trnF(GAA) pseudogene evolution in the Brassicaceae. , 2007, Molecular biology and evolution.

[450]  A. Phillimore,et al.  Density-Dependent Cladogenesis in Birds , 2008, PLoS biology.

[451]  Magnus Bordewich,et al.  Accuracy Guarantees for Phylogeny Reconstruction Algorithms Based on Balanced Minimum Evolution , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[452]  John Gatesy,et al.  The gene tree delusion. , 2016, Molecular phylogenetics and evolution.

[453]  Jack Sullivan,et al.  Model Selection in Phylogenetics , 2005 .

[454]  Wen-Hsiung Li,et al.  The molecular clock runs more slowly in man than in apes and monkeys , 1987, Nature.

[455]  J. Huelsenbeck,et al.  SUCCESS OF PHYLOGENETIC METHODS IN THE FOUR-TAXON CASE , 1993 .

[456]  A. D. Gordon,et al.  Obtaining common pruned trees , 1985 .

[457]  Tandy Warnow,et al.  On the Robustness to Gene Tree Estimation Error (or lack thereof) of Coalescent-Based Species Tree Methods. , 2015, Systematic biology.

[458]  David Haussler,et al.  Alignathon: a competitive assessment of whole-genome alignment methods , 2014, bioRxiv.

[459]  E. Virginia Armbrust,et al.  pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree , 2010, BMC Bioinformatics.

[460]  Ignacio Rojas,et al.  Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques , 2012, Nucleic acids research.

[461]  Loren H. Rieseberg,et al.  Hybrid Origins of Plant Species , 1997 .

[462]  Olivier Gascuel,et al.  Fast and Accurate Phylogeny Reconstruction Algorithms Based on the Minimum-Evolution Principle , 2002, J. Comput. Biol..

[463]  David Posada,et al.  Multilocus inference of species trees and DNA barcoding , 2016, Philosophical Transactions of the Royal Society B: Biological Sciences.

[464]  Nir Ailon,et al.  Fitting tree metrics: Hierarchical clustering and phylogeny , 2005, 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS'05).

[465]  Kimmen Sjölander,et al.  Ortholog identification in the presence of domain architecture rearrangement , 2011, Briefings Bioinform..

[466]  Oliver Eulenstein,et al.  Genome-scale phylogenetics: inferring the plant tree of life from 18,896 gene trees. , 2011, Systematic biology.

[467]  Arne Ø. Mooers,et al.  Inferring Evolutionary Process from Phylogenetic Tree Shape , 1997, The Quarterly Review of Biology.

[468]  H. Bandelt,et al.  Median networks: speedy construction and greedy reduction, one simulation, and two case studies from human mtDNA. , 2000, Molecular phylogenetics and evolution.

[469]  L. Knowles,et al.  Unforeseen Consequences of Excluding Missing Data from Next-Generation Sequences: Simulation Study of RAD Sequences. , 2016, Systematic biology.

[470]  Serita M. Nelesen,et al.  Rapid and Accurate Large-Scale Coestimation of Sequence Alignments and Phylogenetic Trees , 2009, Science.

[471]  Nina Amenta,et al.  A Linear-Time Majority Tree Algorithm , 2003, WABI.

[472]  C. Guyer,et al.  COMPARISONS OF OBSERVED PHYLOGENETIC TOPOLOGIES WITH NULL EXPECTATIONS AMONG THREE MONOPHYLETIC LINEAGES , 1991, Evolution; international journal of organic evolution.

[473]  Nan Yu,et al.  The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs , 2002, BMC Bioinformatics.

[474]  L. Devroye,et al.  No Empirical Probability Measure can Converge in the Total Variation Sense for all Distributions , 1990 .

[475]  Travis C Glenn,et al.  Avoiding Missing Data Biases in Phylogenomic Inference: An Empirical Study in the Landfowl (Aves: Galliformes). , 2016, Molecular biology and evolution.

[476]  C. S. Wallace,et al.  Finite-state models in the alignment of macromolecules , 1992, Journal of Molecular Evolution.

[477]  Derrick J. Zwickl,et al.  Is sparse taxon sampling a problem for phylogenetic inference? , 2003, Systematic biology.

[478]  Constantinos Daskalakis,et al.  Alignment-Free Phylogenetic Reconstruction , 2010, RECOMB.

[479]  H. Philippe,et al.  Heterotachy, an important process of protein evolution. , 2002, Molecular biology and evolution.

[480]  Tanja Stadler,et al.  A polynomial time algorithm for calculating the probability of a ranked gene tree given a species tree , 2012, Algorithms for Molecular Biology.

[481]  Michael DeGiorgio,et al.  Robustness to divergence time underestimation when inferring species trees from estimated gene trees. , 2014, Systematic biology.

[482]  Robert R. Sokal,et al.  A statistical method for evaluating systematic relationships , 1958 .

[483]  S. Wright Evolution in mendelian populations , 1931 .

[484]  Dirk Metzler,et al.  Statistical alignment based on fragment insertion and deletion models , 2003, Bioinform..

[485]  David Zuckerman,et al.  Electronic Colloquium on Computational Complexity, Report No. 100 (2005) Linear Degree Extractors and the Inapproximability of MAX CLIQUE and CHROMATIC NUMBER , 2005 .

[486]  Seán G. Brady,et al.  Phylogenomic Analysis of Ants, Bees and Stinging Wasps: Improved Taxon Sampling Enhances Understanding of Hymenopteran Evolution , 2016, bioRxiv.

[487]  Roderic D. M. Page,et al.  Modified Mincut Supertrees , 2002, WABI.

[488]  Mike A. Steel,et al.  Constructing Optimal Trees from Quartets , 2001, J. Algorithms.

[489]  Tao Jiang,et al.  Approximation algorithms for tree alignment with a given phylogeny , 1996, Algorithmica.

[490]  Carolin Kosiol,et al.  Phylogenetics by likelihood: Evolutionary modeling as a tool for understanding the genome , 2006, J. Biomed. Informatics.

[491]  A. Dress,et al.  Split decomposition: a new and useful approach to phylogenetic analysis of distance data. , 1992, Molecular phylogenetics and evolution.

[492]  Jun Kawai,et al.  Heterotachy in Mammalian Promoter Evolution , 2006, PLoS genetics.

[493]  L. Kubatko Identifying hybridization events in the presence of coalescence via model selection. , 2009, Systematic biology.

[494]  Matthew W. Hahn,et al.  Irrational exuberance for resolved species trees , 2016, Evolution; international journal of organic evolution.

[495]  J. A. Studier,et al.  A note on the neighbor-joining algorithm of Saitou and Nei. , 1988, Molecular biology and evolution.

[496]  James H. Degnan,et al.  Species Tree Inference from Gene Splits by Unrooted STAR Methods , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[497]  S. Gupta,et al.  Statistical decision theory and related topics IV , 1988 .

[498]  Ian Holmes,et al.  An empirical codon model for protein sequence evolution. , 2007, Molecular biology and evolution.

[499]  Nicola De Maio,et al.  PoMo: An Allele Frequency-Based Approach for Species Tree Estimation , 2015, bioRxiv.

[500]  Sagi Snir,et al.  Weighted quartets phylogenetics. , 2015, Systematic biology.

[501]  W. J. Quesne,et al.  A Method of Selection of Characters in Numerical Taxonomy , 1969 .

[502]  Siavash Mir arabbaygi Novel scalable approaches for multiple sequence alignment and phylogenomic reconstruction , 2015 .

[503]  Scott V Edwards,et al.  A maximum pseudo-likelihood approach for estimating species trees under the coalescent model , 2010, BMC Evolutionary Biology.

[504]  N. El-Mabrouk,et al.  Efficient gene tree correction guided by species and synteny evolution , 2015 .

[505]  S. Whelan,et al.  A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. , 2001, Molecular biology and evolution.

[506]  D. Haussler,et al.  Aligning multiple genomic sequences with the threaded blockset aligner. , 2004, Genome research.

[507]  Satish Rao,et al.  Using Max Cut to Enhance Rooted Trees Consistency , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[508]  J. Nadeau,et al.  Lengths of chromosomal segments conserved since divergence of man and mouse. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[509]  Noah A. Rosenberg,et al.  Improvements to a Class of Distance Matrix Methods for Inferring Species Trees from Gene Trees , 2012, J. Comput. Biol..

[510]  Jack Dongarra,et al.  Proceedings of the International Conference on Computational Science, ICCS 2011 , 2011 .

[511]  Robert D. Nowak,et al.  Data Requirement for Phylogenetic Inference from Multiple Loci: A New Distance Method , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[512]  Luay Nakhleh,et al.  The Probability of a Gene Tree Topology within a Phylogenetic Network with Applications to Hybridization Detection , 2012, PLoS genetics.

[513]  M. Ragan,et al.  Next-generation phylogenomics , 2013, Biology Direct.

[514]  Constantinos Daskalakis,et al.  Species Trees from Gene Trees Despite a High Rate of Lateral Genetic Transfer: A Tight Bound (Extended Abstract) , 2015, SODA.

[515]  B Qian,et al.  Distribution of indel lengths , 2001, Proteins.

[516]  P. Buneman The Recovery of Trees from Measures of Dissimilarity , 1971 .

[517]  S. Jeffery Evolution of Protein Molecules , 1979 .

[518]  Haim Kaplan,et al.  Faster and simpler algorithm for sorting signed permutations by reversals , 1997, SODA '97.

[519]  Yaoqi Zhou,et al.  SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures. , 2005, Bioinformatics.

[520]  François-Joseph Lapointe,et al.  Matrix representations with parsimony or with distances: two sides of the same coin? , 2003, Systematic biology.

[521]  David Sankoff,et al.  Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison , 1983 .

[522]  Yun Yu,et al.  Fast algorithms and heuristics for phylogenomics under ILS and hybridization , 2013, BMC Bioinformatics.

[523]  Joseph T. Chang,et al.  A signal-to-noise analysis of phylogeny estimation by neighbor-joining: Insufficiency of polynomial length sequences. , 2006, Mathematical biosciences.

[524]  H. Kishino,et al.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA , 2005, Journal of Molecular Evolution.

[525]  Andrew Rambaut,et al.  Heterotachy and tree building: a case study with plastids and eubacteria. , 2006, Molecular biology and evolution.

[526]  Pavel A. Pevzner,et al.  Transforming men into mice (polynomial algorithm for genomic distance problem) , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[527]  F. Gavril The intersection graphs of subtrees in tree are exactly the chordal graphs , 1974 .

[528]  Yufeng Wu,et al.  An Algorithm for Constructing Parsimonious Hybridization Networks with Multiple Phylogenetic Trees , 2013, RECOMB.

[529]  Richard Friedberg,et al.  Efficient sorting of genomic permutations by translocation, inversion and block interchange , 2005, Bioinform..

[530]  M. O. Dayhoff,et al.  Atlas of protein sequence and structure , 1965 .

[531]  M. Rosenberg,et al.  Multiple sequence alignment accuracy and phylogenetic inference. , 2006, Systematic biology.

[532]  D. Morrison,et al.  Is the tree of life the best metaphor, model, or heuristic for phylogenetics? , 2014, Systematic biology.

[533]  Laura Salter Kubatko,et al.  Quartet Inference from SNP Data Under the Coalescent Model , 2014, Bioinform..

[534]  A. Purvis,et al.  Building phylogenies: are the big easy? , 1997, Trends in ecology & evolution.

[535]  John D. Kececioglu,et al.  Learning Parameter-Advising Sets for Multiple Sequence Alignment , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[536]  H. Akaike A new look at the statistical model identification , 1974 .

[537]  Md. Shamsuzzoha Bayzid,et al.  Weighted Statistical Binning: Enabling Statistically Consistent Genome-Scale Phylogenetic Analyses , 2014, PloS one.

[538]  Joseph P. Rusinko,et al.  Species tree estimation using Neighbor Joining. , 2016, Journal of theoretical biology.

[539]  Zhiyong Wang,et al.  MRFalign: Protein Homology Detection through Alignment of Markov Random Fields , 2014, PLoS Comput. Biol..

[540]  Gerard Talavera,et al.  Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. , 2007, Systematic biology.

[541]  M. Golumbic Algorithmic graph theory and perfect graphs , 1980 .

[542]  Tandy J. Warnow,et al.  Distance-Based Genome Rearrangement Phylogeny , 2006, Journal of Molecular Evolution.

[543]  Jens Lagergren Combining polynomial running time and fast convergence for the disk-covering method , 2002, J. Comput. Syst. Sci..

[544]  K. Strimmer,et al.  Quartet Puzzling: A Quartet Maximum-Likelihood Method for Reconstructing Tree Topologies , 1996 .

[545]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[546]  Manuel Gil,et al.  Who watches the watchmen? An appraisal of benchmarks for multiple sequence alignment. , 2012, Methods in molecular biology.

[547]  Vincent Moulton,et al.  NeighborNet: An Agglomerative Method for the Construction of Planar Phylogenetic Networks , 2002, WABI.

[548]  W. Ludwig,et al.  SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB , 2007, Nucleic acids research.

[549]  Mark Wilkinson,et al.  Supertree Methods for Building the Tree of Life: Divide-and-Conquer Approaches to Large Phylogenetic Problems , 2006 .

[550]  Daniel H. Huson,et al.  Phylogenetic Networks: Introduction to phylogenetic networks , 2010 .

[551]  J. A. Cavender Taxonomy with confidence , 1978 .

[552]  Ziheng Yang,et al.  Computational Molecular Evolution , 2006 .

[553]  W. Doolittle,et al.  Prokaryotic evolution in light of gene transfer. , 2002, Molecular biology and evolution.

[554]  A. von Haeseler,et al.  Reversible Polymorphism-Aware Phylogenetic Models and their Application to Tree Inference , 2016, bioRxiv.

[555]  Tandy J. Warnow,et al.  Naive binning improves phylogenomic analyses , 2013, Bioinform..

[556]  Tandy J. Warnow,et al.  An experimental study of Quartets MaxCut and other supertree methods , 2010, Algorithms for Molecular Biology.

[557]  H. Munro,et al.  Mammalian protein metabolism , 1964 .

[558]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[559]  Kimmen Sjölander,et al.  Phylogenomic inference of protein molecular function: advances and challenges , 2004, Bioinform..

[560]  J. Felsenstein An alternating least squares approach to inferring phylogenies from pairwise distances. , 1997, Systematic biology.

[561]  Lior Pachter,et al.  Why Neighbor-Joining Works , 2006, Algorithmica.

[562]  Qixin He,et al.  Sources of error inherent in species-tree estimation: impact of mutational and coalescent effects on accuracy and implications for choosing among different methods. , 2010, Systematic biology.

[563]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[564]  Vincent Moulton,et al.  Proceedings of the SMBE Tri-National Young Investigators' Workshop 2005. Improved consensus network techniques for genome-scale phylogeny. , 2006, Molecular biology and evolution.

[565]  P. Argos,et al.  Motif recognition and alignment for many sequences by comparison of dot-matrices. , 1991, Journal of molecular biology.

[566]  Stephen J. Willson,et al.  Tree-average distances on certain phylogenetic networks have their weights uniquely determined , 2012, Algorithms for Molecular Biology.

[567]  Luay Nakhleh,et al.  RIATA-HGT: A Fast and Accurate Heuristic for Reconstructing Horizontal Gene Transfer , 2005, COCOON.

[568]  S. J. Willson,et al.  Constructing rooted supertrees using distances , 2004, Bulletin of mathematical biology.

[569]  Kazutaka Katoh,et al.  PartTree: an algorithm to build an approximate tree from a large number of unaligned sequences , 2007, Bioinform..

[570]  A. Mooers Effects of tree shape on the accuracy of maximum likelihood-based ancestor reconstructions. , 2004, Systematic biology.

[571]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[572]  J. Townsend,et al.  Horizontal gene transfer, genome innovation and evolution , 2005, Nature Reviews Microbiology.

[573]  Andrew F. Neuwald,et al.  Rapid detection, classification and accurate alignment of up to a million or more related protein sequences , 2009, Bioinform..

[574]  Peter J. Munson,et al.  A novel randomized iterative strategy for aligning multiple protein sequences , 1991, Comput. Appl. Biosci..

[575]  Olivier Gascuel,et al.  Fast NJ-like algorithms to deal with incomplete distance matrices , 2008, BMC Bioinformatics.

[576]  Tandy J. Warnow,et al.  Approximating the Complement of the Maximum Compatible Subset of Leaves of k Trees , 2002, APPROX.

[577]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[578]  Jimin Pei,et al.  PROMALS: towards accurate multiple sequence alignments of distantly related proteins , 2007, Bioinform..

[579]  Stephen Jay Gould,et al.  Stochastic Models of Phylogeny and the Evolution of Diversity , 1973, The Journal of Geology.

[580]  Martin Vingron,et al.  The deferred path heuristic for the generalized tree alignment problem , 1997, RECOMB '97.

[581]  P. Hogeweg,et al.  The alignment of sets of sequences and the construction of phyletic trees: An integrated method , 2005, Journal of Molecular Evolution.

[582]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[583]  Elizabeth S. Allman,et al.  Identifiability of a Markovian model of molecular evolution with gamma-distributed rates , 2008, Advances in Applied Probability.

[584]  Chuong B. Do,et al.  ProbCons: Probabilistic consistency-based multiple sequence alignment. , 2005, Genome research.

[585]  Yu Lin,et al.  Estimating true evolutionary distances under the DCJ model , 2008, ISMB.

[586]  David Fernández-Baca,et al.  Improved Heuristics for Minimum-Flip Supertree Construction , 2006, Evolutionary bioinformatics online.

[587]  H. Shaffer,et al.  Troubleshooting Molecular Phylogenetic Analyses , 2002 .

[588]  Michael G. Nute,et al.  A comparative study of SVDquartets and other coalescent-based species tree estimation methods , 2015, BMC Genomics.

[589]  Tandy Warnow,et al.  Concatenation Analyses in the Presence of Incomplete Lineage Sorting , 2015, PLoS currents.

[590]  Tal Pupko,et al.  An alignment confidence score capturing robustness to guide tree uncertainty. , 2010, Molecular biology and evolution.

[591]  Tandy J. Warnow,et al.  MRL and SuperFine+MRL: new supertree methods , 2012, Algorithms for Molecular Biology.

[592]  M. Steel,et al.  Likelihood-based tree reconstruction on a concatenation of aligned sequence data sets can be statistically inconsistent. , 2015, Theoretical population biology.

[593]  Tandy J. Warnow,et al.  Absolute convergence: true trees from short sequences , 2001, SODA '01.

[594]  D. Aldous Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today , 2001 .

[595]  M. P. Cummings,et al.  Sampling properties of DNA sequence data in phylogenetic analysis. , 1995, Molecular biology and evolution.

[596]  W. Vach Least squares approximation of addititve trees , 1989 .

[597]  J. Degnan,et al.  Fast and consistent estimation of species trees using supermatrix rooted triples. , 2010, Molecular biology and evolution.

[598]  N. Goldman,et al.  Codon-substitution models for heterogeneous selection pressure at amino acid sites. , 2000, Genetics.

[599]  J. Palmer,et al.  A chloroplast DNA inversion marks an ancient evolutionary split in the sunflower family (Asteraceae). , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[600]  J. Mallet Hybridization as an invasion of the genome. , 2005, Trends in ecology & evolution.

[601]  Bernard M. E. Moret,et al.  On Computing Breakpoint Distances for Genomes with Duplicate Genes , 2017, J. Comput. Biol..

[602]  Allan Sly,et al.  Phase transition in the sample complexity of likelihood-based phylogeny inference , 2015, 1508.01964.

[603]  Tao Jiang,et al.  Quartet Cleaning: Improved Algorithms and Simulations , 1999, ESA.

[604]  Vincent Moulton,et al.  A polynomial time algorithm for constructing the refined Buneman tree , 1999 .

[605]  O. Bininda-Emonds Phylogenetic Supertrees: Combining Information To Reveal The Tree Of Life , 2004 .

[606]  Kiyoshi Asai,et al.  A Classification of Bioinformatics Algorithms from the Viewpoint of Maximizing Expected Accuracy (MEA) , 2012, J. Comput. Biol..

[607]  Laura Kubatko,et al.  Hypothesis tests for phylogenetic quartets, with applications to coalescent-based species tree inference. , 2016, Journal of theoretical biology.

[608]  M. Suchard,et al.  Joint Bayesian estimation of alignment and phylogeny. , 2005, Systematic biology.

[609]  Manolis Kellis,et al.  Unified modeling of gene duplication, loss, and coalescence using a locus tree. , 2012, Genome research.

[610]  Robert D. Finn,et al.  HMMER web server: interactive sequence similarity searching , 2011, Nucleic Acids Res..

[611]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[613]  J. Huelsenbeck,et al.  Hobgoblin of phylogenetics? , 1994, Nature.

[614]  Sampath Kannan,et al.  Computing the local consensus of trees , 1995, SODA '95.

[615]  Md. Shamsuzzoha Bayzid,et al.  Response to Comment on “Statistical binning enables an accurate coalescent-based estimation of the avian tree” , 2015, Science.

[616]  Robert E. Tarjan,et al.  Decomposition by clique separators , 1985, Discret. Math..

[617]  S. Whelan,et al.  Phylogenetic Tree Estimation With and Without Alignment: New Distance Methods and Benchmarking , 2016, Systematic biology.

[618]  P. Erdös,et al.  A few logs suffice to build (almost) all trees (l): part I , 1997 .

[619]  Junhyong Kim,et al.  GENERAL INCONSISTENCY CONDITIONS FOR MAXIMUM PARSIMONY: EFFECTS OF BRANCH LENGTHS AND INCREASING NUMBERS OF TAXA , 1996 .

[620]  Liisa Holm,et al.  COFFEE: an objective function for multiple sequence alignments , 1998, Bioinform..

[621]  G. Estabrook,et al.  An idealized concept of the true cladistic character , 1975 .

[622]  David A. Morrison,et al.  Molecular homology and multiple-sequence alignment: an analysis of concepts and practice , 2015, Australian Systematic Botany.

[623]  Jim Leebens-Mack,et al.  Identifying the basal angiosperm node in chloroplast genome phylogenies: sampling one's way out of the Felsenstein zone. , 2005, Molecular biology and evolution.

[624]  N. Goldman,et al.  Clustering Genes of Common Evolutionary History , 2015, Molecular biology and evolution.

[625]  Xun Gu,et al.  The size distribution of insertions and deletions in human and rodent pseudogenes suggests the logarithmic gap penalty for sequence alignment , 1995, Journal of Molecular Evolution.

[626]  David Posada,et al.  ProtTest: selection of best-fit models of protein evolution , 2005, Bioinform..

[627]  Yu Lin,et al.  TIBA: a tool for phylogeny inference from rearrangement data with bootstrap analysis , 2012, Bioinform..

[628]  David Fernández-Baca,et al.  MulRF: a software package for phylogenetic analysis using multi-copy gene trees , 2015, Bioinform..

[629]  Dr. Susumu Ohno Evolution by Gene Duplication , 1970, Springer Berlin Heidelberg.

[630]  S. Holmes,et al.  STATISTICAL APPROACH TO TESTS INVOLVING PHYLOGENIES , 2004 .

[631]  Vincent Ranwez,et al.  SuperTriplets: a triplet-based supertree approach to phylogenomics , 2010, Bioinform..

[632]  Cédric Notredame,et al.  3DCoffee: combining protein sequences and structures within multiple sequence alignments. , 2004, Journal of molecular biology.

[633]  Yufeng Wu,et al.  COALESCENT‐BASED SPECIES TREE INFERENCE FROM GENE TREE TOPOLOGIES UNDER INCOMPLETE LINEAGE SORTING BY MAXIMUM LIKELIHOOD , 2012, Evolution; international journal of organic evolution.

[634]  É. Tannier,et al.  The Inference of Gene Trees with Species Trees , 2013, Systematic biology.

[635]  Fanica Gavril,et al.  Algorithms for Minimum Coloring, Maximum Clique, Minimum Covering by Cliques, and Maximum Independent Set of a Chordal Graph , 1972, SIAM J. Comput..

[636]  Ziheng Yang,et al.  The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection. , 2010, Molecular biology and evolution.

[637]  Olivier Gascuel,et al.  SDM: a fast distance-based approach for (super) tree building in phylogenomics. , 2006, Systematic biology.

[638]  O. Gascuel,et al.  Neighbor-joining revealed. , 2006, Molecular biology and evolution.

[639]  Simone Linz,et al.  Identifying a species tree subject to random lateral gene transfer. , 2012, Journal of theoretical biology.

[640]  Johannes Söding,et al.  Protein homology detection by HMM?CHMM comparison , 2005, Bioinform..

[641]  Michael T. Hallett,et al.  New algorithms for the duplication-loss model , 2000, RECOMB '00.

[642]  M. Nei,et al.  Stochastic errors in DNA evolution and molecular phylogeny. , 1986, Progress in clinical and biological research.

[643]  Lior Pachter,et al.  MAVID: constrained ancestral alignment of multiple sequences. , 2003, Genome research.

[644]  Steven A Benner,et al.  Empirical analysis of protein insertions and deletions determining parameters for the correct placement of gaps in protein sequence alignments. , 2004, Journal of molecular biology.

[645]  Liang Liu,et al.  Estimating species trees from unrooted gene trees. , 2011, Systematic biology.

[646]  D. Hillis,et al.  Analysis and visualization of tree space. , 2005, Systematic biology.

[647]  J. Felsenstein,et al.  A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. , 1994, Molecular biology and evolution.

[648]  Saravanaraj N. Ayyampalayam,et al.  Phylotranscriptomic analysis of the origin and early diversification of land plants , 2014, Proceedings of the National Academy of Sciences.

[649]  Raymond E. Miller,et al.  Complexity of Computer Computations , 1972 .

[650]  Edward L. Braun,et al.  Error in Phylogenetic Estimation for Bushes in the Tree of Life , 2013 .

[651]  Marek Karpinski,et al.  New Approximation Algorithms for the Steiner Tree Problems , 1997, J. Comb. Optim..

[652]  J. Cotton,et al.  Supertrees join the mainstream of phylogenetics. , 2009, Trends in ecology & evolution.

[653]  A. Purvis,et al.  Are big trees indeed easy? Reply from A. Purvis and D.L.J. Quicke. , 1997, Trends in ecology & evolution.

[654]  S Kullback,et al.  LETTER TO THE EDITOR: THE KULLBACK-LEIBLER DISTANCE , 1987 .

[655]  Marc A Suchard,et al.  Unifying vertical and nonvertical evolution: a stochastic ARG-based framework. , 2010, Systematic biology.

[656]  John D. Kececioglu,et al.  Aligning Alignments , 1998, CPM.

[657]  Simon Whelan,et al.  Characterizing the phylogenetic tree-search problem. , 2012, Systematic biology.

[658]  Vincent Berry,et al.  Faster reliable phylogenetic analysis , 1999, RECOMB.

[659]  Andrey A. Zharkikh,et al.  Inconsistency of the Maximum-parsimony Method: the Case of Five Taxa With a Molecular Clock , 1993 .

[660]  Tandy Warnow,et al.  Algorithmic techniques for improving the speed and accuracy of phylogenetic methods , 2004 .

[661]  Jonas S. Almeida,et al.  Alignment-free sequence comparison-a review , 2003, Bioinform..

[662]  R. Hudson Gene genealogies and the coalescent process. , 1990 .

[663]  Serita Marie Nelesen Improved methods for phylogenetics , 2009 .

[664]  Junhyong Kim,et al.  The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer , 2005, PLoS biology.

[665]  Christoph Mayer,et al.  Visualizing differences in phylogenetic information content of alignments and distinction of three classes of long-branch effects , 2007, BMC Evolutionary Biology.

[666]  Steven Salzberg,et al.  Mugsy: fast multiple alignment of closely related whole genomes , 2010, Bioinform..

[667]  Sebastian Böcker,et al.  Polynomial Supertree Methods Revisited , 2011, Adv. Bioinformatics.

[668]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .

[669]  Laura Salter Kubatko,et al.  Detecting hybrid speciation in the presence of incomplete lineage sorting using gene tree incongruence: a model. , 2009, Theoretical population biology.

[670]  Dan Gusfield,et al.  ReCombinatorics: The Algorithmics of Ancestral Recombination Graphs and Explicit Phylogenetic Networks , 2014 .

[671]  E. Braun,et al.  Testing hypotheses about the sister group of the passeriformes using an independent 30-locus data set. , 2012, Molecular biology and evolution.

[672]  Junhyong Kim,et al.  A Tree Obscured By Vines: Horizontal Gene Transfer and the Median Tree Method of Estimating Species Phylogeny , 2000, Pacific Symposium on Biocomputing.

[673]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[674]  M. P. Cummings,et al.  PAUP* Phylogenetic analysis using parsimony (*and other methods) Version 4 , 2000 .

[675]  Ward C. Wheeler Phylogenetic network analysis as a parsimony optimization problem , 2015, BMC Bioinformatics.

[676]  Luay Nakhleh,et al.  Bayesian Inference of Reticulate Phylogenies under the Multispecies Network Coalescent , 2016, PLoS genetics.

[677]  Olivier Gascuel,et al.  Inferring evolutionary trees with strong combinatorial evidence , 1997, Theor. Comput. Sci..

[678]  D. Sankoff,et al.  Gene order comparisons for phylogenetic inference: evolution of the mitochondrial genome. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[679]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[680]  Philip Hugenholtz,et al.  NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes , 2006, Nucleic Acids Res..

[681]  Folker Meyer,et al.  Rose: generating sequence families , 1998, Bioinform..

[682]  Tandy Warnow,et al.  Disk covering methods improve phylogenomic analyses , 2014, BMC Genomics.

[683]  Adam D. Leaché,et al.  Phylogenomics of Phrynosomatid Lizards: Conflicting Signals from Sequence Capture versus Restriction Site Associated DNA Sequencing , 2015, Genome biology and evolution.

[684]  O. Gotoh Consistency of optimal sequence alignments. , 1990, Bulletin of Mathematical Biology.

[685]  Alexandros Stamatakis,et al.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models , 2006, Bioinform..

[686]  Katherine St. John,et al.  Review Paper: The Shape of Phylogenetic Treespace , 2016, Systematic biology.

[687]  Daniel G. Brown,et al.  Fast Phylogenetic Tree Reconstruction Using Locality-Sensitive Hashing , 2012, WABI.

[688]  O. Gascuel,et al.  Consistency of Topological Moves Based on the Balanced Minimum Evolution Principle of Phylogenetic Inference , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[689]  David Sankoff,et al.  COMPUTATIONAL COMPLEXITY OF INFERRING PHYLOGENIES BY COMPATIBILITY , 1986 .

[690]  João Luís Sobral,et al.  Parallelizing SuperFine , 2012, SAC '12.

[691]  Amihood Amir,et al.  Maximum Agreement Subtree in a Set of Evolutionary Trees: Metrics and Efficient Algorithms , 1997, SIAM J. Comput..

[692]  Dannie Durand,et al.  Reconciliation with non-binary species trees. , 2008, Journal of computational biology : a journal of computational molecular cell biology.

[693]  François-Joseph Lapointe,et al.  THE AVERAGE CONSENSUS PROCEDURE: COMBINATION OF WEIGHTED TREES CONTAINING IDENTICAL OR OVERLAPPING SETS OF TAXA , 1997 .

[694]  L. Partridge,et al.  Oxford Surveys in Evolutionary Biology , 1991 .

[695]  Tandy J. Warnow,et al.  On contract-and-refine transformations between phylogenetic trees , 2004, SODA '04.

[696]  D. Hillis,et al.  Molecular Systematics, Second Edition , 1996 .

[697]  John D. Kececioglu,et al.  The Maximum Weight Trace Problem in Multiple Sequence Alignment , 1993, CPM.

[698]  R. Lanfear,et al.  Partitionfinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses. , 2012, Molecular biology and evolution.

[699]  Ari Löytynoja,et al.  An algorithm for progressive multiple alignment of sequences with insertions. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[700]  A. George,et al.  Graph theory and sparse matrix computation , 1993 .

[701]  B. Rannala,et al.  Probability distribution of molecular evolutionary trees: A new method of phylogenetic inference , 1996, Journal of Molecular Evolution.

[702]  Alexandros Stamatakis,et al.  Parallel Divide-and-Conquer Phylogeny Reconstruction by Maximum Likelihood , 2005, HPCC.

[703]  Olivier Gascuel,et al.  Combinatorics of distance-based tree inference , 2012, Proceedings of the National Academy of Sciences.

[704]  David A. Bader,et al.  A Linear-Time Algorithm for Computing Inversion Distance between Signed Permutations with an Experimental Study , 2001, J. Comput. Biol..

[705]  M. Bishop,et al.  Maximum likelihood alignment of DNA sequences. , 1986, Journal of molecular biology.

[706]  Michael R. Fellows,et al.  Two Strikes Against Perfect Phylogeny , 1992, ICALP.

[707]  Cedric Chauve,et al.  Models and Algorithms for Genome Evolution , 2013, Computational Biology.

[708]  Duncan P. Brown,et al.  Automated Protein Subfamily Identification and Classification , 2007, PLoS Comput. Biol..

[709]  Jotun Hein,et al.  Simultaneous Bayesian estimation of alignment and phylogeny under a joint model of protein sequence and structure. , 2014, Molecular biology and evolution.

[710]  Mike Steel Consistency of Bayesian inference of resolved phylogenetic trees. , 2013, Journal of theoretical biology.

[711]  Mike Steel,et al.  Convex tree realizations of partitions , 1992 .

[712]  Elchanan Mossel,et al.  Identifiability and inference of non-parametric rates-across-sites models on large-scale phylogenies , 2011, Journal of mathematical biology.

[713]  Bastien Boussau,et al.  Efficient likelihood computations with nonreversible models of evolution. , 2006, Systematic biology.

[714]  Olivier François,et al.  Which random processes describe the tree of life? A large-scale study of phylogenetic tree imbalance. , 2006, Systematic biology.

[715]  W. Thuiller,et al.  Building megaphylogenies for macroecology: taking up the challenge. , 2013, Ecography.

[716]  M. Kreidl,et al.  Note on expected internode distances for gene trees in species trees , 2011 .

[717]  Tandy J. Warnow,et al.  FASTSP: linear time calculation of alignment accuracy , 2011, Bioinform..

[718]  O. Gascuel,et al.  The Minimum-Evolution Distance-Based Approach to Phylogeny Inference , 2005 .

[719]  Andrew E. Torda,et al.  Not assessing the efficiency of multiple sequence alignment programs , 2014, Algorithms for Molecular Biology.

[720]  M. Tristem Molecular Evolution — A Phylogenetic Approach. , 2000, Heredity.