Understanding Angiosperm Diversification Using Small and Large Phylogenetic Trees 1

How will the emerging possibility of inferring ultra-large phylogenies influence our ability to identify shifts in diversification rate? For several large angiosperm clades (Angiospermae, Monocotyledonae, Orchidaceae, Poaceae, Eudicotyledonae, Fabaceae, and Asteraceae), we explore this issue by contrasting two approaches: (1) using small backbone trees with an inferred number of extant species assigned to each terminal clade and (2) using a mega-phylogeny of 55473 seed plant species represented in GenBank. The mega-phylogeny approach assumes that the sample of species in GenBank is at least roughly proportional to the actual species diversity of different lineages, as appears to be the case for many major angiosperm lineages. Using both approaches, we found that diversification rate shifts are not directly associated with the major named clades examined here, with the sole exception of Fabaceae in the GenBank mega-phylogeny. These agreements are encouraging and may support a generality about angiosperm evolution: major shifts in diversification may not be directly associated with major named clades, but rather with clades that are nested not far within these groups. An alternative explanation is that there have been increased extinction rates in early-diverging lineages within these clades. Based on our mega-phylogeny, the shifts in diversification appear to be distributed quite evenly throughout the angiosperms. Mega-phylogenetic studies of diversification hold great promise for revealing new patterns, but we will need to focus more attention on properly specifying null expectation.

[1]  R. Raikow,et al.  Why are there so Many Kinds of Passerine Birds , 1986 .

[2]  P H Harvey,et al.  Tempo and mode of evolution revealed from molecular phylogenies. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[3]  M. Sanderson,et al.  ESTIMATING DIVERSIFICATION RATES: HOW USEFUL ARE DIVERGENCE TIMES? , 2011, Evolution; international journal of organic evolution.

[4]  V. Funk,et al.  The value of sampling anomalous taxa in phylogenetic studies: major clades of the Asteraceae revealed. , 2008, Molecular phylogenetics and evolution.

[5]  Daniel L Rabosky,et al.  EXTINCTION RATES SHOULD NOT BE ESTIMATED FROM MOLECULAR PHYLOGENIES , 2010, Evolution; international journal of organic evolution.

[6]  Fay,et al.  Multigene Analyses of Monocot Relationships , 2006 .

[7]  Pamela S Soltis,et al.  Darwin's abominable mystery: Insights from a supertree of the angiosperms , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[8]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[9]  Jerrold I. Davis,et al.  MULTIGENE ANALYSES OF MONOCOT RELATIONSHIPS : A SUMMARY , 2006 .

[10]  S. Nee,et al.  INFERRING SPECIATION RATES FROM PHYLOGENIES , 2001, Evolution; international journal of organic evolution.

[11]  Bin Wang,et al.  The deepest divergences in land plants inferred from phylogenomic evidence , 2006, Proceedings of the National Academy of Sciences.

[12]  R. Thorne How many species of seed plants are there , 2001 .

[13]  The American Journal of Botany , 1914, Science.

[14]  M. Donoghue,et al.  Mega-phylogeny approach for comparative biology: an alternative to supertree and supermatrix approaches , 2009, BMC Evolutionary Biology.

[15]  M. Donoghue,et al.  Shifts in Diversification Rate with the Origin of Angiosperms , 1994, Science.

[16]  Robert C Thomson,et al.  Rapid progress on the vertebrate tree of life , 2010, BMC Biology.

[17]  M. Sanderson,et al.  ABSOLUTE DIVERSIFICATION RATES IN ANGIOSPERM CLADES , 2001, Evolution; international journal of organic evolution.

[18]  Arne Ø. Mooers,et al.  Inferring Evolutionary Process from Phylogenetic Tree Shape , 1997, The Quarterly Review of Biology.

[19]  M. Donoghue,et al.  Correlates of Diversification in the Plant Clade Dipsacales: Geographic Movement and Evolutionary Innovations , 2007, The American Naturalist.

[20]  Kazutaka Katoh,et al.  Recent developments in the MAFFT multiple sequence alignment program , 2008, Briefings Bioinform..

[21]  M. Donoghue,et al.  A Bayesian approach for evaluating the impact of historical events on rates of diversification , 2009, Proceedings of the National Academy of Sciences.

[22]  M. Martindale,et al.  Assessing the root of bilaterian animals with scalable phylogenomic methods , 2009, Proceedings of the Royal Society B: Biological Sciences.

[23]  Andy Purvis,et al.  Evaluating phylogenetic tree shape: two modifications to Fusco & Cronk's method. , 2002, Journal of theoretical biology.

[24]  Karl J Niklas,et al.  Darwin's second 'abominable mystery': Why are there so many angiosperm species? , 2009, American journal of botany.

[25]  Pamela S Soltis,et al.  The ABC model and its applicability to basal angiosperms. , 2007, Annals of botany.

[26]  Brian R. Moore,et al.  SYMMETREE: whole-tree analysis of differential diversification rates , 2005, Bioinform..

[27]  Jerrold I. Davis,et al.  A Phylogeny of the Monocots, as Inferred from rbcL and atpA Sequence Variation, and a Comparison of Methods for Calculating Jackknife and Bootstrap Values , 2004 .

[28]  F. Darwin More Letters of Charles Darwin , 1903 .

[29]  K. Cameron A Comparison and Combination of Plastid atpB and rbcL Gene Sequences for Inferring Phylogenetic Relationships within Orchidaceae , 2006 .

[30]  Stefanie Hartmann,et al.  Using ESTs for phylogenomics: Can one accurately infer a phylogenetic tree from a gappy alignment? , 2008, BMC Evolutionary Biology.

[31]  J. Wendel,et al.  Ribosomal ITS sequences and plant phylogenetic inference. , 2003, Molecular phylogenetics and evolution.

[32]  K. Hilu,et al.  Phylogeny of basal eudicots: Insights from non-coding and rapidly evolving DNA , 2007 .

[33]  R. Jansen,et al.  Everywhere but Antarctica: using a supertree to understand the diversity and distribution of the Compositae. , 2005 .

[34]  M. A. Bello,et al.  Elusive Relationships Within Order Fabales: Phylogenetic Analyses Using matK and rbcL Sequence Data1 , 2009 .

[35]  Robert W. Scotland,et al.  How many species of seed plants are there , 2003 .

[36]  I. Lovette,et al.  Density-dependent diversification in North American wood warblers , 2008, Proceedings of the Royal Society B: Biological Sciences.

[37]  N. Pierce,et al.  Dating the origin of the Orchidaceae from a fossil orchid with its pollinator , 2007, Nature.

[38]  Michael T. Clegg,et al.  Relative rates of nucleotide substitution at the rbcl locus of monocotyledonous plants , 1992, Journal of Molecular Evolution.

[39]  M. Sanderson,et al.  A phylogeny of legumes (Leguminosae) based on analysis of the plastid matK gene resolves many well-supported subclades within the family. , 2004, American journal of botany.

[40]  J. Rougemont,et al.  A rapid bootstrap algorithm for the RAxML Web servers. , 2008, Systematic biology.

[41]  M. Donoghue,et al.  Phylogenies and angiosperm diversification , 1993, Paleobiology.

[42]  M. Donoghue,et al.  An uncorrelated relaxed-clock analysis suggests an earlier origin for flowering plants , 2010, Proceedings of the National Academy of Sciences.

[43]  W. Friedman The meaning of Darwin's 'abominable mystery'. , 2009, American journal of botany.

[44]  M. Sanderson,et al.  Phylogenetic supermatrix analysis of GenBank sequences from 2228 papilionoid legumes. , 2006, Systematic biology.

[45]  David C. Tank,et al.  An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: , 2009 .

[46]  V. Funk,et al.  Toward a phylogenetic subfamilial classification for the Compositae (Asteraceae) , 2002 .

[47]  O. Bininda-Emonds Phylogenetic Supertrees: Combining Information To Reveal The Tree Of Life , 2004 .

[48]  J. Wiens Can incomplete taxa rescue phylogenetic analyses from long-branch attraction? , 2005, Systematic biology.

[49]  L. Harmon,et al.  Did genome duplication drive the origin of teleosts? A comparative study of diversification in ray-finned fishes , 2009, BMC Evolutionary Biology.

[50]  A. Cooper,et al.  Evolutionary explosions and the phylogenetic fuse. , 1998, Trends in ecology & evolution.

[51]  Mike Steel,et al.  Phylogenomics with incomplete taxon coverage: the limits to inference , 2010, BMC Evolutionary Biology.

[52]  J. Farris,et al.  Phylogenetic analysis of 73 060 taxa corroborates major eukaryotic groups , 2009, Cladistics : the international journal of the Willi Hennig Society.

[53]  J. Wiens,et al.  Missing data, incomplete taxa, and phylogenetic accuracy. , 2003, Systematic biology.

[54]  C. Guyer,et al.  ADAPTIVE RADIATION AND THE TOPOLOGY OF LARGE PHYLOGENIES , 1993, Evolution; international journal of organic evolution.

[55]  Alexandros Stamatakis,et al.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models , 2006, Bioinform..

[56]  Andy Purvis,et al.  Power of eight tree shape statistics to detect nonrandom diversification: a comparison by simulation of two models of cladogenesis. , 2002, Systematic biology.

[57]  S. Magallón Using fossils to break long branches in molecular dating: a comparison of relaxed clocks applied to the origin of angiosperms. , 2010, Systematic biology.

[58]  M. Donoghue,et al.  Rates of Molecular Evolution Are Linked to Life History in Flowering Plants , 2008, Science.

[59]  Pamela S Soltis,et al.  Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms , 2007, Proceedings of the National Academy of Sciences.

[60]  F. Forest,et al.  Pollen morphology of the family Polygalaceae (Fabales) , 2008 .

[61]  Chad D. Brock,et al.  Nine exceptional radiations plus high turnover explain species diversity in jawed vertebrates , 2009, Proceedings of the National Academy of Sciences.

[62]  P. Goloboff Analyzing Large Data Sets in Reasonable Times: Solutions for Composite Optima , 1999, Cladistics : the international journal of the Willi Hennig Society.

[63]  Joseph B. Slowinski,et al.  Testing the Stochasticity of Patterns of Organismal Diversity: An Improved Null Model , 1989, The American Naturalist.

[64]  P. K. Endress,et al.  Gynoecium Structure and Evolution in Basal Angiosperms , 2000, International Journal of Plant Sciences.

[65]  Stephen A. Smith,et al.  Phylogenetic analyses reveal the shady history of C4 grasses , 2010, Proceedings of the National Academy of Sciences.

[66]  Steven Maere,et al.  Genome duplication and the origin of angiosperms. , 2005, Trends in ecology & evolution.

[67]  E. Holman Nodes in phylogenetic trees: the relation between imbalance and number of descendent species. , 2005, Systematic biology.

[68]  K. Chan,et al.  Whole-tree methods for detecting differential diversification rates. , 2002, Systematic biology.

[69]  Jim Leebens-Mack,et al.  Identifying the basal angiosperm node in chloroplast genome phylogenies: sampling one's way out of the Felsenstein zone. , 2005, Molecular biology and evolution.

[70]  James Leebens-Mack,et al.  Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns , 2007, Proceedings of the National Academy of Sciences.

[71]  Derrick J. Zwickl Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion , 2006 .

[72]  Jeremy M. Brown,et al.  The Effect of Ambiguous Data on Phylogenetic Estimates Obtained by Maximum Likelihood and Bayesian Inference , 2009, Systematic biology.

[73]  P. Herendeen,et al.  Phylogenetic patterns and diversification in the caesalpinioid legumes , 2008 .

[74]  Boris Igić,et al.  Species Selection Maintains Self-Incompatibility , 2010, Science.

[75]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[76]  M. Donoghue Key innovations, convergence, and success: macroevolutionary lessons from plant phylogeny , 2005, Paleobiology.