Multiple polyploidy events in the early radiation of nodulating and nonnodulating legumes.

Unresolved questions about evolution of the large and diverse legume family include the timing of polyploidy (whole-genome duplication; WGDs) relative to the origin of the major lineages within the Fabaceae and to the origin of symbiotic nitrogen fixation. Previous work has established that a WGD affects most lineages in the Papilionoideae and occurred sometime after the divergence of the papilionoid and mimosoid clades, but the exact timing has been unknown. The history of WGD has also not been established for legume lineages outside the Papilionoideae. We investigated the presence and timing of WGDs in the legumes by querying thousands of phylogenetic trees constructed from transcriptome and genome data from 20 diverse legumes and 17 outgroup species. The timing of duplications in the gene trees indicates that the papilionoid WGD occurred in the common ancestor of all papilionoids. The earliest diverging lineages of the Papilionoideae include both nodulating taxa, such as the genistoids (e.g., lupin), dalbergioids (e.g., peanut), phaseoloids (e.g., beans), and galegoids (=Hologalegina, e.g., clovers), and clades with nonnodulating taxa including Xanthocercis and Cladrastis (evaluated in this study). We also found evidence for several independent WGDs near the base of other major legume lineages, including the Mimosoideae-Cassiinae-Caesalpinieae (MCC), Detarieae, and Cercideae clades. Nodulation is found in the MCC and papilionoid clades, both of which experienced ancestral WGDs. However, there are numerous nonnodulating lineages in both clades, making it unclear whether the phylogenetic distribution of nodulation is due to independent gains or a single origin followed by multiple losses.

[1]  Guy Baele,et al.  Analysis of 41 plant genomes supports a wave of successful genome duplications in association with the Cretaceous–Paleogene boundary , 2014, Genome research.

[2]  Jens Kattge,et al.  A single evolutionary innovation drives the deep evolution of symbiotic N2-fixation in angiosperms , 2014, Nature Communications.

[3]  A. Kilian,et al.  New evidence of ancestral polyploidy in the Genistoid legume Lupinus angustifolius L. (narrow-leafed lupin) , 2014, Theoretical and Applied Genetics.

[4]  David C. Tank,et al.  Three keys to the radiation of angiosperms into freezing environments , 2013, Nature.

[5]  Amborella Genome The Amborella Genome and the Evolution of Flowering Plants , 2013, Science.

[6]  Chun Xing Li,et al.  Comparative genomics suggests that an ancestral polyploidy event leads to enhanced root nodule symbiosis in the Papilionoideae. , 2013, Molecular biology and evolution.

[7]  Colin N. Dewey,et al.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis , 2013, Nature Protocols.

[8]  Liang Liu,et al.  STRAW: Species TRee Analysis Web server , 2013, Nucleic Acids Res..

[9]  Daniel J. Murphy,et al.  Legume phylogeny and classification in the 21st century: Progress, prospects and lessons for other species-rich clades , 2013 .

[10]  C. Dieterich,et al.  FLEXBAR—Flexible Barcode and Adapter Processing for Next-Generation Sequencing Platforms , 2012, Biology.

[11]  R. Pennington,et al.  Revisiting the phylogeny of papilionoid legumes: New insights from comprehensively sampled early-branching lineages. , 2012, American journal of botany.

[12]  A. Bruneau,et al.  Phylogeny reconstruction in the Caesalpinieae grade (Leguminosae) based on duplicated copies of the sucrose synthase gene and plastid markers. , 2012, Molecular phylogenetics and evolution.

[13]  S. Sagitov,et al.  Statistical inference of allopolyploid species networks in the presence of incomplete lineage sorting. , 2012, Systematic biology.

[14]  J. Salse In silico archeogenomics unveils modern plant genome organisation, regulation and evolution. , 2012, Current opinion in plant biology.

[15]  P. Edger,et al.  Ancient whole genome duplications, novelty and diversification: the WGD Radiation Lag-Time Model. , 2012, Current opinion in plant biology.

[16]  Jin Wang,et al.  De novo assembly and Characterisation of the Transcriptome during seed development, and generation of genic-SSR markers in Peanut (Arachis hypogaea L.) , 2012, BMC Genomics.

[17]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[18]  Yeting Zhang,et al.  A genome triplication associated with early diversification of the core eudicots , 2012, Genome Biology.

[19]  P. Poole,et al.  The rules of engagement in the legume-rhizobial symbiosis. , 2011, Annual review of genetics.

[20]  Alvaro J. González,et al.  The Medicago Genome Provides Insight into the Evolution of Rhizobial Symbioses , 2011, Nature.

[21]  J. Doyle Phylogenetic perspectives on the origins of nodulation. , 2011, Molecular plant-microbe interactions : MPMI.

[22]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[23]  Y. Peer A mystery unveiled , 2011, Genome Biology.

[24]  Claude W. dePamphilis,et al.  Ancestral polyploidy in seed plants and angiosperms , 2011, Nature.

[25]  R. Wing,et al.  LysM-Type Mycorrhizal Receptor Recruited for Rhizobium Symbiosis in Nonlegume Parasponia , 2011, Science.

[26]  Scott V Edwards,et al.  A maximum pseudo-likelihood approach for estimating species trees under the coalescent model , 2010, BMC Evolutionary Biology.

[27]  D. Soltis,et al.  The age and diversification of the angiosperms re-revisited. , 2010, American journal of botany.

[28]  Steven B. Cannon,et al.  Polyploidy Did Not Predate the Evolution of Nodulation in All Legumes , 2010, PloS one.

[29]  Jens Stougaard,et al.  The molecular network governing nodule organogenesis and infection in the model legume Lotus japonicus , 2010, Nature communications.

[30]  T. Sakurai,et al.  Genome sequence of the palaeopolyploid soybean , 2010, Nature.

[31]  Thomas L. Madden,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.

[32]  J. Sprent Legume Nodulation: A Global Perspective , 2009 .

[33]  Steven Maere,et al.  Plants with double genomes might have had a better chance to survive the Cretaceous–Tertiary extinction event , 2009, Proceedings of the National Academy of Sciences.

[34]  D. Soltis,et al.  Rosid radiation and the rapid rise of angiosperm-dominated forests , 2009, Proceedings of the National Academy of Sciences.

[35]  S. Cannon,et al.  An analysis of synteny of Arachis with Lotus and Medicago sheds new light on the structure, stability and evolution of legume genomes , 2009, BMC Genomics.

[36]  J. Rougemont,et al.  A rapid bootstrap algorithm for the RAxML Web servers. , 2008, Systematic biology.

[37]  P. Herendeen,et al.  Phylogenetic patterns and diversification in the caesalpinioid legumes , 2008 .

[38]  Laurent Laplaze,et al.  SymRK defines a common genetic basis for plant root endosymbioses with arbuscular mycorrhiza fungi, rhizobia, and Frankiabacteria , 2008, Proceedings of the National Academy of Sciences.

[39]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[40]  B. Roe,et al.  Legume genome evolution viewed through the Medicago truncatula and Lotus japonicus genomes , 2006, Proceedings of the National Academy of Sciences.

[41]  Brian C. Thomas,et al.  Gene-balanced duplications, like tetraploidy, provide predictable drive to increase morphological complexity. , 2006, Genome research.

[42]  Peer Bork,et al.  PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments , 2006, Nucleic Acids Res..

[43]  D. Soltis,et al.  Widespread genome duplications throughout the history of flowering plants. , 2006, Genome research.

[44]  M. Wojciechowski,et al.  Evolutionary rates analysis of Leguminosae implicates a rapid diversification of lineages during the tertiary. , 2005, Systematic biology.

[45]  R. Shoemaker,et al.  Placing paleopolyploidy in relation to taxon divergence: a phylogenetic analysis in legumes using 39 gene families. , 2005, Systematic biology.

[46]  Steven Salzberg,et al.  DAGchainer: a tool for mining segmental genome duplications and synteny , 2004, Bioinform..

[47]  M. Sanderson,et al.  A phylogeny of legumes (Leguminosae) based on analysis of the plastid matK gene resolves many well-supported subclades within the family. , 2004, American journal of botany.

[48]  Jessica A Schlueter,et al.  Mining EST databases to resolve evolutionary events in major crop species. , 2004, Genome.

[49]  Guillaume Blanc,et al.  Functional Divergence of Duplicated Genes Formed by Polyploidy during Arabidopsis Evolution , 2004, The Plant Cell Online.

[50]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[51]  M. Luckow,et al.  The Rest of the Iceberg. Legume Diversity and Evolution in a Phylogenetic Context1 , 2003, Plant Physiology.

[52]  M. Parniske,et al.  Evolution of signal transduction in intracellular symbiosis. , 2002, Trends in plant science.

[53]  Gwilym P. Lewis,et al.  Legumes of the World , 2000 .

[54]  D. Soltis,et al.  Chloroplast gene sequence data suggest a single origin of the predisposition for symbiotic nitrogen fixation in angiosperms. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[55]  P. Inglis,et al.  A study of the relationships of cultivated peanut (Arachis hypogaea) and its most closely related wild species using intron sequences and microsatellite markers. , 2013, Annals of botany.

[56]  J. Doyle Polyploidy in Legumes , 2012 .

[57]  D. Sankoff,et al.  Polyploidy and angiosperm diversification. , 2009, American journal of botany.

[58]  P. Goldblatt Cytology and the Phylogeny of Leguminosae. , 1981 .

[59]  P. Raven The Bases of Angiosperm Phylogeny: Cytology , 1975 .

[60]  P. K. Wall,et al.  Identification of shared single copy nuclear genes in Arabidopsis, Populus, Vitis and Oryza and their phylogenetic utility across various taxonomic levels , 2010, BMC Evolutionary Biology.