Phylogenomics reveals multiple losses of nitrogen-fixing root nodule symbiosis

Genomic traces of symbiosis loss A symbiosis between certain bacteria and their plant hosts delivers fixed nitrogen to the plants. Griesmann et al. sequenced several plant genomes to analyze why nitrogen-fixing symbiosis is irregularly scattered through the evolutionary tree (see the Perspective by Nagy). Various genomes carried traces of lost pathways that could have supported nitrogen-fixing symbiosis. It seems that this symbiosis, which relies on multiple pathways and complex interorganismal signaling, is susceptible to selection and prone to being lost over evolutionary time. Science, this issue p. eaat1743; see also p. 125 Genome-wide comparative analysis across species reveals the fragility of the plant-bacterial symbiosis needed for nitrogen fixation. INTRODUCTION Access to nutrients such as nitrogen is required for plant growth. Legumes and nine additional plant families benefit from the nitrogen-fixing root nodule (NFN) symbiosis, in which roots develop nodules that intracellularly host nitrogen-fixing bacteria. In this mutually beneficial symbiosis, the bacteria convert atmospheric nitrogen into ammonium and deliver it to the host plant. NFN symbiosis thus enables plant survival under nitrogen-limiting conditions in terrestrial ecosystems. In agriculture, this symbiosis reduces reliance on nitrogen fertilizer, thus reducing the costs, ecological impact, and fossil fuel consumption attendant on large-scale application of fertilizers. RATIONALE Molecular phylogenies show that NFN symbiosis is restricted to four angiosperm orders—Fabales, Fagales, Cucurbitales, and Rosales—that together form the monophyletic NFN clade. However, only 10 of the 28 plant families within this clade contain species engaged in the NFN symbiosis. Even within these 10 families, most genera do not form this symbiosis. The NFN symbiosis requires the coordinated function of more than 30 essential genes. Presence of this symbiosis in related families suggests that a genetic change in the ancestor of the NFN clade enabled evolution of NFN symbiosis in this clade. The scattered distribution of functional NFN symbiosis across the clade has led to the question of whether NFN symbiosis evolved multiple times independently in a convergent manner or was lost multiple times regardless of the number of times it arose. Fossil data have been unable to answer this question. Here we used molecular evidence to ask how the current pattern of plant species with NFN symbiosis evolved. RESULTS We sequenced the genomes of seven nodulating species belonging to the Fagales, Rosales, and Cucurbitales orders and the legume subfamily Caesalpinioideae. We complemented this dataset by sequencing three genomes of nonnodulating species from the Cucurbitales and from the legume subfamilies Cercidoideae and Papilionoideae. Using a genome-wide phylogenomic approach, we found that all legume genes with a characterized role in NFN symbiosis are conserved in nodulating species with one exception. We observed larger numbers of order-specific gene family expansions that, solely because of their phylogenetic distribution, may include genes contributing to multiple gains or subsequent refinements of the symbiosis. In parallel, we discovered signatures of multiple independent loss-of-function events for the gene encoding the indispensable NFN symbiosis regulator NODULE INCEPTION (NIN) in 10 of 13 genomes of nonnodulating species within the NFN clade. The pattern suggests at least eight independent losses of NFN symbiosis. CONCLUSION We found that multiple independent losses of NFN symbiosis occurred in the four orders of the NFN clade. These results suggest that NFN symbiosis has previously been more common than currently evident and that this symbiosis is subject to an underestimated adverse selection pressure. Phylogenomics and evolution of NFN symbiosis. Genome sequencing of nodulating and nonnodulating species combined with 27 previously available genomes resulted in a dataset spanning the NFN clade and species outside the NFN clade as an outgroup. Orthogroups were identified and filtered following three phylogenetic patterns. This genome-wide analysis identified two genes involved in NFN symbiosis, NIN and RHIZOBIUM-DIRECTED POLAR GROWTH (RPG), that were lost in most nonnodulating species. The occurrence of multiple losses (red crosses) of NFN symbiosis suggests an adverse selection pressure. The root nodule symbiosis of plants with nitrogen-fixing bacteria affects global nitrogen cycles and food production but is restricted to a subset of genera within a single clade of flowering plants. To explore the genetic basis for this scattered occurrence, we sequenced the genomes of 10 plant species covering the diversity of nodule morphotypes, bacterial symbionts, and infection strategies. In a genome-wide comparative analysis of a total of 37 plant species, we discovered signatures of multiple independent loss-of-function events in the indispensable symbiotic regulator NODULE INCEPTION in 10 of 13 genomes of nonnodulating species within this clade. The discovery that multiple independent losses shaped the present-day distribution of nitrogen-fixing root nodule symbiosis in plants reveals a phylogenetically wider distribution in evolutionary history and a so-far-underestimated selection pressure against this symbiosis.

[1]  D. Soltis Faculty Opinions recommendation of Phylogenomics reveals multiple losses of nitrogen-fixing root nodule symbiosis. , 2018, Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature.

[2]  S. Smit,et al.  Comparative genomics of the nonlegume Parasponia reveals insights into evolution of nitrogen-fixing rhizobium symbioses , 2018, Proceedings of the National Academy of Sciences.

[3]  T. Bisseling,et al.  CRISPR/Cas9-Mediated Mutagenesis of Four Putative Symbiosis Genes of the Tropical Tree Parasponia andersonii Reveals Novel Phenotypes , 2018, Front. Plant Sci..

[4]  A. von Haeseler,et al.  MPBoot: fast phylogenetic maximum parsimony tree inference and bootstrap approximation , 2018, BMC Evolutionary Biology.

[5]  E. James,et al.  Biogeography of nodulated legumes and their nitrogen-fixing symbionts. , 2017, The New phytologist.

[6]  L. Nagy Evolution: Complex Multicellular Life with 5,500 Genes , 2017, Current Biology.

[7]  Nuttha Thongchul,et al.  Identification and lactic acid production of bacteria isolated from soils and tree barks , 2017 .

[8]  Thomas K. F. Wong,et al.  ModelFinder: Fast Model Selection for Accurate Phylogenetic Estimates , 2017, Nature Methods.

[9]  Rafael Barbosa Pinto,et al.  A new subfamily classification of the leguminosae based on a taxonomically comprehensive phylogeny , 2017 .

[10]  Jianping Wang,et al.  Transcriptome profiles reveal gene regulation of peanut (Arachis hypogaea L.) nodulation , 2017, Scientific Reports.

[11]  R. Garrido-Oter,et al.  Root nodule symbiosis in Lotus japonicus drives the establishment of distinctive rhizosphere, root, and nodule bacterial communities , 2016, Proceedings of the National Academy of Sciences.

[12]  J. Doyle Chasing unicorns: Nodulation origins and the paradox of novelty. , 2016, American journal of botany.

[13]  Maureen A. O’Malley,et al.  Losing Complexity: The Role of Simplification in Macroevolution. , 2016, Trends in ecology & evolution.

[14]  K. Vandepoele,et al.  Are We There Yet? Reliably Estimating the Completeness of Plant Genome Sequences[OPEN] , 2016, Plant Cell.

[15]  C. Cañestro,et al.  Evolution by gene loss , 2016, Nature Reviews Genetics.

[16]  P. Bork,et al.  ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data , 2016, Molecular biology and evolution.

[17]  L. Mueller,et al.  Genes conserved for arbuscular mycorrhizal symbiosis identified through phylogenomics , 2016, Nature Plants.

[18]  L. Tisa,et al.  The Casuarina NIN gene is transcriptionally activated throughout Frankia root infection as well as in response to bacterial diffusible signals. , 2015, The New phytologist.

[19]  Evgeny M. Zdobnov,et al.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs , 2015, Bioinform..

[20]  D. Soltis,et al.  Large-scale phylogenetic analyses reveal multiple gains of actinorhizal nitrogen-fixing symbioses in angiosperms associated with climate change , 2015, Scientific Reports.

[21]  S. Kelly,et al.  OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy , 2015, Genome Biology.

[22]  G. Oldroyd,et al.  Tracing the evolutionary path to nitrogen-fixing crops. , 2015, Current opinion in plant biology.

[23]  C. Bond,et al.  Convergent evolution of strigolactone perception enabled host detection in parasitic plants , 2015, Science.

[24]  Olivier Gascuel,et al.  FastME 2.0: A Comprehensive, Accurate, and Fast Distance-Based Phylogeny Inference Program , 2015, Molecular biology and evolution.

[25]  Matthieu Muffato,et al.  Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference , 2015, Systematic biology.

[26]  M. Delorenzi,et al.  A novel bioinformatics pipeline to discover genes related to arbuscular mycorrhizal symbiosis based on their evolutionary conservation pattern among higher plants , 2014, BMC Plant Biology.

[27]  A. von Haeseler,et al.  IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies , 2014, Molecular biology and evolution.

[28]  Stephen A. Smith,et al.  Orthology Inference in Nonmodel Organisms Using Transcriptomes and Low-Coverage Genomes: Improving Accuracy and Matrix Occupancy for Phylogenomics , 2014, Molecular biology and evolution.

[29]  D. Hibbett,et al.  Latent homology and convergent regulatory evolution underlies the repeated emergence of yeasts , 2014, Nature Communications.

[30]  G. Coruzzi,et al.  Comparative Phylogenomics Uncovers the Impact of Symbiotic Associations on Host Genome Evolution , 2014, PLoS genetics.

[31]  A. Furtado,et al.  Protocol: a simple method for extracting next-generation sequencing quality genomic DNA from recalcitrant plant species , 2014, Plant Methods.

[32]  Jens Kattge,et al.  A single evolutionary innovation drives the deep evolution of symbiotic N2-fixation in angiosperms , 2014, Nature Communications.

[33]  Rod A Wing,et al.  A reference genome for common bean and genome-wide analysis of dual domestications , 2014, Nature Genetics.

[34]  Tetsuya Hayashi,et al.  Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads , 2014, Genome research.

[35]  S. Aoki,et al.  Evolutionary Dynamics of Nitrogen Fixation in the Legume–Rhizobia Symbiosis , 2014, PloS one.

[36]  R. Wrangham,et al.  Legume nodulation , 2014, Current Biology.

[37]  T. Bisseling,et al.  Evolution of a symbiotic receptor through gene duplications in the legume-rhizobium mutualism. , 2014, The New phytologist.

[38]  Alexandros Stamatakis,et al.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies , 2014, Bioinform..

[39]  Carolyn J. Lawrence-Dill,et al.  MAKER-P: A Tool Kit for the Rapid Creation, Management, and Quality Control of Plant Genome Annotations1[W][OPEN] , 2013, Plant Physiology.

[40]  P. Normand,et al.  Micromonospora is a normal occupant of actinorhizal nodules , 2013, Journal of Biosciences.

[41]  David L. Stern,et al.  The genetic causes of convergent evolution , 2013, Nature Reviews Genetics.

[42]  Mira V. Han,et al.  Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. , 2013, Molecular biology and evolution.

[43]  S. Svistoonoff,et al.  The Independent Acquisition of Plant Root Nitrogen-Fixing Symbiosis in Fabids Recruited the Same Genetic Pathway for Nodule Organogenesis , 2013, PloS one.

[44]  Gordon Gremme,et al.  GenomeTools: A Comprehensive Software Library for Efficient Processing of Structured Genome Annotations , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[45]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[46]  Jian Wang,et al.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler , 2012, GigaScience.

[47]  James Leebens-Mack,et al.  Evaluating Methods for Isolating Total RNA and Predicting the Success of Sequencing Phylogenetically Diverse Plant Transcriptomes , 2012, PloS one.

[48]  C. Davis,et al.  Phylogenomics and a posteriori data partitioning resolve the Cretaceous angiosperm radiation Malpighiales , 2012, Proceedings of the National Academy of Sciences.

[49]  M. Chiurazzi,et al.  Molecular characterization of the Lotus japonicus NRT1(PTR) and NRT2 families. , 2012, Plant, cell & environment.

[50]  J. Setubal,et al.  Distribution of nitrogen fixation and nitrogenase-like sequences amongst microbial genomes , 2012, BMC Genomics.

[51]  K. Demchenko,et al.  The diversity of actinorhizal symbiosis , 2012, Protoplasma.

[52]  Alvaro J. González,et al.  The Medicago Genome Provides Insight into the Evolution of Rhizobial Symbioses , 2011, Nature.

[53]  J. Doyle Phylogenetic perspectives on the origins of nodulation. , 2011, Molecular plant-microbe interactions : MPMI.

[54]  A. Good,et al.  Future Prospects for Cereals That Fix Nitrogen , 2011, Science.

[55]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.

[56]  Carl Kingsford,et al.  A fast, lock-free approach for efficient parallel counting of occurrences of k-mers , 2011, Bioinform..

[57]  Susan R. Wessler,et al.  MITE-Hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences , 2010, Nucleic acids research.

[58]  T. Sakurai,et al.  Genome sequence of the palaeopolyploid soybean , 2010, Nature.

[59]  Mingyi Wang,et al.  The Medicago truncatula gene expression atlas web server , 2009, BMC Bioinformatics.

[60]  S. Carroll,et al.  Deep homology and the origins of evolutionary novelty , 2009, Nature.

[61]  T. Mailund,et al.  SNPFile – A software library and file format for large scale association mapping and population genetics studies , 2008, BMC Bioinformatics.

[62]  Martin Parniske,et al.  Arbuscular mycorrhiza: the mother of plant root endosymbioses , 2008, Nature Reviews Microbiology.

[63]  A. Jauneau,et al.  The RPG gene of Medicago truncatula controls Rhizobium-directed polar growth during infection , 2008, Proceedings of the National Academy of Sciences.

[64]  S. Carroll Evo-Devo and an Expanding Evolutionary Synthesis: A Genetic Theory of Morphological Evolution , 2008, Cell.

[65]  Laurent Laplaze,et al.  SymRK defines a common genetic basis for plant root endosymbioses with arbuscular mycorrhiza fungi, rhizobia, and Frankiabacteria , 2008, Proceedings of the National Academy of Sciences.

[66]  Katharina Markmann,et al.  Functional Adaptation of a Plant Receptor- Kinase Paved the Way for the Evolution of Intracellular Root Symbioses with Bacteria , 2008, PLoS biology.

[67]  K. Treseder,et al.  Nitrogen limitation of net primary productivity in terrestrial ecosystems is globally distributed. , 2008, Ecology.

[68]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[69]  J. Sprent Evolving ideas of legume evolution and diversity: a taxonomic perspective on the occurrence of nodulation. , 2007, The New phytologist.

[70]  J. F. Marsh,et al.  Medicago truncatula NIN Is Essential for Rhizobial-Independent Nodule Organogenesis Induced by Autoactive Calcium/Calmodulin-Dependent Protein Kinase1 , 2007, Plant Physiology.

[71]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[72]  D. Balding,et al.  Handbook of statistical genetics , 2004 .

[73]  Victor V. Solovyev,et al.  Statistical Approaches in Eukaryotic Gene Prediction , 2004 .

[74]  Burkhard Morgenstern,et al.  AUGUSTUS: a web server for gene finding in eukaryotes , 2004, Nucleic Acids Res..

[75]  Ian Korf,et al.  Gene finding in novel genomes , 2004, BMC Bioinformatics.

[76]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[77]  Stephen M. Mount,et al.  Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. , 2003, Nucleic acids research.

[78]  S. West,et al.  Host sanctions and the legume–rhizobium mutualism , 2003, Nature.

[79]  C. Fraser,et al.  Phylogenomics: Intersection of Evolution and Genomics , 2003, Science.

[80]  Arsen O. Batagov,et al.  The Sym35 Gene Required for Root Nodule Development in Pea Is an Ortholog of Nin from Lotus japonicus 1 , 2003, Plant Physiology.

[81]  M. Parniske,et al.  Evolution of signal transduction in intracellular symbiosis. , 2002, Trends in plant science.

[82]  Rolf Apweiler,et al.  InterProScan - an integration platform for the signature-recognition methods in InterPro , 2001, Bioinform..

[83]  D J Lipman,et al.  Lineage-specific loss and divergence of functionally linked genes in eukaryotes. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[84]  Eugene W. Myers,et al.  A whole-genome assembly of Drosophila. , 2000, Science.

[85]  Leif Schauser,et al.  A plant regulator controlling development of symbiotic root nodules , 1999, Nature.

[86]  C. Valverde,et al.  Time course of nodule development in the Discaria trinervis (Rhamnaceae) -Frankia symbiosis. , 1999, The New phytologist.

[87]  M. Borodovsky,et al.  GeneMark.hmm: new solutions for gene finding. , 1998, Nucleic acids research.

[88]  S. Swensen THE EVOLUTION OF ACTINORHIZAL SYMBIOSES: EVIDENCE FOR MULTIPLE ORIGINS OF THE SYMBIOTIC ASSOCIATION , 1996 .

[89]  T. Bisseling,et al.  A nodule-specific gene encoding a subtilisin-like protease is expressed in early stages of actinorhizal nodule development. , 1995, The Plant cell.

[90]  D. Soltis,et al.  Chloroplast gene sequence data suggest a single origin of the predisposition for symbiotic nitrogen fixation in angiosperms. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[91]  James B. Hicks,et al.  A plant DNA minipreparation: Version II , 1983, Plant Molecular Biology Reporter.

[92]  W. Rutter,et al.  Isolation of biologically active ribonucleic acid from sources enriched in ribonuclease. , 1979, Biochemistry.

[93]  D. Gorbet,et al.  A Non-Nodulating Peanut 1 , 1979 .

[94]  M. R. Chandler Some Observations on Infection of Arachis hypogaea L. by Rhizobium , 1978 .

[95]  G. Fåhraeus The infection of clover root hairs by nodule bacteria studied by a simple glass slide technique. , 1957, Journal of general microbiology.

[96]  D. Soltis,et al.  T HE AGE AND DIVERSIFICATION OF THE ANGIOSPERMS RE - REVISITED 1 , 2010 .

[97]  W. Newton,et al.  Nitrogen-fixing actinorhizal symbioses , 2008 .

[98]  S. Swensen,et al.  Evolution Of Actinorhizal Host Plants And Frankia Endosymbionts , 2007 .

[99]  Gesine Reinert,et al.  Poisson Process Approximation for Sequence Repeats and Sequencing by Hybridization , 1996, J. Comput. Biol..

[100]  P. Wong,et al.  Inhibition of legume nodule formation and N2 fixation by nitrate , 1988 .

[101]  H. Hellriegel,et al.  Untersuchungen über die Stickstoffnahrung der Gramineen und Leguminosen , 1888 .

[102]  Walter Pirovano,et al.  BIOINFORMATICS APPLICATIONS , 2022 .