Protein networks identify novel symbiogenetic genes resulting from plastid endosymbiosis

Significance Endosymbiotic gene transfer from the plastid genome to the nucleus comprises the most significant source of horizontal gene transfer in photosynthetic eukaryotes. We investigated genomic data at the infragenic level to determine whether the cyanobacterial endosymbiont also contributed gene fragments (i.e., domains) to create novel nuclear-encoded proteins. We found 67 such gene families that are expressed as RNA and widely distributed among plants and algae. At least 23 genes are putatively involved in redox regulation and light response, namely the maintenance of a photodynamic organelle. Our results add a new layer of complexity to plastid integration and point to the role of fused proteins as key players in this process. The integration of foreign genetic information is central to the evolution of eukaryotes, as has been demonstrated for the origin of the Calvin cycle and of the heme and carotenoid biosynthesis pathways in algae and plants. For photosynthetic lineages, this coordination involved three genomes of divergent phylogenetic origins (the nucleus, plastid, and mitochondrion). Major hurdles overcome by the ancestor of these lineages were harnessing the oxygen-evolving organelle, optimizing the use of light, and stabilizing the partnership between the plastid endosymbiont and host through retargeting of proteins to the nascent organelle. Here we used protein similarity networks that can disentangle reticulate gene histories to explore how these significant challenges were met. We discovered a previously hidden component of algal and plant nuclear genomes that originated from the plastid endosymbiont: symbiogenetic genes (S genes). These composite proteins, exclusive to photosynthetic eukaryotes, encode a cyanobacterium-derived domain fused to one of cyanobacterial or another prokaryotic origin and have emerged multiple, independent times during evolution. Transcriptome data demonstrate the existence and expression of S genes across a wide swath of algae and plants, and functional data indicate their involvement in tolerance to oxidative stress, phototropism, and adaptation to nitrogen limitation. Our research demonstrates the “recycling” of genetic information by photosynthetic eukaryotes to generate novel composite genes, many of which function in plastid maintenance.

[1]  Swati Agrawal,et al.  Toxoplasma gondii Tic20 is essential for apicoplast protein import , 2008, Proceedings of the National Academy of Sciences.

[2]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[3]  S. Gomes,et al.  A Rhodopsin-Guanylyl Cyclase Gene Fusion Functions in Visual Perception in a Fungus , 2014, Current Biology.

[4]  Debashish Bhattacharya,et al.  Cyanobacterial Contribution to Algal Nuclear Genomes Is Primarily Limited to Plastid Functions , 2006, Current Biology.

[5]  Diego Miranda-Saavedra,et al.  Chromerid genomes reveal the evolutionary path from photosynthetic algae to obligate intracellular parasites , 2015, eLife.

[6]  Xing Wang Deng,et al.  Conservation and Divergence of Light-Regulated Genome Expression Patterns during Seedling Development in Rice and Arabidopsis[W] , 2005, The Plant Cell Online.

[7]  Alexandros Stamatakis,et al.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies , 2014, Bioinform..

[8]  G. Murtaza,et al.  Molecular and genetic analyses of Tic20 homologues in Arabidopsis thaliana chloroplasts. , 2011, The Plant journal : for cell and molecular biology.

[9]  G. Heijne,et al.  ChloroP, a neural network‐based method for predicting chloroplast transit peptides and their cleavage sites , 1999, Protein science : a publication of the Protein Society.

[10]  A. Fernie,et al.  The Arabidopsis onset of leaf death5 Mutation of Quinolinate Synthase Affects Nicotinamide Adenine Dinucleotide Biosynthesis and Causes Early Ageing[W] , 2008, The Plant Cell Online.

[11]  C. Delwiche,et al.  Phylogeny and Molecular Evolution of the Green Algae , 2012 .

[12]  J. McInerney,et al.  A Pluralistic Account of Homology: Adapting the Models to the Data , 2013, Molecular biology and evolution.

[13]  D. Bhattacharya,et al.  Genome of the halotolerant green alga Picochlorum sp. reveals strategies for thriving under fluctuating environmental conditions. , 2015, Environmental microbiology.

[14]  D. Bhattacharya,et al.  Evolution of salt tolerance in a laboratory reared population of Chlamydomonas reinhardtii. , 2014, Environmental microbiology.

[15]  J. Overmann The phototrophic consortium "Chlorochromatium aggregatum" - a model for bacterial heterologous multicellularity. , 2010, Advances in experimental medicine and biology.

[16]  Narmada Thanki,et al.  CDD: NCBI's conserved domain database , 2014, Nucleic Acids Res..

[17]  K. Cline,et al.  Molecular characterization of the Arabidopsis 9-cis epoxycarotenoid dioxygenase gene family. , 2003, The Plant journal : for cell and molecular biology.

[18]  R. White,et al.  Eye-like ocelloids are built from different endosymbiotically acquired components , 2015, Nature.

[19]  G. Hölzl,et al.  Fatty Acid Phytyl Ester Synthesis in Chloroplasts of Arabidopsis[W] , 2012, Plant Cell.

[20]  T. Le Bihan,et al.  Shotgun proteomic analysis of the unicellular alga Ostreococcus tauri. , 2011, Journal of proteomics.

[21]  Sarah R. Smith,et al.  The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): Illuminating the Functional Diversity of Eukaryotic Life in the Oceans through Transcriptome Sequencing , 2014, PLoS biology.

[22]  E. Meyerowitz,et al.  Arabidopsis ethylene-response gene ETR1: similarity of product to two-component regulators. , 1993, Science.

[23]  K. Hirschi,et al.  Structural insights into the N-terminal GIY–YIG endonuclease activity of Arabidopsis glutaredoxin AtGRXS16 in chloroplasts , 2013, Proceedings of the National Academy of Sciences.

[24]  T. Richards,et al.  Genome-scale comparative analysis of gene fusions, gene fissions, and the fungal tree of life , 2012, Proceedings of the National Academy of Sciences.

[25]  K. V. van Wijk,et al.  The Functional Network of the Arabidopsis Plastoglobule Proteome Based on Quantitative Proteomics and Genome-Wide Coexpression Analysis1[C][W][OA] , 2012, Plant Physiology.

[26]  P. Falkowski,et al.  Deciphering Primordial Cyanobacterial Genome Functions from Protein Network Analysis , 2015, Current Biology.

[27]  P. Falkowski,et al.  Remodeling of intermediate metabolism in the diatom Phaeodactylum tricornutum under nitrogen stress , 2014, Proceedings of the National Academy of Sciences.

[28]  M. Mittag,et al.  The heme-binding protein SOUL3 of Chlamydomonas reinhardtii influences size and position of the eyespot. , 2013, Molecular plant.

[29]  Miroslav Oborník,et al.  Mosaic origin of the heme biosynthesis pathway in photosynthetic eukaryotes. , 2005, Molecular biology and evolution.

[30]  David Alvarez-Ponce,et al.  Gene similarity networks provide tools for understanding eukaryote origins and evolution , 2013, Proceedings of the National Academy of Sciences.

[31]  B. Halliwell Reactive Species and Antioxidants. Redox Biology Is a Fundamental Theme of Aerobic Life , 2006, Plant Physiology.

[32]  Sabine Cornelsen,et al.  Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[33]  R. Levine,et al.  Cytochrome f and plastocyanin: their sequence in the photosynthetic electron transport chain of Chlamydomonas reinhardi. , 1965, Proceedings of the National Academy of Sciences of the United States of America.

[34]  James O. McInerney,et al.  Evolutionary analyses of non-genealogical bonds produced by introgressive descent , 2012, Proceedings of the National Academy of Sciences.

[35]  D. Stammers,et al.  Calcium regulation of chloroplast protein translocation is mediated by calmodulin binding to Tic32 , 2006, Proceedings of the National Academy of Sciences.

[36]  Eric Bapteste,et al.  Extensive Gene Remodeling in the Viral World: New Evidence for Nongradual Evolution in the Mobilome Network , 2014, Genome biology and evolution.

[37]  M. Akita,et al.  Early Steps in the Biosynthesis of NAD in Arabidopsis Start with Aspartate and Occur in the Plastid1 , 2006, Plant Physiology.

[38]  T. Mock,et al.  Plastid proteome prediction for diatoms and other algae with secondary plastids of the red lineage , 2015, The Plant journal : for cell and molecular biology.

[39]  Michael Hippler,et al.  Characterizing the Anaerobic Response of Chlamydomonas reinhardtii by Quantitative Proteomics , 2010, Molecular & Cellular Proteomics.

[40]  C. Slamovits,et al.  Correction: Corrigendum: A bacterial proteorhodopsin proton pump in marine eukaryotes , 2011, Nature Communications.

[41]  Klaus Palme,et al.  Lateral relocation of auxin efflux regulator PIN3 mediates tropism in Arabidopsis , 2002, Nature.

[42]  G. McFadden Origin and evolution of plastids and photosynthesis in eukaryotes. , 2014, Cold Spring Harbor perspectives in biology.

[43]  M. Fontecave,et al.  Characterization of Arabidopsis thaliana SufE2 and SufE3. Functions in chloroplast iron-sulfur cluster assembly and NAD synthesis. , 2007, The Journal of Biological Chemistry.

[44]  Debashish Bhattacharya,et al.  Phylogeny of Calvin cycle enzymes supports Plantae monophyly. , 2007, Molecular phylogenetics and evolution.

[45]  Filipa L. Sousa,et al.  Origins of major archaeal clades correspond to gene acquisitions from bacteria , 2014, Nature.

[46]  E. Koonin,et al.  Bacterial rhodopsin: evidence for a new type of phototrophy in the sea. , 2000, Science.

[47]  A. Grossman,et al.  Ancient recruitment by chromists of green algal genes encoding enzymes for carotenoid biosynthesis. , 2008, Molecular biology and evolution.

[48]  Debashish Bhattacharya,et al.  A molecular timeline for the origin of photosynthetic eukaryotes. , 2004, Molecular biology and evolution.

[49]  N. Rouhier,et al.  Arabidopsis thaliana Nfu2 accommodates [2Fe-2S] or [4Fe-4S] clusters and is competent for in vitro maturation of chloroplast [2Fe-2S] and [4Fe-4S] cluster-containing proteins. , 2013, Biochemistry.

[50]  A. Weber,et al.  Proteomic analysis of the Cyanophora paradoxa muroplast provides clues on early events in plastid endosymbiosis , 2012, Planta.

[51]  S. Kim,et al.  The structure of the signal receiver domain of the Arabidopsis thaliana ethylene receptor ETR1. , 1999, Structure.

[52]  Anne Berry,et al.  MosaicFinder: identification of fused gene families in sequence similarity networks , 2013, Bioinform..

[53]  Qi Sun,et al.  PPDB, the Plant Proteomics Database at Cornell , 2008, Nucleic Acids Res..

[54]  T. Oertner,et al.  The rhodopsin–guanylyl cyclase of the aquatic fungus Blastocladiella emersonii enables fast optical control of cGMP signaling , 2015, Science Signaling.

[55]  D. Bhattacharya,et al.  Primary endosymbiosis and the evolution of light and oxygen sensing in photosynthetic eukaryotes , 2014, Front. Ecol. Evol..

[56]  M. Hasegawa,et al.  Gene transfer to the nucleus and the evolution of chloroplasts , 1998, Nature.