The Plastid Genome of the Cryptophyte Alga, Guillardia theta: Complete Sequence and Conserved Synteny Groups Confirm Its Common Ancestry with Red Algae

Abstract. The plastid genome of the cryptophyte alga Guillardia theta (121,524 bp) has been completely sequenced. The genome is 33% G+C and contains a short, nonidentical inverted repeat (4.9 kb) encoding the two rRNA cistrons. The large and small single-copy regions are 96.3 and 15.4 kb, respectively. Forty-six genes encoding proteins for photosynthesis, 5 genes for biosynthetic function, 5 genes involved in replication and division, 30 tRNA genes, 44 ribosomal protein genes (26 large subunit and 18 small subunit), 3 translation factors, 8 genes encoding components of the transcriptional machinery including 3 ycfs (hypothetical chloroplast frames), and 26 additional ycfs have been identified. There are 8 ORFs longer than 50 amino acids, 3 of which have homologues on the plastid genome of the rhodophyte Porphyra purpurea (Reith and Munholland 1995) and/or the Synechocystis genome (Kaneko et al. 1996) and can be designated new ycfs. Intergenic spacers are very short, no introns have been detected, and several genes overlap, all of which results in a very compact genome. In addition, large clusters of genes (such as those for the ribosomal proteins) are organized into single transcriptional units (Wang et al. 1997), which makes the genome's organization more economical still. The cryptophyte plastid genome is composed almost entirely of gene clusters that are also found on the plastid genome of the rhodophyte Porphyra purpurea, confirming its common ancestry with red algae. Furthermore, recombination events involving both tRNA genes and the rRNA cistrons appear to have been responsible for the structure of the cryptophyte plastid genome, including the formation of the inverted repeat.
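
The region sizes quoted in the abstract can be checked against the reported genome length with a few lines of arithmetic. The sketch below is illustrative only: the function and variable names are hypothetical, and it assumes that both copies of the inverted repeat are about 4.9 kb, although the abstract describes the repeat as nonidentical.

```python
# Region sizes in kb and total length in bp, as reported in the abstract.
LSC_KB = 96.3        # large single-copy region
SSC_KB = 15.4        # small single-copy region
IR_KB = 4.9          # one copy of the inverted repeat (assumed equal for both copies)
GENOME_BP = 121_524  # reported genome length


def quadripartite_total_kb(lsc_kb, ssc_kb, ir_kb):
    """Sum the two single-copy regions and two inverted-repeat copies."""
    return lsc_kb + ssc_kb + 2 * ir_kb


total_kb = quadripartite_total_kb(LSC_KB, SSC_KB, IR_KB)
difference_kb = abs(total_kb - GENOME_BP / 1000)

print(f"sum of regions:  {total_kb:.1f} kb")
print(f"reported genome: {GENOME_BP / 1000:.3f} kb")
print(f"difference:      {difference_kb:.3f} kb")
```

Under that assumption the rounded region sizes sum to within a few tens of base pairs of the reported length, which is the agreement expected from figures given to one decimal place.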

[1]  David J. States,et al.  Identification of protein coding regions by database similarity search , 1993, Nature Genetics.

[2]  M. Sugiura,et al.  The chloroplast genome. , 1992, Plant molecular biology.

[3]  Y Van de Peer,et al.  Substitution rate calibration of small subunit ribosomal RNA identifies chlorarachniophyte endosymbionts as remnants of green algae. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[4]  X. Liu,et al.  The plastid genome of Cryptomonas phi encodes an hsp70-like protein, a histone-like protein, and an acyl carrier protein. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Alonso,et al.  The recombinant product of the Chryptomonas phi plastid gene hlpA is an architectural HU-like protein that promotes the assembly of complex nucleoprotein structures. , 1997, European journal of biochemistry.

[6]  S. Douglas Chloroplast Origins and Evolution , 1994 .

[7]  Richard Wetherbee,et al.  Guillardia theta gen. et sp. nov. (Cryptophyceae) , 1990 .

[8]  S. Douglas,et al.  STRUCTURAL, TRANSCRIPTIONAL, AND PHYLOGENETIC ANALYSES OF THE atpB GENE CLUSTER FROM THE PLASTID OF CRYPTOMONAS Φ (CRYPTOPHYCEAE) , 1994 .

[9]  J. Palmer,et al.  Rampant horizontal transfer and duplication of rubisco genes in eubacteria and plastids. , 1996, Molecular biology and evolution.

[10]  J. Popot,et al.  The 4-kDa nuclear-encoded PetM polypeptide of the chloroplast cytochrome b6f complex. Nucleic acid and protein sequences, targeting signals, transmembrane topology. , 1996, The Journal of biological chemistry.

[11]  Sayaka,et al.  Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions. , 1996, DNA research : an international journal for rapid publication of reports on genes and genomes.

[12]  K. Kowallik Origin and Evolution of Chloroplasts: Current Status and Future Perspectives , 1997 .

[13]  H. Bohnert,et al.  The complete sequence of the Cyanophora paradoxa cyanelle genome (Glaucocystophyceae) , 1997 .

[14]  T Gaasterland,et al.  Fully automated genome analysis that reflects user needs and preferences. A detailed introduction to the MAGPIE system architecture. , 1996, Biochimie.

[15]  A. Monfort,et al.  Complete sequence of Euglena gracilis chloroplast DNA. , 1993, Nucleic acids research.

[16]  P. Maliga,et al.  Deletion of rpoB reveals a second distinct transcription system in plastids of higher plants. , 1996, The EMBO journal.

[17]  D. Durnford,et al.  NUCLEOTIDE SEQUENCE OF THE GENE FOR THE LARGE SUBUNIT OF RIBULOSE‐1,5‐BISPHOSPHATE CARBOXYLASE/OXYGENASE FROM CRYPTOMONAS Φ: EVIDENCE SUPPORTING THE POLYPHYLETIC ORIGIN OF PLASTIDS , 1990 .

[18]  J. Walker,et al.  The organization and sequence of the genes for ATP synthase subunits in the cyanobacterium Synechococcus 6301. Support for an endosymbiotic origin of chloroplasts. , 1987, Journal of molecular biology.

[19]  T. Cavalier-smith,et al.  Bonsai genomics: sequencing the smallest eukaryotic genomes. , 1997, Trends in genetics : TIG.

[20]  A. Danon Translational Regulation in the Chloroplast , 1997, Plant physiology.

[21]  S. P. Gibbs The Chloroplast Endoplasmic Reticulum: Structure, Function, and Evolutionary Significance , 1981 .

[22]  K. Kowallik,et al.  Chloroplast ATPase genes in the diatom Odontella sinensis reflect cyanobacterial characters in structure and arrangement. , 1992, Journal of molecular biology.

[23]  Z. Hu,et al.  A DnaB intein in Rhodothermus marinus: indication of recent intein homing across remotely related organisms. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Y. Suzuki,et al.  Complete nucleotide sequence of the chloroplast genome from the green alga Chlorella vulgaris: the existence of genes possibly involved in chloroplast division. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[25]  D. Spencer,et al.  Cryptomonad algae are evolutionary chimaeras of two phylogenetically distinct unicellular eukaryotes , 1991, Nature.

[26]  C. Marck,et al.  'DNA Strider': a 'C' program for the fast analysis of DNA and protein sequences on the Apple Macintosh family of computers. , 1988, Nucleic acids research.

[27]  M Sugita,et al.  Organization of a large gene cluster encoding ribosomal proteins in the cyanobacterium Synechococcus sp. strain PCC 6301: comparison of gene clusters among cyanobacteria, eubacteria and chloroplast genomes. , 1997, Gene.

[28]  Y. Nakamura,et al.  Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions (supplement). , 1996, DNA research : an international journal for rapid publication of reports on genes and genomes.

[29]  P. Maliga,et al.  The two RNA polymerases encoded by the nuclear and the plastid compartments transcribe distinct groups of genes in tobacco plastids , 1997, The EMBO journal.

[30]  D. Bhattacharya,et al.  THE PHYLOGENY OF PLASTIDS: A REVIEW BASED ON COMPARISONS OF SMALL‐SUBUNIT RIBOSOMAL RNA CODING REGIONS , 1995 .

[31]  S. P. Gibbs,et al.  THE CRYPTOMONAD NUCLEOMORPH: ITS ULTRASTRUCTURE AND EVOLUTIONARY SIGNIFICANCE , 1980 .