What can we do with 1000 plastid genomes?

The plastid genome of plants is the smallest and most gene-rich of the three genomes in each cell and the one generally present in the highest copy number. As a result, obtaining plastid DNA sequence is a particularly cost-effective way of discovering genetic information about a plant. Until recently, the sequence information gathered in this way was generally limited to small portions of the genome amplified by polymerase chain reaction, but recent advances in sequencing technology have stimulated a substantial rate of increase in the sequencing of complete plastid genomes. Within the last year, the number of complete plastid genomes accessible in public sequence repositories has exceeded 1000. This sudden flood of data raises numerous challenges in data analysis and interpretation, but also offers the keys to potential insights across large swathes of plant biology. We examine what has been learnt so far, what more could be learnt if we look at the data in the right way, and what we might gain from the tens of thousands more genome sequences that will surely arrive in the next few years. The most exciting new discoveries are likely to be made at the interdisciplinary interfaces between molecular biology and ecology.

[1]  Ian Small,et al.  The Complete Sequence of the Acacia ligulata Chloroplast Genome Reveals a Highly Divergent clpP1 Gene , 2015, PloS one.

[2]  R. Bock,et al.  Why are most organelle genomes transmitted maternally? , 2014, BioEssays : news and reviews in molecular, cellular and developmental biology.

[3]  Dan G. Bock,et al.  On the adaptive value of cytoplasmic genomes in plants , 2014, Molecular ecology.

[4]  A. J. Bendich,et al.  The linear plastid chromosomes of maize: terminal sequences, structures, and implications for DNA replication , 2015, Current Genetics.

[5]  Susanne S. Renner,et al.  The Plastomes of Two Species in the Endoparasite Genus Pilostyles (Apodanthaceae) Each Retain Just Five or Six Possibly Functional Genes , 2015, Genome biology and evolution.

[6]  Gregory W. Stull,et al.  A targeted enrichment strategy for massively parallel sequencing of angiosperm plastid genomes , 2013, Applications in plant sciences.

[7]  Patrick J. Biggs,et al.  Systematic Error in Seed Plant Phylogenomics , 2011, Genome biology and evolution.

[8]  W. John Kress,et al.  A DNA barcode for land plants , 2009, Proceedings of the National Academy of Sciences.

[9]  R. Dobler,et al.  A meta‐analysis of the strength and nature of cytoplasmic genetic effects , 2014, Journal of evolutionary biology.

[10]  M. Hanson,et al.  Chloroplast RNA metabolism. , 2010, Annual review of plant biology.

[11]  Tracey A Ruhlman,et al.  NDH expression marks major transitions in plant evolution and reveals coordinate intracellular gene loss , 2015, BMC Plant Biology.

[12]  B. Rao,et al.  A comparative approach to elucidate chloroplast genome replication , 2009, BMC Genomics.

[13]  Yan Zhang,et al.  Mechanisms of Functional and Physical Genome Reduction in Photosynthetic and Nonphotosynthetic Parasitic Plants of the Broomrape Family[W][OPEN] , 2013, Plant Cell.

[14]  I. Small,et al.  Integration of complete chloroplast genome sequences with small amplicon datasets improves phylogenetic resolution in Acacia. , 2016, Molecular phylogenetics and evolution.

[15]  C. Zeyl,et al.  Organelle inheritance in plants , 1994, Heredity.

[16]  N. Colegrave,et al.  Direct estimate of the spontaneous mutation rate uncovers the effects of drift and recombination in the Chlamydomonas reinhardtii plastid genome , 2015, bioRxiv.

[17]  D. Karcher,et al.  A new synthetic biology approach allows transfer of an entire metabolic pathway from a medicinal plant to a biomass crop , 2016, eLife.

[18]  Linda A. Raubeson,et al.  Comparative chloroplast genomics: analyses including new sequences from the angiosperms Nuphar advena and Ranunculus macranthus , 2007, BMC Genomics.

[19]  I. Andersson,et al.  Changes in Rubisco kinetics during the evolution of C4 photosynthesis in Flaveria (Asteraceae) are associated with positive selection on genes encoding the enzyme. , 2011, Molecular biology and evolution.

[20]  A. Barkan,et al.  Codon Optimization to Enhance Expression Yields Insights into Chloroplast Translation1[OPEN] , 2016, Plant Physiology.

[21]  B. Green Chloroplast genomes of photosynthetic eukaryotes. , 2011, The Plant journal : for cell and molecular biology.

[22]  Karen B Barnard-Kubow,et al.  Biparental chloroplast inheritance leads to rescue from cytonuclear incompatibility. , 2017, The New phytologist.

[23]  P. Medgyesy,et al.  Limited chloroplast gene transfer via recombination overcomes plastomegenome incompatibility between Nicotiana tabacum and Solanum tuberosum , 2004, Plant Molecular Biology.

[24]  A. Millar,et al.  The pentatricopeptide repeat gene OTP51 with two LAGLIDADG motifs is required for the cis-splicing of plastid ycf3 intron 2 in Arabidopsis thaliana. , 2008, The Plant journal : for cell and molecular biology.

[25]  R. J. Bayer,et al.  Molecular phylogenetics of Acacia (Fabaceae: Mimosoideae) based on the chloroplast MATK coding sequence and flanking TRNK intron spacer regions. , 2001, American journal of botany.

[26]  K. Yamaguchi,et al.  Chloroplast RNA editing going extreme: more than 3400 events of C-to-U editing in the chloroplast transcriptome of the lycophyte Selaginella uncinata , 2014, RNA.

[27]  M. Chase,et al.  Ancient flowering plants: DNA sequences and angiosperm classification , 2001, Genome Biology.

[28]  Charles S. Bond,et al.  A Combinatorial Amino Acid Code for RNA Recognition by Pentatricopeptide Repeat Proteins , 2012, PLoS genetics.

[29]  C. dePamphilis,et al.  Disproportional plastome-wide increase of substitution rates and relaxed purifying selection in genes of carnivorous Lentibulariaceae. , 2014, Molecular biology and evolution.

[30]  Filipa L. Sousa,et al.  YCF1: A Green TIC? , 2015, Plant Cell.

[31]  A. Grossman,et al.  The Type II NADPH Dehydrogenase Facilitates Cyclic Electron Flow, Energy-Dependent Quenching, and Chlororespiratory Metabolism during Acclimation of Chlamydomonas reinhardtii to Nitrogen Deprivation1[OPEN] , 2016, Plant Physiology.

[32]  A. Barkan Expression of Plastid Genes: Organelle-Specific Elaborations on a Prokaryotic Scaffold1 , 2011, Plant Physiology.

[33]  U. Gowik,et al.  The Path from C3 to C4 Photosynthesis1 , 2010, Plant Physiology.

[34]  Kai F. Müller,et al.  The evolution of the plastid chromosome in land plants: gene content, gene order, gene function , 2011, Plant Molecular Biology.

[35]  Jim Leebens-Mack,et al.  Identifying the basal angiosperm node in chloroplast genome phylogenies: sampling one's way out of the Felsenstein zone. , 2005, Molecular biology and evolution.

[36]  Rachel S. Meyer,et al.  Possible Loss of the Chloroplast Genome in the Parasitic Flowering Plant Rafflesia lagascae (Rafflesiaceae) , 2014, Molecular biology and evolution.

[37]  H. Daniell,et al.  Chloroplast genomes: diversity, evolution, and applications in genetic engineering , 2016, Genome Biology.

[38]  Shilin Chen,et al.  Plant DNA barcoding: from gene to genome , 2015, Biological reviews of the Cambridge Philosophical Society.

[39]  J. Mandel,et al.  Ancestral Gene Flow and Parallel Organellar Genome Capture Result in Extreme Phylogenomic Discord in a Lineage of Angiosperms , 2016, Systematic biology.

[40]  Pierre Taberlet,et al.  From barcodes to genomes: extending the concept of DNA barcoding , 2016, Molecular ecology.

[41]  M. Kunnimalaiyaan,et al.  Fine mapping of replication origins (ori A and ori B) in Nicotiana tabacum chloroplast DNA. , 1997, Nucleic acids research.

[42]  N. Sato,et al.  Enzymes involved in organellar DNA replication in photosynthetic eukaryotes , 2014, Front. Plant Sci..

[43]  Wen J. Li,et al.  Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation , 2015, Nucleic Acids Res..

[44]  P. Keeling,et al.  Mitochondrial and plastid genome architecture: Reoccurring themes, but significant differences at the extremes , 2015, Proceedings of the National Academy of Sciences.

[45]  Y. Nakamura,et al.  Complete structure of the chloroplast genome of Arabidopsis thaliana. , 1999, DNA research : an international journal for rapid publication of reports on genes and genomes.

[46]  Mark Fishbein,et al.  Navigating the tip of the genomic iceberg: Next-generation sequencing for plant systematics. , 2012, American journal of botany.

[47]  H. Koop,et al.  Linear molecules of tobacco ptDNA end at known replication origins and additional loci , 2006, Plant Molecular Biology.

[48]  R. A. Atherton,et al.  Whole genome sequencing of enriched chloroplast DNA using the Illumina GAII platform , 2010, Plant Methods.

[49]  Pamela S Soltis,et al.  From algae to angiosperms–inferring the phylogeny of green plants (Viridiplantae) from 360 plastid genomes , 2014, BMC Evolutionary Biology.

[50]  Ian Small,et al.  Predictable Alteration of Sequence Recognition by RNA Editing Factors from Arabidopsis[OPEN] , 2015, Plant Cell.

[51]  R. Bock Engineering plastid genomes: methods, tools, and applications in basic research and biotechnology. , 2015, Annual review of plant biology.

[52]  R. Sage Variation in the kcat of Rubisco in C3 and C4 plants and some implications for photosynthetic performance at high and low temperature , 2002 .

[53]  Ian Small,et al.  Rampant Gene Loss in the Underground Orchid Rhizanthella gardneri Highlights Evolutionary Constraints on Plastid Genomes , 2011, Molecular biology and evolution.

[54]  Kui Lin,et al.  Sequencing Angiosperm Plastid Genomes Made Easy: A Complete Set of Universal Primers and a Case Study on the Phylogeny of Saxifragales , 2013, Genome biology and evolution.

[55]  Ashley B. Morris,et al.  Chloroplast DNA sequence utility for the lowest phylogenetic and phylogeographic inferences in angiosperms: the tortoise and the hare IV. , 2014, American journal of botany.

[56]  Brian D. Ondov,et al.  Mash: fast genome and metagenome distance estimation using MinHash , 2015, Genome Biology.

[57]  C. Lawrence,et al.  An Integrated Perspective on Phylogenetic Workflows. , 2016, Trends in ecology & evolution.

[58]  Riccardo Velasco,et al.  An evaluation of the PacBio RS platform for sequencing and de novo assembly of a chloroplast genome , 2013, BMC Genomics.

[59]  D. Smith Mutation Rates in Plastid Genomes: They Are Lower than You Might Think , 2015, Genome biology and evolution.

[60]  C. Birky Uniparental inheritance of mitochondrial and chloroplast genes: mechanisms and evolution. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[61]  Miller Tran,et al.  Chlamydomonas reinhardtii chloroplasts as protein factories. , 2007, Current opinion in biotechnology.

[62]  D. Soltis,et al.  Angiosperm phylogeny inferred from multiple genes as a tool for comparative biology , 1999, Nature.

[63]  Elena S. Ulbricht-Jones,et al.  Spontaneous Chloroplast Mutants Mostly Occur by Replication Slippage and Show a Biased Pattern in the Plastome of Oenothera[OPEN] , 2016, Plant Cell.

[64]  Mitsuyasu Hasebe,et al.  High levels of RNA editing in a vascular plant chloroplast genome: analysis of transcripts from the fern Adiantum capillus-veneris. , 2004, Gene.

[65]  A. Barkan,et al.  Mechanism of RNA stabilization and translational activation by a pentatricopeptide repeat protein , 2010, Proceedings of the National Academy of Sciences.

[66]  Zou,et al.  In vivo analysis of plastid psbA, rbcL and rpl32 UTR elements by chloroplast transformation: tobacco plastid gene expression is controlled by modulation of transcript levels and translation efficiency , 1999, The Plant journal : for cell and molecular biology.

[67]  Peter M Hollingsworth,et al.  Telling plant species apart with DNA: from barcodes to genomes , 2016, Philosophical Transactions of the Royal Society B: Biological Sciences.

[68]  Robert K. Jansen,et al.  Automatic annotation of organellar genomes with DOGMA , 2004, Bioinform..

[69]  S. Graham,et al.  Understanding the spectacular failure of DNA barcoding in willows (Salix): Does this result from a trans‐specific selective sweep? , 2014, Molecular ecology.

[70]  F. Takaiwa,et al.  The complete nucleotide sequence of the tobacco chloroplast genome: its gene organization and expression , 1986, The EMBO journal.

[71]  Xiaojun Guan,et al.  CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences , 2012, BMC Genomics.

[72]  K. Krause From chloroplasts to “cryptic” plastids: evolution of plastid genomes in parasitic plants , 2008, Current Genetics.

[73]  J. Palmer,et al.  Chloroplast DNA rearrangements are more frequent when a large inverted repeat sequence is lost , 1982, Cell.

[74]  S. Graham,et al.  Are plant species inherently harder to discriminate than animal species using DNA barcoding markers? , 2009, Molecular ecology resources.

[75]  Q. Wang,et al.  Structural basis for specific single-stranded RNA recognition by designer pentatricopeptide repeat proteins , 2016, Nature Communications.

[76]  R. A. Atherton,et al.  The evolutionary root of flowering plants. , 2013, Systematic biology.

[77]  R. W. Ness,et al.  Strategies for complete plastid genome sequencing , 2016, Molecular ecology resources.

[78]  N. Brisson,et al.  Recombination and the maintenance of plant organelle genome stability. , 2010, The New phytologist.

[79]  S. Wicke,et al.  Massive intracellular gene transfer during plastid genome reduction in nongreen Orobanchaceae. , 2016, The New phytologist.

[80]  J. Lundberg,et al.  An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants : APG II THE ANGIOSPERM PHYLOGENY GROUP * , 2003 .

[81]  A. Barkan,et al.  Protein-mediated protection as the predominant mechanism for defining processed mRNA termini in land plant chloroplasts , 2011, Nucleic acids research.

[82]  Elizabeth A. Kellogg,et al.  Verdant: automated annotation, alignment and phylogenetic analysis of whole chloroplast genomes , 2016, Bioinform..

[83]  Q. Cronk,et al.  Ultra-barcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA. , 2012, American journal of botany.

[84]  Ching-Ping Lin,et al.  Loss of Different Inverted Repeat Copies from the Chloroplast Genomes of Pinaceae and Cupressophytes and Influence of Heterotachy on the Evaluation of Gymnosperm Phylogeny , 2011, Genome biology and evolution.

[85]  R. Jansen,et al.  Implications of the Plastid Genome Sequence of Typha (Typhaceae, Poales) for Understanding Genome Evolution in Poaceae , 2010, Journal of Molecular Evolution.

[86]  Tracey A Ruhlman,et al.  The Role of Heterologous Chloroplast Sequence Elements in Transgene Integration and Expression1[W][OA] , 2010, Plant Physiology.

[87]  Ian Small,et al.  The potential for manipulating RNA with pentatricopeptide repeat proteins. , 2014, The Plant journal : for cell and molecular biology.

[88]  D. Smith,et al.  A Plastid without a Genome: Evidence from the Nonphotosynthetic Green Algal Genus Polytomella1[W][OPEN] , 2014, Plant Physiology.

[89]  M. Nakai YCF1: A Green TIC: Response to the de Vries et al. Commentary , 2015, Plant Cell.

[90]  C. dePamphilis,et al.  Mechanistic model of evolutionary rate variation en route to a nonphotosynthetic lifestyle in plants , 2016, Proceedings of the National Academy of Sciences.

[91]  A. Barkan,et al.  Pentatricopeptide repeat proteins in plants. , 2014, Annual review of plant biology.

[92]  R. Jansen,et al.  Extreme reconfiguration of plastid genomes in the angiosperm family Geraniaceae: rearrangements, repeats, and codon usage. , 2011, Molecular biology and evolution.

[93]  Shilin Chen,et al.  High-accuracy de novo assembly and SNP detection of chloroplast genomes using a SMRT circular consensus sequencing strategy. , 2014, The New phytologist.

[94]  G. Igloi,et al.  Complete sequence of the maize chloroplast genome: gene content, hotspots of divergence and fine tuning of genetic information by transcript editing. , 1995, Journal of molecular biology.

[95]  T. Shikanai,et al.  Physiological Functions of Cyclic Electron Transport Around Photosystem I in Sustaining Photosynthesis and Plant Growth. , 2016, Annual review of plant biology.

[96]  James Leebens-Mack,et al.  Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns , 2007, Proceedings of the National Academy of Sciences.

[97]  T. Kohchi,et al.  Chloroplast gene organization deduced from complete sequence of liverwort Marchantia polymorpha chloroplast DNA , 1986, Nature.

[98]  Z. Quan,et al.  A precise chloroplast genome of Nelumbo nucifera (Nelumbonaceae) evaluated with Sanger, Illumina MiSeq, and PacBio RS II sequencing platforms: insight into the plastid evolution of basal eudicots , 2014, BMC Plant Biology.

[99]  A. J. Bendich,et al.  DNA maintenance in plastids and mitochondria of plants , 2015, Front. Plant Sci..

[100]  David C. Tank,et al.  A long PCR–based approach for DNA enrichment prior to next-generation sequencing for systematic studies1 , 2014, Applications in plant sciences.

[101]  Jian‐Wu Li,et al.  Lineage-Specific Reductions of Plastid Genomes in an Orchid Tribe with Partially and Fully Mycoheterotrophic Species , 2016, Genome biology and evolution.

[102]  Damon P. Little,et al.  Choosing and Using a Plant DNA Barcode , 2011, PloS one.

[103]  En-Hua Xia,et al.  Contradiction between Plastid Gene Transcription and Function Due to Complex Posttranscriptional Splicing: An Exemplary Study of ycf15 Function and Evolution in Angiosperms , 2013, PloS one.

[104]  J. Palmer,et al.  Function and evolution of a minimal plastid genome from a nonphotosynthetic parasitic plant. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[105]  F. Myouga,et al.  Identification of Nuclear Genes Encoding Chloroplast-Localized Proteins Required for Embryo Development in Arabidopsis1[W][OA] , 2010, Plant Physiology.

[106]  Koichi Yoshinaga,et al.  RNA editing in hornwort chloroplasts makes more than half the genes functional. , 2003, Nucleic acids research.

[107]  T. Shikanai,et al.  A pentatricopeptide repeat protein is essential for RNA editing in chloroplasts , 2005, Nature.

[108]  Burkhard Morgenstern,et al.  kmacs: the k-mismatch average common substring approach to alignment-free sequence comparison , 2014, Bioinform..

[109]  Ralph Bock,et al.  Synthetic biology in plastids. , 2014, The Plant journal : for cell and molecular biology.

[110]  R. Petit,et al.  Comparative organization of chloroplast, mitochondrial and nuclear diversity in plant populations. , 2005, Molecular ecology.

[111]  D. Soltis,et al.  Phylogenomic and structural analyses of 18 complete plastomes across nearly all families of early-diverging eudicots, including an angiosperm-wide analysis of IR gene content evolution. , 2016, Molecular phylogenetics and evolution.