The Spirodela polyrhiza genome reveals insights into its neotenous reduction, fast growth and aquatic lifestyle

The subfamily Lemnoideae belongs to a different order than the other monocotyledonous species sequenced so far and comprises aquatic plants that grow rapidly on the water surface. Here we select Spirodela polyrhiza for whole-genome sequencing. We show that the Spirodela genome has no signs of recent retrotransposition but carries signatures of two ancient whole-genome duplications, possibly 95 million years ago (mya), older than those in Arabidopsis and rice. Its genome has only 19,623 predicted protein-coding genes, 28% fewer than the dicotyledonous Arabidopsis thaliana and 50% fewer than the monocotyledonous rice. We propose that, at least in part, the neotenous reduction of these aquatic plants is based on readjusted copy numbers of promoters and repressors of the juvenile-to-adult transition. The Spirodela genome, along with the plant's unique biology and physiology, will stimulate new insights into environmental adaptation, ecology, evolution and plant development, and will be instrumental for future bioenergy applications.
