The Medicago sativa gene index 1.2: a web-accessible gene expression atlas for investigating expression differences between Medicago sativa subspecies

Background: Alfalfa (Medicago sativa L.) is the primary forage legume crop species in the United States and plays essential economic and ecological roles in agricultural systems across the country. Modern alfalfa is the result of hybridization between tetraploid M. sativa ssp. sativa and M. sativa ssp. falcata. Because alfalfa has a large and complex genome, few genomic resources are available for its improvement.

Results: A de novo transcriptome assembly from two alfalfa subspecies, M. sativa ssp. sativa (B47) and M. sativa ssp. falcata (F56), was developed using Illumina RNA-seq technology. Transcripts from roots, nitrogen-fixing root nodules, leaves, flowers, elongating stem internodes, and post-elongation stem internodes were assembled into the Medicago sativa Gene Index 1.2 (MSGI 1.2), representing 112,626 unique transcript sequences. Nodule-specific transcripts and transcripts involved in cell wall biosynthesis were identified. Statistical analyses identified 20,447 transcripts differentially expressed between the two subspecies. Pair-wise comparisons of each tissue combination identified 58,932 sequences differentially expressed in B47 and 69,143 sequences differentially expressed in F56. Comparing transcript abundance in floral tissues of B47 and F56 identified expression differences in sequences involved in anthocyanin and carotenoid synthesis, which determine flower pigmentation. Single nucleotide polymorphisms (SNPs) unique to each M. sativa subspecies (110,241) were identified.

Conclusions: The Medicago sativa Gene Index 1.2 increases the expressed sequence data available for alfalfa by ninefold and can be expanded as additional experiments are performed. The MSGI 1.2 transcriptome sequences, annotations, expression profiles, and SNPs were assembled into the Alfalfa Gene Index and Expression Database (AGED) at http://plantgrn.noble.org/AGED/, a publicly available genomic resource for alfalfa improvement and legume research.
