Gene Discovery and Tissue-Specific Transcriptome Analysis in Chickpea with Massively Parallel Pyrosequencing and Web Resource Development

Chickpea (Cicer arietinum) is an important food legume crop, but few genomic resources are available for it. In this study, we generated about 2 million high-quality sequences with an average length of 372 bp using pyrosequencing technology. Optimization of de novo assembly indicated that a hybrid assembly of the long-read and short-read primary assemblies gave better results. The hybrid assembly generated a set of 34,760 transcripts with an average length of 1,020 bp, representing about 4.8% (35.5 Mb) of the total chickpea genome. We identified more than 4,000 simple sequence repeats, which can be developed as functional molecular markers in chickpea. Putative functions and Gene Ontology terms were assigned to at least 73.2% and 71.0% of chickpea transcripts, respectively. We also identified several chickpea transcripts that showed tissue-specific expression and validated the results using real-time polymerase chain reaction analysis. Based on sequence comparison with other species within the plant kingdom, we identified two sets of lineage-specific genes: those conserved in the Fabaceae family (legume specific) and those lacking significant similarity to any non-chickpea species (chickpea specific). Finally, we developed a Web resource, the Chickpea Transcriptome Database, which provides public access to the data and results reported in this study. The strategy for optimizing de novo assembly presented here may facilitate transcriptome sequencing and characterization in other organisms. Most importantly, the data and results reported in this study will help accelerate genomics research and the implementation of breeding programs in chickpea.
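The assembly figures quoted above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the study's pipeline: it multiplies the transcript count by the average length to recover the roughly 35.5 Mb total, then divides by the stated 4.8% to show the genome size that those numbers imply. The function and constant names are hypothetical, and the implied genome size is a back-calculation from rounded values, so it should be read as approximate.

```python
# Figures quoted in the abstract (rounded values as reported).
N_TRANSCRIPTS = 34_760      # transcripts in the hybrid assembly
MEAN_LENGTH_BP = 1_020      # average transcript length in bp
GENOME_FRACTION = 0.048     # "about 4.8%" of the chickpea genome


def transcriptome_size_bp(n_transcripts: int, mean_length_bp: int) -> int:
    """Total assembled length = number of transcripts x average length."""
    return n_transcripts * mean_length_bp


def implied_genome_size_bp(transcriptome_bp: int, fraction: float) -> float:
    """Genome size implied by the transcriptome covering `fraction` of it."""
    return transcriptome_bp / fraction


total_bp = transcriptome_size_bp(N_TRANSCRIPTS, MEAN_LENGTH_BP)
genome_bp = implied_genome_size_bp(total_bp, GENOME_FRACTION)

print(f"Assembled transcriptome: {total_bp / 1e6:.1f} Mb")
print(f"Implied genome size:     {genome_bp / 1e6:.0f} Mb (approximate)")
```

The product, 35,455,200 bp, matches the 35.5 Mb reported in the abstract, which confirms that the transcript count, average length, and total size are mutually consistent.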
