Comprehensive Sequence Analysis of 24,783 Barley Full-Length cDNAs Derived from 12 Clone Libraries1[W][OA]

Full-length cDNA (FLcDNA) libraries consisting of 172,000 clones were constructed from a two-row malting barley cultivar (Hordeum vulgare ‘Haruna Nijo’) under normal and stressed conditions. After sequencing the clones from both ends and clustering the sequences, a total of 24,783 complete sequences were produced. By removing duplicates between these and publicly available sequences, 22,651 representative sequences were obtained: 17,773 were novel barley FLcDNAs, and 1,699 were barley specific. Highly conserved genes were found in the barley FLcDNA sequences for 721 of 881 rice (Oryza sativa) trait genes with 50% or greater identity. These FLcDNA resources from our Haruna Nijo cDNA libraries and the full-length sequences of representative clones will improve our understanding of the biological functions of genes in barley, which is the cereal crop with the fourth highest production in the world, and will provide a powerful tool for annotating the barley genome sequences that will become available in the near future.

[1]  Ute Baumann,et al.  An atlas of gene expression from seed to seed through barley development , 2006, Functional & Integrative Genomics.

[2]  Masakazu Satou,et al.  RIKEN Arabidopsis full-length (RAFL) cDNA and its applications for expression profiling under abiotic stress conditions. , 2003, Journal of experimental botany.

[3]  T. Takabe,et al.  Comparative transcriptome analyses of barley and rice under salt stress , 2006, Theoretical and Applied Genetics.

[4]  Richard Mott,et al.  EST_GENOME: a program to align spliced DNA sequences to unspliced genomic DNA , 1997, Comput. Appl. Biosci..

[5]  Uwe Scholz,et al.  Barley Grain Maturation and Germination: Metabolic Pathway and Regulatory Network Commonalities and Differences Highlighted by New MapMan/PageMan Profiling Tools1[W][OA] , 2008, Plant Physiology.

[6]  Nozomu Sakurai,et al.  Large-scale analysis of full-length cDNAs from the tomato (Solanum lycopersicum) cultivar Micro-Tom, a reference system for the Solanaceae genomics , 2010, BMC Genomics.

[7]  A. Graner,et al.  Barley Genomics: An Overview , 2008, International journal of plant genomics.

[8]  A. Wahid,et al.  Dehydrin gene expression provides an indicator of low temperature and drought stress: transcriptome-based analysis of Barley (Hordeum vulgare L.) , 2008, Functional & Integrative Genomics.

[9]  C. Bult,et al.  Functional annotation of a full-length mouse cDNA collection , 2001, Nature.

[10]  Jianxin Ma,et al.  Consistent over-estimation of gene number in complex plant genomes. , 2004, Current opinion in plant biology.

[11]  Terry Gaasterland,et al.  Genome-wide prediction and identification of cis-natural antisense transcripts in Arabidopsis thaliana , 2005, Genome Biology.

[12]  K. Akiyama,et al.  Functional Annotation of a Full-Length Arabidopsis cDNA Collection , 2002, Science.

[13]  Piero Carninci,et al.  High-efficiency full-length cDNA cloning by biotinylated CAP trapper. , 1996, Genomics.

[14]  Y. Suzuki,et al.  Construction and characterization of a full length-enriched and a 5'-end-enriched cDNA library. , 1997, Gene.

[15]  Kazuo Shinozaki,et al.  Sequencing and Analysis of Approximately 40 000 Soybean cDNA Clones from a Full-Length-Enriched cDNA Library , 2008, DNA research : an international journal for rapid publication of reports on genes and genomes.

[16]  Carol Soderlund,et al.  Sequencing, Mapping, and Analysis of 27,455 Maize Full-Length cDNAs , 2009, PLoS genetics.

[17]  Rolf Apweiler,et al.  InterProScan - an integration platform for the signature-recognition methods in InterPro , 2001, Bioinform..

[18]  Uwe Scholz,et al.  Gene Content and Virtual Gene Order of Barley Chromosome 1H1[C][W][OA] , 2009, Plant Physiology.

[19]  Tatiana A. Tatusova,et al.  NCBI Reference Sequences: current status, policy and new initiatives , 2008, Nucleic Acids Res..

[20]  Kanako O. Koyanagi,et al.  Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana. , 2007, Genome research.

[21]  Y. Sakaki,et al.  Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns , 2009, BMC Genomics.

[22]  Takuji Sasaki,et al.  The map-based sequence of the rice genome , 2005, Nature.

[23]  Y. Yamazaki,et al.  Oryzabase. An Integrated Biological and Genome Information Database for Rice1[OA] , 2005, Plant Physiology.

[24]  B. S. Bushman,et al.  Transcripts Associated with Non‐Acclimated Freezing Response in Two Barley Cultivars , 2008 .

[25]  Yoshihiro Kawahara,et al.  The Rice Annotation Project Database (RAP-DB): 2008 update , 2007, Nucleic Acids Res..

[26]  Rachael P. Huntley,et al.  The GOA database in 2009—an integrated Gene Ontology Annotation resource , 2008, Nucleic Acids Res..

[27]  K. Takeda,et al.  Construction and Characterization of a Bacterial Artificial Chromosome (BAC) Library from the Japanese Malting Barley variety ‘Haruna Nijo’ , 2007 .

[28]  K. Takeda,et al.  A high-density transcript linkage map of barley derived from a single population , 2009, Heredity.

[29]  Marc Strickert,et al.  Gene expression patterns reveal tissue-specific signaling networks controlling programmed cell death and ABA- regulated maturation in developing barley seeds. , 2006, The Plant journal : for cell and molecular biology.

[30]  Dawn H. Nagel,et al.  The B73 Maize Genome: Complexity, Diversity, and Dynamics , 2009, Science.

[31]  The UniProt Consortium,et al.  The Universal Protein Resource (UniProt) 2009 , 2008, Nucleic Acids Res..

[32]  A. Wahid,et al.  Expression analysis of barley (Hordeum vulgare L.) during salinity stress , 2006, Functional & Integrative Genomics.

[33]  Rod A Wing,et al.  A New Resource for Cereal Genomics: 22K Barley GeneChip Comes of Age1 , 2004, Plant Physiology.

[34]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt) , 2004, Nucleic Acids Res..

[35]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[36]  Sai Guna Ranjan Gurazada,et al.  Genome sequencing and analysis of the model grass Brachypodium distachyon , 2010, Nature.

[37]  P. Langridge,et al.  Application of genomics to molecular breeding of wheat and barley. , 2007, Advances in genetics.

[38]  Piero Carninci,et al.  Normalization and subtraction of cap-trapper-selected cDNAs to prepare full-length cDNA libraries for rapid discovery of new genes. , 2000, Genome research.

[39]  G. Bernardi,et al.  The new genes of rice: a closer look. , 2004, Trends in plant science.

[40]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[41]  N. Nomura,et al.  Complete sequencing and characterization of 21,243 full-length human cDNAs , 2004, Nature Genetics.

[42]  Robert D. Finn,et al.  InterPro: the integrative protein signature database , 2008, Nucleic Acids Res..

[43]  K. Maruyama,et al.  Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides. , 1994, Gene.

[44]  Li Yang,et al.  MIPSPlantsDB—plant database resource for integrative and comparative plant genome research , 2007, Nucleic Acids Res..

[45]  Kanako O. Koyanagi,et al.  Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones , 2004, PLoS Biology.

[46]  李佩芳 International Rice Genome Sequencing Project. 2005. The map-based sequence of the rice genome. , 2005 .

[47]  J. Kawai,et al.  Collection, Mapping, and Annotation of Over 28,000 cDNA Clones from japonica Rice , 2003, Science.

[48]  P. Langridge,et al.  The International Barley Sequencing Consortium—At the Threshold of Efficient Access to the Barley Genome1[W] , 2009, Plant Physiology.

[49]  N. Alexandrov,et al.  Features of Arabidopsis Genes and Genome Discovered using Full-length cDNAs , 2005, Plant Molecular Biology.

[50]  Mihaela M. Martis,et al.  The Sorghum bicolor genome and the diversification of grasses , 2009, Nature.

[51]  G. Bernardi,et al.  Incorrectly predicted genes in rice? , 2004, Gene.

[52]  Shoshi Kikuchi,et al.  Antisense transcripts with rice full-length cDNAs , 2003, Genome Biology.

[53]  H. Bohnert,et al.  Journal of Experimental Botany Advance Access published November 16, 2006 Journal of Experimental Botany, Page 1 of 12 Integrated Approaches to Sustain and Improve Plant Production under Drought Stress Special Issue , 2006 .

[54]  H. Kanamori,et al.  Barley grain with adhering hulls is controlled by an ERF family transcription factor gene regulating a lipid biosynthesis pathway , 2008, Proceedings of the National Academy of Sciences.

[55]  Piero Carninci,et al.  Construction of a full-length cDNA library from young spikelets of hexaploid wheat and its characterization by large-scale sequencing of expressed sequence tags. , 2004, Genes & genetic systems.

[56]  Matthew D. Wilkerson,et al.  PlantGDB: a resource for comparative plant genomics , 2007, Nucleic Acids Res..

[57]  Kazuo Shinozaki,et al.  Development of 5006 Full-Length CDNAs in Barley: A Tool for Accessing Cereal Genomics Resources , 2009, DNA research : an international journal for rapid publication of reports on genes and genomes.