Empirical Analysis of Transcriptional Activity in the Arabidopsis Genome

Functional analysis of a genome requires accurate gene structure information and a complete gene inventory. A dual experimental strategy was used to verify and correct the initial genome sequence annotation of the reference plant Arabidopsis. Sequencing full-length cDNAs and hybridizations using RNA populations from various tissues to a set of high-density oligonucleotide arrays spanning the entire genome allowed the accurate annotation of thousands of gene structures. We identified 5817 novel transcription units, including a substantial amount of antisense gene transcription, and 40 genes within the genetically defined centromeres. This approach resulted in completion of ∼30% of the Arabidopsis ORFeome as a resource for global functional experimentation of the plant proteome.

[1]  AC Tose Cell , 1993, Cell.

[2]  R. Quatrano Genomics , 1998, Plant Cell.

[3]  Stephen J. Elledge,et al.  The univector plasmid-fusion system, a method for rapid construction of recombinant DNA without restriction enzymes , 1998, Current Biology.

[4]  Daniel R. Richards,et al.  Direct allelic variation scanning of the yeast genome. , 1998, Science.

[5]  R D Klausner,et al.  The mammalian gene collection. , 1999, Science.

[6]  M. Marra,et al.  Genetic definition and sequence analysis of Arabidopsis centromeres. , 1999, Science.

[7]  William C. Nierman,et al.  Lin, X. et al. Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana. Nature 402, 761-768 , 1999 .

[8]  M. Cotton,et al.  Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana , 1999, Nature.

[9]  Eugen C. Buehler,et al.  Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana , 1999, Nature.

[10]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[11]  W Ansorge,et al.  Sequence and analysis of chromosome 3 of the plant Arabidopsis thaliana. , 2000, Nature.

[12]  K Mayer,et al.  Sequence and analysis of chromosome 5 of the plant Arabidopsis thaliana. , 2000, Nature.

[13]  L. Steinmetz,et al.  High-Density Arrays and Insights into Genome function , 2000, Biotechnology & genetic engineering reviews.

[14]  Paul Shinn,et al.  Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana , 2000, Nature.

[15]  M. Vidal A Biological Atlas of Functional Maps , 2001, Cell.

[16]  Gregory R. Grant,et al.  Statistical Methods in Bioinformatics , 2001 .

[17]  S. Eddy Non–coding RNA genes and the modern RNA world , 2001, Nature Reviews Genetics.

[18]  D. Botstein,et al.  Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF , 2001, Nature.

[19]  T. Moore,et al.  Open-reading-frame sequence tags (OSTs) support the existence of at least 17,300 genes in C. elegans , 2001, Nature Genetics.

[20]  Piero Carninci,et al.  Balanced-size and long-size cloning of full-length, cap-trapped cDNAs into vectors of the novel lambda-FLC family allows enhanced gene discovery rate and functional analysis. , 2001, Genomics.

[21]  R. Stoughton,et al.  Experimental annotation of the human genome using microarray technology , 2001, Nature.

[22]  P. Rouzé,et al.  Current methods of gene prediction, their strengths and weaknesses. , 2002, Nucleic acids research.

[23]  Joshua Labaer,et al.  The FLEXGene repository: exploiting the fruits of the genome projects by creating a needed resource to face the challenges of the post-genomic era. , 2002, Archives of medical research.

[24]  Shoshi Kikuchi,et al.  A computer-based method of selecting clones for a full-length cDNA project: simultaneous collection of negligibly redundant and variant cDNAs. , 2002, Genome research.

[25]  G. Rubin,et al.  Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[26]  K. Akiyama,et al.  Functional Annotation of a Full-Length Arabidopsis cDNA Collection , 2002, Science.

[27]  C. Town,et al.  Cloning and Sequencing of cDNAs for Hypothetical Genes from Chromosome 2 of Arabidopsis1,212 , 2002, Plant Physiology.

[28]  S. P. Fodor,et al.  Large-Scale Transcriptional Activity in Chromosomes 21 and 22 , 2002, Science.

[29]  Huanming Yang,et al.  A Draft Sequence of the Rice Genome (Oryza sativa L. ssp. indica) , 2002, Science.

[30]  P. Hugenholtz Exploring prokaryotic diversity in the genomic era , 2002, Genome Biology.

[31]  M. A. Rector,et al.  Endogenous and Silencing-Associated Small RNAs in Plants Online version contains Web-only data. Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.003210. , 2002, The Plant Cell Online.

[32]  E. Kolker,et al.  Transcriptome analysis of Escherichia coli using high-density oligonucleotide probe arrays. , 2002, Nucleic acids research.

[33]  Michael Q. Zhang Computational prediction of eukaryotic protein-coding genes , 2002, Nature Reviews Genetics.

[34]  E. Birney,et al.  Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs , 2002, Nature.

[35]  Atsushi Yoshiki,et al.  An expressed pseudogene regulates the messenger-RNA stability of its homologous coding gene , 2003, Nature.

[36]  A. Azad,et al.  Regulation of Elongating RNA Polymerase II by Forkhead Transcription Factors in Yeast , 2003, Science.

[37]  M. Schmid,et al.  Genome-Wide Insertional Mutagenesis of Arabidopsis thaliana , 2003, Science.

[38]  Detlef Weigel,et al.  Large-scale identification of single-feature polymorphisms in complex genomes. , 2003, Genome research.

[39]  R. Sederoff,et al.  Apparent homology of expressed genes from wood-forming tissues of loblolly pine (Pinus taeda L.) with Arabidopsis thaliana , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Erez Y. Levanon,et al.  Widespread occurrence of antisense transcription in the human genome , 2003, Nature Biotechnology.

[41]  J. Rinn,et al.  The transcriptional activity of human Chromosome 22. , 2003, Genes & development.

[42]  宁北芳,et al.  疟原虫var基因转换速率变化导致抗原变异[英]/Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .