Transcriptional Maps of 10 Human Chromosomes at 5-Nucleotide Resolution

Sites of transcription of polyadenylated and nonpolyadenylated RNAs for 10 human chromosomes were mapped at 5–base pair resolution in eight cell lines. Unannotated, nonpolyadenylated transcripts comprise the major proportion of the transcriptional output of the human genome. Of all transcribed sequences, 19.4, 43.7, and 36.9% were observed to be polyadenylated, nonpolyadenylated, and bimorphic, respectively. Half of all transcribed sequences are found only in the nucleus and for the most part are unannotated. Overall, the transcribed portions of the human genome are predominantly composed of interlaced networks of both poly A+ and poly A– annotated transcripts and unannotated transcripts of unknown function. This organization has important implications for interpreting genotype-phenotype associations, regulation of gene expression, and the definition of a gene.
