Complex Loci in Human and Mouse Genomes

Mammalian genomes harbor a larger than expected number of complex loci, in which multiple genes are coupled by shared transcribed regions in antisense orientation and/or by bidirectional core promoters. To determine the incidence, functional significance, and evolutionary context of mammalian complex loci, we identified and characterized 5,248 cis-antisense pairs, 1,638 bidirectional promoters, and 1,153 chains of multiple cis-antisense and/or bidirectionally promoted pairs from 36,606 mouse transcriptional units (TUs), along with 6,141 cis-antisense pairs, 2,113 bidirectional promoters, and 1,480 chains from 42,887 human TUs. In both human and mouse, 25% of TUs resided in cis-antisense pairs, only 17% of which were conserved between the two organisms, indicating frequent species specificity of antisense gene arrangements. A sampling approach indicated that over 40% of all TUs might actually be in cis-antisense pairs, and that only a minority of these arrangements are likely to be conserved between human and mouse. Bidirectional promoters were characterized by variable transcriptional start sites and an identifiable midpoint at which overall sequence composition changed strand and the direction of transcriptional initiation switched. In microarray data covering a wide range of mouse tissues, genes in cis-antisense and bidirectionally promoted arrangement showed a higher probability of being coordinately expressed than random pairs of genes. In a case study on homeotic loci, we observed extensive transcription of nonconserved sequences on the noncoding strand, implying that the presence rather than the sequence of these transcripts is of functional importance. Complex loci are ubiquitous, host numerous nonconserved gene structures and lineage-specific exonification events, and may have a cis-regulatory impact on the member genes.

[1]  Laurence D. Hurst,et al.  Evidence for a preferential targeting of 3′-UTRs by cis-encoded natural antisense transcripts , 2005, Nucleic acids research.

[2]  S. Salzberg,et al.  The Transcriptional Landscape of the Mammalian Genome , 2005, Science.

[3]  S. Batalov,et al.  Antisense Transcription in the Mammalian Transcriptome , 2005, Science.

[4]  Philipp Kapranov,et al.  Examples of the complex architecture of the human transcriptome revealed by RACE and high-density tiling arrays. , 2005, Genome research.

[5]  Gianluigi Zanetti,et al.  AntiHunter 2.0: increased speed and sensitivity in searching BLAST output for EST antisense transcripts , 2005, Nucleic Acids Res..

[6]  L. Hurst,et al.  Genome-wide analysis of coordinate expression and evolution of human cis-encoded sense-antisense transcripts. , 2005, Trends in genetics : TIG.

[7]  G. Helt,et al.  Transcriptional Maps of 10 Human Chromosomes at 5-Nucleotide Resolution , 2005, Science.

[8]  Yoshihide Hayashizaki,et al.  Disclosing hidden transcripts: mouse natural sense-antisense transcripts tend to be poly(A) negative and nuclear localized. , 2005, Genome research.

[9]  Xiu-yun Cui,et al.  Anti-tumor effect of hematopoietic cells carrying the gene of ribonuclease inhibitor , 2005, Cancer Gene Therapy.

[10]  Rotem Sorek,et al.  Naturally occurring antisense: transcriptional leakage or real overlap? , 2005, Genome research.

[11]  Thomas E. Royce,et al.  Global Identification of Human Transcribed Sequences with Genome Tiling Arrays , 2004, Science.

[12]  Boris Lenhard,et al.  Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes , 2004, BMC Genomics.

[13]  Y. Hayashizaki,et al.  Identification of region‐specific transcription factor genes in the adult mouse brain by medium‐scale real‐time RT‐PCR , 2004, FEBS letters.

[14]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[15]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Terrence S. Furey,et al.  The DNA sequence and biology of human chromosome 19 , 2004, Nature.

[17]  S. Cawley,et al.  Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22. , 2004, Genome research.

[18]  S. Cawley,et al.  Unbiased Mapping of Transcription Factor Binding Sites along Human Chromosomes 21 and 22 Points to Widespread Regulation of Noncoding RNAs , 2004, Cell.

[19]  Ben Lehner,et al.  In search of antisense. , 2004, Trends in biochemical sciences.

[20]  Michal Galdzicki,et al.  Mammalian overlapping genes: the comparative perspective. , 2004, Genome research.

[21]  R. Myers,et al.  An abundance of bidirectional promoters in the human genome. , 2003, Genome research.

[22]  Xiaoqiu Huang,et al.  Over 20% of human transcripts might form sense-antisense pairs. , 2004, Nucleic acids research.

[23]  Wyeth W. Wasserman,et al.  JASPAR: an open-access database for eukaryotic transcription factor binding profiles , 2004, Nucleic Acids Res..

[24]  J. T. Kadonaga,et al.  The RNA polymerase II core promoter. , 2003, Annual review of biochemistry.

[25]  Paul Denny,et al.  A comprehensive transcript map of the mouse Gnas imprinted complex. , 2003, Genome research.

[26]  Yoshihide Hayashizaki,et al.  Antisense transcripts with FANTOM2 clone set and their implications for gene regulation. , 2003, Genome research.

[27]  M. Fagiolini,et al.  Targeting a complex transcriptome: the construction of the mouse full-length cDNA encyclopedia. , 2003, Genome research.

[28]  Axel Meyer,et al.  Evolutionary conservation of regulatory elements in vertebrate Hox gene clusters. , 2003, Genome research.

[29]  Erez Y. Levanon,et al.  Widespread occurrence of antisense transcription in the human genome , 2003, Nature Biotechnology.

[30]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[31]  M. King,et al.  Novel transcriptional units and unconventional gene pairs in the human genome: toward a sequence-level basis for primate-specific phenotypes? , 2003, Cold Spring Harbor symposia on quantitative biology.

[32]  G. Rubin,et al.  Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Jay Shendure,et al.  Computational discovery of sense-antisense transcription in the human and mouse genomes , 2002, Genome Biology.

[34]  Wyeth W. Wasserman,et al.  TFBS: Computational framework for transcription factor binding site analysis , 2002, Bioinform..

[35]  D. Toczyski,et al.  A unified view of the DNA-damage checkpoint. , 2002, Current opinion in cell biology.

[36]  Kei-Hoi Cheung,et al.  An integrated approach for finding overlooked genes in yeast , 2002, Nature Biotechnology.

[37]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[38]  R. Gibbs,et al.  PipMaker--a web server for aligning two genomic DNA sequences. , 2000, Genome research.

[39]  S Rozen,et al.  Primer3 on the WWW for general users and for biologist programmers. , 2000, Methods in molecular biology.

[40]  Thomas L. Madden,et al.  BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. , 1999, FEMS microbiology letters.

[41]  S. Potter,et al.  Evolutionary conservation and tissue-specific processing of Hoxa 11 antisense transcripts , 1998, Mammalian Genome.

[42]  O. Mor,et al.  The human Surfeit locus. , 1998, Genomics.

[43]  C. Vaquero,et al.  Do natural antisense transcripts make sense in eukaryotes? , 1998, Gene.

[44]  D J Lipman,et al.  Making (anti)sense of non-coding sequence conservation. , 1997, Nucleic acids research.

[45]  W. Gehring,et al.  Homeodomain proteins. , 1994, Annual review of biochemistry.

[46]  R. Simons,et al.  Antisense RNA control in bacteria, phages, and plasmids. , 1994, Annual review of microbiology.

[47]  H. Thiesen,et al.  Target Detection Assay (TDA): a versatile procedure to determine DNA binding sites as demonstrated on SP1 protein. , 1990, Nucleic acids research.

[48]  P. Chomczyński,et al.  Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. , 1987, Analytical biochemistry.