Distinct class of putative "non-conserved" promoters in humans: comparative studies of alternative promoters of human and mouse genes.

Although recent studies have revealed that the majority of human genes are subject to regulation by alternative promoters, the biological relevance of this phenomenon remains unclear. We have also demonstrated that roughly half of the human RefSeq genes examined contain putative alternative promoters (PAPs). Here we report large-scale comparative studies of PAPs between human and mouse counterpart genes. Detailed sequence comparison of the 17,245 putative promoter regions (PPRs) in 5,463 PAP-containing human genes revealed that PPRs in only a minor fraction of genes (807 genes) showed clear evolutionary conservation as one or more pairs. We also found substantial qualitative differences between conserved and non-conserved PPRs: the latter class consists of AT-rich PPRs of relatively minor usage that are enriched in repetitive elements and sometimes produce transcripts encoding small or no proteins. Systematic luciferase assays of these PPRs revealed that both classes had promoter activity, but that their strength ranges were significantly different. Furthermore, we demonstrate that these characteristic features of the non-conserved PPRs are shared with the PPRs of previously discovered putative non-protein-coding transcripts. Taken together, our data suggest that there are two distinct classes of promoters in humans, with the non-conserved class emerging frequently during evolution.
