Comparative promoter analysis and its application in analysis of PTH-regulated gene expression.

Taking advantage of the "working draft" of the human genome and the MIT shotgun assembly of the mouse genome, we performed a comparative promoter analysis of human RefSeq mRNA (sequences from GenBank's RefSeq database). By combining this analysis with a transcription factor (TF) binding site analysis using a TRANSFAC position weight matrix (PWM) search, 86% of non-specific TF sites were removed. Using a set of genes that are regulated by parathyroid hormone (PTH), a statistical analysis was performed on the conserved TF binding sites among a set of eight human and mouse genes. From among the eight genes tested, we obtained a set of 31 TFs, suggesting possible roles for associated genes in PTH-mediated pathways. All three known PTH-responsive TFs (AP1, RUNX2, CREB) were correctly predicted by this analysis as well as two other potential TFs (VDR and CEBP Delta). Additionally, a model was made to describe the TF site characteristic module of PTH-regulated genes. This model was then used to search all human RefSeq gene promoters with established human-mouse ortholog relationships to identify other PTH-regulated genes. This comparative approach combined with statistical analysis proved to be sufficiently specific to decipher critical TFs involved in PTH-regulated pathways.

[1]  G. Stormo,et al.  Ann-spec: a Method for Discovering Transcription Factor Binding Sites with Improved Specificity , 2022 .

[2]  Saurabh Sinha,et al.  A Statistical Method for Finding Transcription Factor Binding Sites , 2000, ISMB.

[3]  G. Church,et al.  Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. , 2000, Journal of molecular biology.

[4]  W. Wasserman,et al.  A predictive model for regulatory sequences directing liver-specific transcription. , 2001, Genome research.

[5]  M. Blanchette,et al.  Discovery of regulatory elements by a computational method for phylogenetic footprinting. , 2002, Genome research.

[6]  R. Gibbs,et al.  Large-scale comparative sequence analysis of the human and murine Bruton's tyrosine kinase loci reveals conserved regulatory domains. , 1997, Genome research.

[7]  M. Adams,et al.  A tool for analyzing and annotating genomic sequences. , 1997, Genomics.

[8]  T. Werner,et al.  Highly specific localization of promoter regions in large genomic sequences by PromoterInspector: a novel context analysis approach. , 2000, Journal of molecular biology.

[9]  Gary D. Stormo,et al.  Identifying DNA and protein patterns with statistically significant alignments of multiple sequences , 1999, Bioinform..

[10]  J. Collado-Vides,et al.  Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. , 1998, Journal of molecular biology.

[11]  H. Munro,et al.  Mammalian protein metabolism , 1964 .

[12]  K. Watanabe,et al.  Interleukin (IL)-4 and IL-13 inhibit the differentiation of murine osteoblastic MC3T3-E1 cells. , 2000, Endocrine journal.

[13]  Andreas Wagner,et al.  Genes regulated cooperatively by one or more transcription factors and their identification in whole eukaryotic genomes , 1999, Bioinform..

[14]  L. Pachter,et al.  rVista for comparative sequence-based discovery of functional transcription factor binding sites. , 2002, Genome research.

[15]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[16]  G. Rubin,et al.  Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Luquan Wang,et al.  Computational analysis of composite regulatory elements , 2002, Mammalian Genome.

[18]  J. Fickett,et al.  Eukaryotic promoter recognition. , 1997, Genome research.

[19]  Holger Karas,et al.  TRANSFAC: a database on transcription factors and their DNA binding sites , 1996, Nucleic Acids Res..

[20]  J. W. Rooney,et al.  Novel NFAT sites that mediate activation of the interleukin-2 promoter in response to T-cell receptor stimulation , 1995, Molecular and cellular biology.

[21]  Yaolin Wang,et al.  Analyses of p53 Target Genes in the Human Genome by Bioinformatic and Microarray Approaches* 210 , 2001, The Journal of Biological Chemistry.

[22]  R. Hardison Conserved noncoding sequences are reliable guides to regulatory elements. , 2000, Trends in genetics : TIG.

[23]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[24]  G. Strewler,et al.  Parathyroid hormonelike protein from human renal carcinoma cells. Structural and functional homology with parathyroid hormone. , 1987, The Journal of clinical investigation.

[25]  W. Miller,et al.  Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. , 2000, Science.

[26]  N. Selvamurugan,et al.  Parathyroid hormone-dependent signaling pathways regulating genes in bone cells. , 2002, Gene.

[27]  C. Lawrence,et al.  Human-mouse genome comparisons to locate regulatory sites , 2000, Nature Genetics.

[28]  Xin Chen,et al.  The TRANSFAC system on gene expression regulation , 2001, Nucleic Acids Res..

[29]  W Miller,et al.  Locus control regions of mammalian beta-globin gene clusters: combining phylogenetic analyses and experimental results to gain functional insights. , 1997, Gene.

[30]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[31]  T. Heinemeyer,et al.  Databases on transcriptional regulation: TRANSFAC, TRRD and COMPEL , 1998, Nucleic Acids Res..