rSNP_Guide, a database system for analysis of transcription factor binding to DNA with variations: application to genome annotation

The analysis of gene regulatory networks has become one of the most challenging problems of the postgenomic era. Earlier we developed rSNP_Guide (http://util.bionet.nsc.ru/databases/rsnp.html), a computer system and database devoted to prediction of transcription factor (TF) binding sites (TF sites), which can be responsible for disease phenotypes. The prediction results were confirmed by 70 known relationships between TF sites and diseases, as well as by site-directed mutagenesis data. The rSNP_Guide is being investigated as a tool for TF site annotation. Previously analyzed and characterized cases of altered TF sites were used to annotate potential sites of the same type and at the same location in homologous genes. Based on 20 TF sites with known alterations in TF binding to DNA, we localized 245 potential TF sites in homologous genes. For these potential TF sites, rSNP_Guide estimates TF-DNA interaction according to three categories: 'present', 'weak', and 'absent'. The significance of each assignment is statistically measured.

[1]  S. Langdon,et al.  Gamma-globin gene promoter elements required for interaction with globin enhancers. , 1998, Blood.

[2]  Julia V Ponomarenko,et al.  rSNP_Guide: An integrated database‐tools system for studying SNPs and site‐directed mutations in transcription factor binding sites , 2002, Human mutation.

[3]  Akinori Sarai,et al.  rSNP_Guide, a database system for analysis of transcription factor binding to target sequences: application to SNPs and site-directed mutations , 2001, Nucleic Acids Res..

[4]  Rolf Apweiler,et al.  The EBI SRS server-new features , 2002, Bioinform..

[5]  Kevin Marsh,et al.  A polymorphism that affects OCT-1 binding to the TNF promoter region is associated with severe malaria , 1999, Nature Genetics.

[6]  Steven C. Hunt,et al.  Molecular basis of human hypertension: Role of angiotensinogen , 1992, Cell.

[7]  Scott Langdon,et al.  Gamma-Globin Gene Promoter Elements Required for Interaction With Globin Enhancers , 1998 .

[8]  Hanah Margalit,et al.  A Structure-Based Approach for Prediction of Protein Binding Sites in Gene-Upstream Regions , 2000, Pacific Symposium on Biocomputing.

[9]  C. Klinge Estrogen receptor interaction with estrogen response elements. , 2001, Nucleic acids research.

[10]  T. Merkulova,et al.  Point mutations within 663–666 bp of intron 6 of the human TDO2 gene, associated with a number of psychiatric disorders, damage the YY‐1 transcription factor binding site , 1999, FEBS letters.

[11]  Kei-Hoi Cheung,et al.  ALFRED: an allele frequency database for diverse populations and DNA polymorphisms , 2000, Nucleic Acids Res..

[12]  V. McKusick Mendelian inheritance in man , 1971 .

[13]  A Kumar,et al.  Role of C/A polymorphism at -20 on the expression of human angiotensinogen gene. , 1999, Hypertension.

[14]  J. Naylor,et al.  Mendelian inheritance in man: A catalog of human genes and genetic disorders , 1996 .

[15]  Y. Tsutsumi‐Ishii,et al.  Response of heat shock element within the human HSP70 promoter to mutated p53 genes. , 1995, Cell growth & differentiation : the molecular biology journal of the American Association for Cancer Research.

[16]  P. Bucher,et al.  Experimental analysis and computer prediction of CTF/NFI transcription factor DNA binding sites. , 2000, Journal of molecular biology.

[17]  P. Stenson,et al.  Human Gene Mutation Database—A biomedical information and research resource , 2000, Human mutation.

[18]  J. Fickett,et al.  Identification of regulatory regions which confer muscle-specific gene expression. , 1998, Journal of molecular biology.

[19]  Julia V. Ponomarenko,et al.  Mining DNA sequences to predict sites which mutations cause genetic diseases , 2002, Knowl. Based Syst..

[20]  Yan P. Yuan,et al.  HGBASE: a database of SNPs and other variations in and around human genes , 2000, Nucleic Acids Res..

[21]  C. Lawrence,et al.  Human-mouse genome comparisons to locate regulatory sites , 2000, Nature Genetics.

[22]  T. Werner,et al.  MatInd and MatInspector: new fast and versatile tools for detection of consensus matches in nucleotide sequence data. , 1995, Nucleic acids research.

[23]  Jie Zhou,et al.  Orphan receptor Arp-1 binds to the nucleotide sequence located between TATA box and transcriptional initiation site of the human angiotensinogen gene and reduces estrogen induced promoter activity , 1999, Molecular and Cellular Endocrinology.

[24]  Elizabeth M. Smigielski,et al.  dbSNP: a database of single nucleotide polymorphisms , 2000, Nucleic Acids Res..