rSNP_Guide: An integrated database‐tools system for studying SNPs and site‐directed mutations in transcription factor binding sites

Since the human genome was sequenced in draft, single nucleotide polymorphism (SNP) analysis has become one of the keynote fields of bioinformatics. We have developed an integrated database‐tools system, rSNP_Guide (http://wwwmgs.bionet.nsc.ru/mgs/systems/rsnp/), devoted to prediction of transcription factor (TF) binding sites, alterations of which could be associated with disease phenotype. By inputting data on alterations in DNA sequence and in DNA binding pattern of an unknown TF, rSNP_Guide searches for a known TF with alterations in the recognition score calculated on the basis of TF site's sequence and consistent with the input alterations in DNA binding to the unknown TF. Our system has been tested on many relationships between known TF sites and diseases, as well as on site‐directed mutagenesis data. Experimental verification of rSNP_Guide system was made on functionally important SNPs in human TDO2and mouse K‐ras genes. Additional examples of analysis are reported involving variants in the human γA‐globin (HBG1), hsp70(HSPA1A), and Factor IX (F9) gene promoters. Hum Mutat 20:239–248, 2002. © 2002 Wiley‐Liss, Inc.

[1]  Akinori Sarai,et al.  rSNP_Guide, a database system for analysis of transcription factor binding to target sequences: application to SNPs and site-directed mutations , 2001, Nucleic Acids Res..

[2]  Perry L. Miller,et al.  ALFRED: an allele frequency database for diverse populations and DNA polymorphisms - an update , 2001, Nucleic Acids Res..

[3]  P. Stenson,et al.  Human Gene Mutation Database—A biomedical information and research resource , 2000, Human mutation.

[4]  Nicolas Mermod,et al.  Evaluation of computer tools for the prediction of transcription factor binding sites on genomic DNA , 1998, Silico Biol..

[5]  K. Kurachi,et al.  Structural and functional basis of the developmental regulation of human coagulation factor IX gene: factor IX Leyden. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Kevin Marsh,et al.  A polymorphism that affects OCT-1 binding to the TNF promoter region is associated with severe malaria , 1999, Nature Genetics.

[7]  T. Merkulova,et al.  Point mutations within 663–666 bp of intron 6 of the human TDO2 gene, associated with a number of psychiatric disorders, damage the YY‐1 transcription factor binding site , 1999, FEBS letters.

[8]  M. Crossley,et al.  Disruption of a C/EBP binding site in the factor IX promoter is associated with haemophilia B , 1990, Nature.

[9]  J. Cowell,et al.  A novel mutation in the promotor region in a family with a mild form of retinoblastoma indicates the location of a new regulatory domain for the RB1 gene. , 1996, Oncogene.

[10]  Mikhail S. Gelfand,et al.  SELEX_DB: a database on in vitro selected oligomers adapted for recognizing natural sites and for analyzing both SNPs and site-directed mutagenesis data , 2002, Nucleic Acids Res..

[11]  C. Cazeneuve,et al.  Three novel sequence variations in the 5′ upstream region of the cystic fibrosis transmembrane conductance regulator (CFTR) gene: two polymorphisms and one putative molecular defect , 1995, Human Genetics.

[12]  Julia V. Ponomarenko,et al.  Mining DNA sequences to predict sites which mutations cause genetic diseases , 2002, Knowl. Based Syst..

[13]  Akinori Sarai,et al.  ACTIVITY: a database on DNA/RNA sites activity adapted to apply sequence-activity relationships from one system to another , 2001, Nucleic Acids Res..

[14]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[15]  R. Mantovani,et al.  The effects of HPFH mutations in the human gamma-globin promoter on binding of ubiquitous and erythroid specific nuclear factors. , 1988, Nucleic acids research.

[16]  F H Ruddle,et al.  KRAS2 as a genetic marker for lung tumor susceptibility in inbred mice. , 1987, Journal of the National Cancer Institute.

[17]  D. Lillicrap,et al.  Binding of the Ets factor GA-binding protein to an upstream site in the factor IX promoter is a critical event in transactivation , 1996, Molecular and cellular biology.

[18]  D. Valle,et al.  Online Mendelian Inheritance In Man (OMIM) , 2000, Human mutation.

[19]  J. Kaysen,et al.  Protein-DNA interactions upstream from the human A gamma globin gene. , 1990, Nucleic acids research.

[20]  H. Wu,et al.  NF-kappa B activation of p53. A potential mechanism for suppressing cell growth in response to stress. , 1994, The Journal of biological chemistry.

[21]  C. Nathan,et al.  Role of interferon regulatory factor 1 in induction of nitric oxide synthase , 1994, The Journal of experimental medicine.

[22]  Gordon Vansant,et al.  An Alu Element in the Myeloperoxidase Promoter Contains a Composite SP1-Thyroid Hormone-Retinoic Acid Response Element* , 1996, The Journal of Biological Chemistry.

[23]  M. Matsuda,et al.  Delta-thalassemia caused by disruption of the site for an erythroid-specific transcription factor, GATA-1, in the delta-globin gene promoter. , 1992, Blood.

[24]  P. Lønning,et al.  CYP17 and breast cancer risk: the polymorphism in the 5' flanking area of the gene does not influence binding to Sp-1. , 1999, Cancer research.

[25]  D. Comings,et al.  Exon and intron variants in the human tryptophan 2,3-dioxygenase gene: potential association with Tourette syndrome, substance abuse and other disorders. , 1996, Pharmacogenetics.

[26]  Yan P. Yuan,et al.  HGBASE: a database of SNPs and other variations in and around human genes , 2000, Nucleic Acids Res..

[27]  C. Murphy,et al.  Assembly of the nuclear transcription and processing machinery: Cajal bodies (coiled bodies) and transcriptosomes. , 1999, Molecular biology of the cell.

[28]  A D Auerbach,et al.  Phenotypic consequences of mutations in the Fanconi anemia FAC gene: an International Fanconi Anemia Registry study. , 1997, Blood.

[29]  Pui-Yan Kwok,et al.  Single-nucleotide polymorphisms in the public domain: how useful are they? , 2001, Nature Genetics.

[30]  G. Christian Overton,et al.  Oligonucleotide frequency matrices addressed to recognizing functional DNA sites , 1999, Bioinform..

[31]  T. Elton,et al.  An A/T-rich cis-element is essential for rat angiotensin II type 1A receptor transcription in vascular smooth muscle cells. , 1999, Biochimica et biophysica acta.

[32]  Y. Tsutsumi‐Ishii,et al.  Response of heat shock element within the human HSP70 promoter to mutated p53 genes. , 1995, Cell growth & differentiation : the molecular biology journal of the American Association for Cancer Research.

[33]  Brian P. Brunk,et al.  EpoDB: a prototype database for the analysis of genes expressed during vertebrate erythropoiesis , 1999, Nucleic Acids Res..

[34]  P. Bucher,et al.  Experimental analysis and computer prediction of CTF/NFI transcription factor DNA binding sites. , 2000, Journal of molecular biology.

[35]  Scott Langdon,et al.  Gamma-Globin Gene Promoter Elements Required for Interaction With Globin Enhancers , 1998 .

[36]  T. Werner,et al.  MatInd and MatInspector: new fast and versatile tools for detection of consensus matches in nucleotide sequence data. , 1995, Nucleic acids research.

[37]  M. You,et al.  The second intron of the K-ras gene contains regulatory elements associated with mouse lung tumor susceptibility. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[38]  M. Daly,et al.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms , 2001, Nature.

[39]  R. Casu,et al.  Delta-thalassemia due to a mutation in an erythroid-specific binding protein sequence 3' to the delta-globin gene. , 1992, Blood.