Systematic Analysis of Disease-Related Regulatory Mutation Classes Reveals Distinct Effects on Transcription Factor Binding

Detailed knowledge of the mechanisms of transcriptional regulation is essential in understanding the gene expression in its entirety. Transcription is regulated, among other things, by transcription factors that bind to DNA and can enhance or repress the transcription process. If a transcription factor fails to bind to DNA or binds to a wrong DNA region that can cause severe effects to the gene expression, to the cell and even to the individual. The problems in transcription factor binding can be caused by alterations in DNA structure which often occurs when parts of the DNA strands are mutated. An increasing number of the identified disease-related mutations occur in gene regulatory sequences. These regulatory mutations can disrupt transcription factor binding sites or create new ones. We have studied effects of mutations on transcription factor binding affinity computationally. We have compared our results with experimentally verified cases where a mutation in a gene regulatory region either creates a new transcription factor binding site or deletes a previously existing one. We have investigated the statistical properties of the changes on transcription factor binding affinity according to the mutation type. Our analysis shows that the probability of a loss of a transcription factor binding site and a creation of a new one varies remarkably by the mutation type. Our results demonstrate that computational analysis provides valuable information about the effect of mutations on transcription factor binding sites. The analysis results also give a useful test set for in vitro studies of regulatory mutation effects.

[1]  D. Darling The Kolmogorov-Smirnov, Cramer-von Mises Tests , 1957 .

[2]  F. Collins,et al.  G gamma beta+ hereditary persistence of fetal hemoglobin: cosmid cloning and identification of a specific mutation 5' to the G gamma gene. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[3]  R Staden Computer methods to locate signals in nucleic acid sequences , 1984, Nucleic Acids Res..

[4]  R. Sauer,et al.  Transcription factors: structural families and principles of DNA recognition. , 1992, Annual review of biochemistry.

[5]  M. Matsuda,et al.  Delta-thalassemia caused by disruption of the site for an erythroid-specific transcription factor, GATA-1, in the delta-globin gene promoter. , 1992, Blood.

[6]  R. Harrington DNA curving and bending in protein–DNA recognition , 1992, Molecular microbiology.

[7]  R. Krumlauf,et al.  A G → A substitution in an HNF I binding site in the human α-fetoprotein gene is associated with hereditary persistence of α-fetoprotein (HPAFP) , 1993 .

[8]  R. Krumlauf,et al.  A G-->A substitution in an HNF I binding site in the human alpha-fetoprotein gene is associated with hereditary persistence of alpha-fetoprotein (HPAFP). , 1993, Human molecular genetics.

[9]  P. Newburger,et al.  Mutations in the promoter region of the gene for gp91-phox in X-linked chronic granulomatous disease with decreased expression of cytochrome b558. , 1994, The Journal of clinical investigation.

[10]  D. Cooper,et al.  Disruption of a binding site for hepatocyte nuclear factor 1 in the protein C gene promoter is associated with hereditary thrombophilia. , 1994, Human molecular genetics.

[11]  M Suzuki,et al.  DNA recognition code of transcription factors in the helix-turn-helix, probe helix, hormone receptor, and zinc finger families. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[12]  M. Suzuki,et al.  DNA conformation and its changes upon binding transcription factors. , 1996, Advances in biophysics.

[13]  Elaine H. Zackai,et al.  Identification of a Mutation in a GATA Binding Site of the Platelet Glycoprotein Ibβ Promoter Resulting in the Bernard-Soulier Syndrome* , 1996, The Journal of Biological Chemistry.

[14]  S. Umemura,et al.  Essential hypertension and 5' upstream core promoter region of human angiotensinogen gene. , 1997, Hypertension.

[15]  J. Jukema,et al.  Common C-to-T substitution at position -480 of the hepatic lipase promoter associated with a lowered lipase activity in coronary artery disease patients. , 1997, Arteriosclerosis, thrombosis, and vascular biology.

[16]  K. Bauer,et al.  Severe factor VII deficiency due to a mutation disrupting an Sp1 binding site in the factor VII promoter. , 1998, Blood.

[17]  P. Cheah,et al.  Human Gene Mutations , 1998, Human Genetics.

[18]  Kevin Marsh,et al.  A polymorphism that affects OCT-1 binding to the TNF promoter region is associated with severe malaria , 1999, Nature Genetics.

[19]  Gary D. Stormo,et al.  DNA binding sites: representation and discovery , 2000, Bioinform..

[20]  F. Cambien,et al.  New functional promoter polymorphism, CETP/-629, in cholesteryl ester transfer protein (CETP) gene related to CETP mass and high density lipoprotein cholesterol levels: role of Sp1/Sp3 in transcriptional regulation. , 2000, Arteriosclerosis, thrombosis, and vascular biology.

[21]  R. Desnick,et al.  Uroporphyrinogen III synthase erythroid promoter mutations in adjacent GATA1 and CP2 elements cause congenital erythropoietic porphyria. , 2001, The Journal of clinical investigation.

[22]  J. Koziol,et al.  Allele-dependent transcriptional regulation of the human integrin alpha2 gene. , 2001, Blood.

[23]  M. Nicolás,et al.  Transcriptional regulation of the human Sp1 gene promoter by the specificity protein (Sp) family members nuclear factor Y (NF-Y) and E2F. , 2003, The Biochemical journal.

[24]  Alexander E. Kel,et al.  TRANSFAC®: transcriptional regulation, from patterns to profiles , 2003, Nucleic Acids Res..

[25]  P. Stenson,et al.  Human Gene Mutation Database (HGMD®): 2003 update , 2003, Human mutation.

[26]  N. Sakamoto Thalassemia Caused by Disruption of the Site for an Erythroid-Specific Transcription Factor , GATA-1 , in the & Globin Gene Promoter , 2003 .

[27]  Y. Yoshikawa,et al.  Identification of I kappa BL as the second major histocompatibility complex-linked susceptibility locus for rheumatoid arthritis. , 2003, American journal of human genetics.

[28]  M. Cappellini,et al.  Human gene mutations. Gene symbol: FECH. Disease: Porphyria, erythropoietic. , 2004, Human genetics.

[29]  E. Milgrom,et al.  Hereditary persistence of alpha-fetoprotein is due to both proximal and distal hepatocyte nuclear factor-1 site mutations. , 2004, Gastroenterology.

[30]  Frank Grosveld,et al.  Spatial organization of gene expression: the active chromatin hub , 2003, Chromosome Research.

[31]  E. Milgrom,et al.  Hereditary persistence of α-fetoprotein is due to both proximal and distal hepatocyte nuclear factor-1 site mutations1 , 2004 .

[32]  A. Sandelin,et al.  Applied bioinformatics for the identification of regulatory elements , 2004, Nature Reviews Genetics.

[33]  A. Travers,et al.  The structural basis of DNA flexibility , 2004, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[34]  Wyeth W. Wasserman,et al.  JASPAR: an open-access database for eukaryotic transcription factor binding profiles , 2004, Nucleic Acids Res..

[35]  B. Ning,et al.  Identification of functional genetic variants in cyclooxygenase-2 and their association with risk of esophageal cancer. , 2005, Gastroenterology.

[36]  M. Passos-Bueno,et al.  A functional SNP in the promoter region of TCOF1 is associated with reduced gene expression and YY1 DNA-protein interaction. , 2005, Gene.

[37]  D. Menendez,et al.  A SNP in the flt-1 promoter integrates the VEGF system into the p53 transcriptional network , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[38]  K. Sleegers,et al.  Promoter mutations that increase amyloid precursor-protein expression are associated with Alzheimer disease. , 2006, American journal of human genetics.

[39]  A. Philippakis,et al.  Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities , 2006, Nature Biotechnology.

[40]  Jonas Wittwer,et al.  Functional polymorphism in ALOX15 results in increased allele‐specific transcription in macrophages through binding of the transcription factor SPI1 , 2006, Human mutation.

[41]  Alexander J. Hartemink,et al.  A Nucleosome-Guided Map of Transcription Factor Binding Sites in Yeast , 2007, PLoS Comput. Biol..

[42]  Lawrence Hunter,et al.  Mining Discriminative Distance Context of Transcription Factor Binding Sites on ChIP Enriched Regions , 2007, ISBRA.

[43]  Yusuke Nakamura,et al.  Functional SNP in an Sp1-binding site of AGTRL1 gene is associated with susceptibility to brain infarction. , 2007, Human molecular genetics.

[44]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[45]  H. Lähdesmäki,et al.  Probabilistic Inference of Transcription Factor Binding from Multiple Data Sources , 2008, PloS one.

[46]  Nicholas J. Schork,et al.  Predicting functional regulatory polymorphisms , 2008, Bioinform..

[47]  David J. Arenillas,et al.  In Silico Detection of Sequence Variations Modifying Transcriptional Regulation , 2007, PLoS Comput. Biol..

[48]  E. Segal,et al.  Predicting expression patterns from regulatory sequence in Drosophila segmentation , 2008, Nature.

[49]  H. Lähdesmäki,et al.  EFFECTS OF DISEASE-RELATED MUTATIONS ON TRANSCRIPTION FACTOR BINDING , 2008 .