Genome-wide discovery of functional transcription factor binding sites by comparative genomics: The case of Stat3

The identification of direct targets of transcription factors is a key problem in the study of gene regulatory networks. However, the use of high throughput experimental methods, such as ChIP-chip and ChIP-sequencing, is limited by their high cost and strong dependence on cellular type and context. We developed a computational method for the genome-wide identification of functional transcription factor binding sites based on positional weight matrices, comparative genomics, and gene expression profiling. The method was applied to Stat3, a transcription factor playing crucial roles in inflammation, immunity and oncogenesis, and able to induce distinct subsets of target genes in different cell types or conditions. A newly generated positional weight matrix enabled us to assign affinity scores of high specificity, as measured by EMSA competition assays. Phylogenetic conservation with 7 vertebrate species was used to select the binding sites most likely to be functional. Validation was carried out on predicted sites within genes identified as differentially expressed in the presence or absence of Stat3 by microarray analysis. Twelve of the fourteen sites tested were bound by Stat3 in vivo, as assessed by Chromatin Immunoprecipitation, allowing us to identify 9 Stat3 transcriptional targets. Given its high validation rate, and the availability of large transcription factor-dependent gene expression datasets obtained under diverse experimental conditions, our approach appears to be a valid alternative to high-throughput experimental assays for the discovery of novel direct targets of transcription factors.

[1]  Olivier Elemento,et al.  Fast and systematic genome-wide discovery of conserved regulatory elements using a non-alignment based approach , 2005, Genome Biology.

[2]  J. Darnell,et al.  Independent and Cooperative Activation of Chromosomal c-fos Promoter by STAT3* , 2003, The Journal of Biological Chemistry.

[3]  C. Lawrence,et al.  Human-mouse genome comparisons to locate regulatory sites , 2000, Nature Genetics.

[4]  W. Deppert,et al.  Gadd45 beta is a pro-survival factor associated with stress-resistant tumors. , 2008, Oncogene.

[5]  N. Callewaert,et al.  P-selectin mediates metastatic progression through binding to sulfatides on tumor cells. , 2007, Glycobiology.

[6]  N. D. Clarke,et al.  Integration of External Signaling Pathways with the Core Transcriptional Network in Embryonic Stem Cells , 2008, Cell.

[7]  V. Poli,et al.  STAT3 Function In Vivo , 2003 .

[8]  O. Heidenreich,et al.  siRNA-mediated AML1/MTG8 depletion affects differentiation and proliferation-associated gene expression in t(8;21)-positive cell lines and primary AML blasts , 2006, Oncogene.

[9]  Christina Gewinner,et al.  Signal transducers and activators of transcription (STATs): Activation and Biology , 2003 .

[10]  J. Darnell,et al.  Stat3 as an Oncogene , 1999, Cell.

[11]  K. Bomsztyk,et al.  Protocol for the fast chromatin immunoprecipitation (ChIP) method , 2006, Nature Protocols.

[12]  W. Deppert,et al.  Gadd45β is a pro-survival factor associated with stress-resistant tumors , 2008, Oncogene.

[13]  B Calabretta,et al.  Overexpression of DR-nm23, a protein encoded by a member of the nm23 gene family, inhibits granulocyte differentiation and induces apoptosis in 32Dc13 myeloid cells. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Valeria Poli,et al.  Mutational switch of an IL-6 response to an interferon-γ-like response , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[15]  V. Poli,et al.  Essential Role of STAT3 in the Control of the Acute-Phase Response as Revealed by Inducible Gene Activation in the Liver , 2001, Molecular and Cellular Biology.

[16]  P. Pharoah,et al.  Sipa1 is a candidate for underlying the metastasis efficiency modifier locus Mtes1 , 2005, Nature Genetics.

[17]  J. Darnell,et al.  A STAT protein domain that determines DNA sequence recognition suggests a novel DNA-binding domain. , 1995, Genes & development.

[18]  K. Alitalo,et al.  Inhibition of lymphogenous metastasis using adeno-associated virus-mediated gene transfer of a soluble VEGFR-3 decoy receptor. , 2005, Cancer research.

[19]  Hua Yu,et al.  Targeting STAT3 affects melanoma on multiple fronts , 2005, Cancer and Metastasis Reviews.

[20]  D. Levy,et al.  What does Stat3 do? , 2002, The Journal of clinical investigation.

[21]  Valeria Poli,et al.  Mutational switch of an IL-6 response to an interferon-gamma-like response. , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[22]  J. Lammers,et al.  STAT3β, a Splice Variant of Transcription Factor STAT3, Is a Dominant Negative Regulator of Transcription* , 1996, The Journal of Biological Chemistry.

[23]  Philippe Collas,et al.  Chop it, ChIP it, check it: the current status of chromatin immunoprecipitation. , 2008, Frontiers in bioscience : a journal and virtual library.

[24]  J. C. Barrett,et al.  CD82 metastasis suppressor gene: a potential target for new therapeutics? , 2005, Trends in molecular medicine.

[25]  J. Tonn,et al.  Expression of VEGFR3 in glioma endothelium correlates with tumor grade , 2007, Journal of Neuro-Oncology.

[26]  Gary D. Stormo,et al.  DNA binding sites: representation and discovery , 2000, Bioinform..

[27]  William Stafford Noble,et al.  Assessing computational tools for the discovery of transcription factor binding sites , 2005, Nature Biotechnology.

[28]  Gabriella,et al.  Essential role of STAT3 in the control of the acute-phase response as revealed by inducible gene inactivation [correction of activation] in the liver. , 2001, Molecular and cellular biology.

[29]  I. Cowell E4BP4/NFIL3, a PAR‐related bZIP factor with many roles , 2002, BioEssays : news and reviews in molecular, cellular and developmental biology.

[30]  Xin-Yun Huang,et al.  Identification of Novel Direct Stat3 Target Genes for Control of Growth and Differentiation* , 2008, Journal of Biological Chemistry.

[31]  K. Huebner,et al.  Gene structure, promoter activity, and chromosomal location of the DR-nm23 gene, a related member of the nm23 gene family. , 1997, Cancer research.

[32]  D. Duda,et al.  In vivo evaluation of the early events associated with liver metastasis of circulating cancer cells , 2001, British Journal of Cancer.

[33]  M. Campone,et al.  Prognostic impact of syndecan-1 expression in invasive ductal breast carcinomas , 2008, British Journal of Cancer.

[34]  S. Dewilde,et al.  The STAT3 isoforms alpha and beta have unique and specific functions. , 2004, Nature immunology.

[35]  Wilfred W. Li,et al.  MEME: discovering and analyzing DNA and protein sequence motifs , 2006, Nucleic Acids Res..

[36]  D. Levy,et al.  JAK-STAT Signaling: From Interferons to Cytokines* , 2007, Journal of Biological Chemistry.