DPRP: a database of phenotype-specific regulatory programs derived from transcription factor binding data

Gene expression profiling has been extensively used in the past decades, resulting in an enormous amount of expression data available in public databases. These data sets are informative in elucidating transcriptional regulation of genes underlying various biological and clinical conditions. However, it is usually difficult to identify transcription factors (TFs) responsible for gene expression changes directly from their own expression, as TF activity is often regulated at the posttranscriptional level. In recent years, technical advances have made it possible to systematically determine the target genes of TFs by ChIP-seq experiments. To identify the regulatory programs underlying gene expression profiles, we constructed a database of phenotype-specific regulatory programs (DPRP, http://syslab.nchu.edu.tw/DPRP/) derived from the integrative analysis of TF binding data and gene expression data. DPRP provides three methods: the Fisher’s Exact Test, the Kolmogorov–Smirnov test and the BASE algorithm to facilitate the application of gene expression data for generating new hypotheses on transcriptional regulatory programs in biological and clinical studies.

[1]  Wing-Kin Sung,et al.  Cellular reprogramming by the conjoint action of ERα, FOXA1, and GATA3 to a ligand-inducible growth state , 2011, Molecular systems biology.

[2]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[3]  Henry Horng-Shing Lu,et al.  Statistical methods for identifying yeast cell cycle transcription factors. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Fengzhu Sun,et al.  Inferring activity changes of transcription factors by binding association with sorted expression profiles , 2007, BMC Bioinform..

[5]  John J. Wyrick,et al.  Genome-wide location and function of DNA binding proteins. , 2000, Science.

[6]  Chao Cheng,et al.  REACTIN: Regulatory activity inference of transcription factors underlying human diseases with application to breast cancer , 2013, BMC Genomics.

[7]  Avi Ma'ayan,et al.  ChEA: transcription factor regulation inferred from integrating genome-wide ChIP-X experiments , 2010, Bioinform..

[8]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[9]  David Z. Chen,et al.  Architecture of the human regulatory network derived from ENCODE data , 2012, Nature.

[10]  Serdar E Bulun,et al.  Novel estrogen receptor-alpha binding sites and estradiol target genes identified by chromatin immunoprecipitation cloning in breast cancer. , 2007, Cancer research.

[11]  Mark Gerstein,et al.  Systematic identification of transcription factors associated with patient survival in cancers , 2009, BMC Genomics.

[12]  Siavoush Dastmalchi,et al.  Structural basis of simultaneous recruitment of the transcriptional regulators LMO2 and FOG1/ZFPM1 by the transcription factor GATA1 , 2011, Proceedings of the National Academy of Sciences.

[13]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[14]  C. Calkhoven,et al.  Multiple steps in the regulation of transcription-factor level and activity. , 1996, The Biochemical journal.

[15]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[16]  Tatiana A. Tatusova,et al.  NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy , 2011, Nucleic Acids Res..

[17]  G. Gill,et al.  Regulation of transcription factor activity by SUMO modification. , 2009, Methods in molecular biology.

[18]  Nicola J. Rinaldi,et al.  Transcriptional regulatory code of a eukaryotic genome , 2004, Nature.

[19]  Federica Toffalini,et al.  Transcription factor regulation can be accurately predicted from the presence of target gene signatures in microarray gene expression data , 2010, Nucleic acids research.

[20]  Mark Gerstein,et al.  TIP: A probabilistic method for identifying transcription factor target genes from ChIP-seq binding profiles , 2011, Bioinform..

[21]  J. Li,et al.  BCR-ABL/GATA1/miR-138 mini circuitry contributes to the leukemogenesis of chronic myeloid leukemia , 2014, Oncogene.

[22]  Michael Q. Zhang,et al.  ChIP-Array: combinatory analysis of ChIP-seq/chip and microarray gene expression data to discover direct/indirect targets of a transcription factor , 2011, Nucleic Acids Res..

[23]  T. Boulikas,et al.  Phosphorylation of transcription factors and control of the cell cycle. , 1995, Critical reviews in eukaryotic gene expression.

[24]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[25]  T. Barrette,et al.  Mining for regulatory programs in the cancer transcriptome , 2005, Nature Genetics.