PAINT: a promoter analysis and interaction network generation tool for gene regulatory network identification.

We have developed a bioinformatics tool, PAINT, that automates promoter analysis of a given set of genes for the presence of transcription factor binding sites. Based on the coincidence of regulatory sites, the tool produces an interaction matrix that represents a candidate transcriptional regulatory network. PAINT currently consists of (1) a database of promoter sequences of known or predicted genes in the Ensembl annotated mouse genome database, (2) modules that retrieve the promoter sequences and process them for binding sites of known transcription factors, and (3) modules for visualization and analysis of the resulting set of candidate network connections. The output is a substantially pruned list of genes and transcription factors that can be examined in detail in further experimental studies of gene regulation. The candidate network can also be incorporated into network identification methods as constraints on feasible structures, which renders the algorithms tractable for large-scale systems. In addition, the tool can produce output in various formats suitable for external visualization and analysis software. In this manuscript, PAINT is demonstrated in two case studies involving analysis of differentially regulated genes chosen from two microarray data sets. The first set is from a neuroblastoma N1E-115 cell differentiation experiment, and the second is from neuroblastoma N1E-115 cells at different time intervals following exposure to the neuropeptide angiotensin II. PAINT is available as an agent in the BioSPICE simulation and analysis framework (www.biospice.org) and can also be accessed via a WWW interface at www.dbi.tju.edu/dbi/tools/paint/.
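The idea of an interaction matrix built from coinciding regulatory sites can be illustrated with a minimal sketch. It is not PAINT's implementation: the gene names, promoter sequences, and motif strings are hypothetical placeholders, and exact substring matching stands in for whatever binding-site detection the tool actually uses.

```python
# Toy sketch: build a gene-by-factor interaction matrix from promoter sequences.
# All names, sequences, and motifs are hypothetical illustrations, and the
# exact substring match is an assumption, not PAINT's actual method.

def build_interaction_matrix(promoters, motifs):
    """Return (genes, factors, matrix), where matrix[i][j] is 1 if the motif
    of factors[j] occurs in the promoter of genes[i], and 0 otherwise."""
    genes = sorted(promoters)
    factors = sorted(motifs)
    matrix = [
        [1 if motifs[tf] in promoters[gene].upper() else 0 for tf in factors]
        for gene in genes
    ]
    return genes, factors, matrix


def shared_factors(factors, matrix, min_genes=2):
    """List factors whose motif occurs in at least min_genes promoters."""
    return [
        tf
        for j, tf in enumerate(factors)
        if sum(row[j] for row in matrix) >= min_genes
    ]


# Hypothetical input: three short "promoters" and two made-up motif strings.
promoters = {
    "geneA": "ccTGACGTCAgg",
    "geneB": "ttAGATAAccTGACGTCA",
    "geneC": "acgtacgt",
}
motifs = {"factorX": "TGACGTCA", "factorY": "AGATAA"}

genes, factors, matrix = build_interaction_matrix(promoters, motifs)
common = shared_factors(factors, matrix)

for gene, row in zip(genes, matrix):
    print(gene, row)
print("factors shared by two or more genes:", common)
```

Each row of the matrix is a gene and each column a transcription factor, so a nonzero entry marks a candidate regulatory connection. Columns with several nonzero entries point to factors that may co-regulate a group of genes.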
