A Review of Integration Strategies to Support Gene Regulatory Network Construction

Gene regulatory network (GRN) construction is a central task of systems biology. Integration of different data sources to infer and construct GRNs is an important consideration for the success of this effort. In this paper, we will discuss distinctive strategies of data integration for GRN construction. Basically, the process of integration of different data sources is divided into two phases: the first phase is collection of the required data and the second phase is data processing with advanced algorithms to infer the GRNs. In this paper these two phases are called “structural integration” and “analytic integration,” respectively. Compared with the nonintegration strategies, the integration strategies perform quite well and have better agreement with the experimental evidence.

[1]  C. DiPersio,et al.  Site-directed mutagenesis reveals a liver transcription factor essential for the albumin transcriptional enhancer. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[2]  B. Snel,et al.  STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene. , 2000, Nucleic acids research.

[3]  Eric H Davidson,et al.  The last common bilaterian ancestor. , 2002, Development.

[4]  L. Hood,et al.  A Genomic Regulatory Network for Development , 2002, Science.

[5]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[6]  Nicola J. Rinaldi,et al.  Computational discovery of gene modules and regulatory networks , 2003, Nature Biotechnology.

[7]  D. Pe’er,et al.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data , 2003, Nature Genetics.

[8]  A. Owen,et al.  A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae) , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Carlos Alberto Heuser,et al.  Integrating Biological Databases , 2003, SBBD.

[10]  Christian von Mering,et al.  STRING: a database of predicted functional associations between proteins , 2003, Nucleic Acids Res..

[11]  Subbarao Kambhampati,et al.  Integration of biological sources: current systems and challenges ahead , 2004, SGMD.

[12]  Graziano Pesole,et al.  Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes , 2004, Nucleic Acids Res..

[13]  Tsuyoshi Kato,et al.  Selective integration of multiple biological data for supervised network inference , 2005, Bioinform..

[14]  E. Davidson,et al.  Gene regulatory networks for development. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Christian von Mering,et al.  STRING: known and predicted protein–protein associations, integrated and transferred across organisms , 2004, Nucleic Acids Res..

[16]  G. Pavesi,et al.  Using Weeder for the Discovery of Conserved Transcription Factor Binding Sites , 2006, Current protocols in bioinformatics.

[17]  José Luís Oliveira,et al.  Integrating Medical and Genomic Data: a Sucessful Example for Rare Diseases , 2006, MIE.

[18]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[19]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[20]  Christian von Mering,et al.  STRING 7—recent developments in the integration and prediction of protein interactions , 2006, Nucleic Acids Res..

[21]  Eric H Davidson,et al.  A Gene Regulatory Network Subcircuit Drives a Dynamic Pattern of Gene Expression , 2007, Science.

[22]  David Haussler,et al.  The UCSC genome browser database: update 2007 , 2006, Nucleic Acids Res..

[23]  Eric H Davidson,et al.  Properties of developmental gene regulatory networks , 2008, Proceedings of the National Academy of Sciences.

[24]  Insuk Lee,et al.  Rational Extension of the Ribosome Biogenesis Pathway Using Network-Guided Genetics , 2009, PLoS biology.

[25]  José Luís Oliveira,et al.  GeNS: a Biological Data Integration Platform , 2009 .

[26]  Charles Elkan,et al.  Learning gene regulatory networks from only positive and unlabeled data , 2010, BMC Bioinformatics.

[27]  Michele Ceccarelli,et al.  articleTimeDelay-ARACNE : Reverse engineering of gene networks from time-course data by an information theoretic approach , 2010 .

[28]  Christian von Mering,et al.  STRING 8—a global view on proteins and their functional interactions in 630 organisms , 2008, Nucleic Acids Res..

[29]  Eric H Davidson,et al.  Building developmental gene regulatory networks. , 2009, Birth defects research. Part C, Embryo today : reviews.

[30]  Richard Bonneau,et al.  DREAM3: Network Inference Using Dynamic Context Likelihood of Relatedness and the Inferelator , 2010, PloS one.

[31]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[32]  G. Woodfield,et al.  Discovery of SMAD4 promoters, transcription factor binding sites and deletions in juvenile polyposis patients , 2011, Nucleic acids research.

[33]  David Z. Chen,et al.  Architecture of the human regulatory network derived from ENCODE data , 2012, Nature.