Towards Reverse Engineering of Genetic Regulatory Networks

The major goal of computational biology is to derive regulatory interactions between genes from large-scale gene expression data and other biological sources. There have been many attempts to reach this goal, but the field needs more research before we can claim that we have reached a complete understanding of reverse engineering of regulatory networks. One of the aspects that have not been considered to a great extent in the development of reverse engineering approaches is combinatorial regulation. Combinatorial regulation can be obtained by the presence of modular architectures in regulation, where multiple binding sites for multiple transcription factors are combined into modular units. When modelling regulatory networks, genes are often considered as "black boxes", where gene expression level is an input signal and changed level of expression is the output. We need to shed light on reverse engineering of regulatory networks by modelling the gene "boxes" at a more detailed level of information, e.g., by using regulatory elements as input to gene boxes as a complement to expression levels. Another problem in the context of inferring regulatory networks is the difficulty of validating inferred interactions because it is practically impossible to test and experimentally confirm hundreds to thousands of predicted interactions. Therefore, we need to develop an artificial network to evaluate the developed method for reverse engineering. One of the major research questions that will be proposed in this work is: Can we reverse engineer the cis-regulatory logic controlling the network organised by modular units? This work is aiming to give an overview of possible research directions in this field as well as the chosen direction for the future work where more research is needed. It also gives a theoretical foundation for the reverse engineering problem, where key aspects are reviewed.

[1]  G. Church,et al.  Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. , 2000, Journal of molecular biology.

[2]  T. Werner The promoter connection , 2001, Nature Genetics.

[3]  Stephanie Forrest,et al.  Reconstructing gene networks from large scale gene expression data , 2000 .

[4]  L. Hood,et al.  A Genomic Regulatory Network for Development , 2002, Science.

[5]  D. Eisenberg,et al.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[6]  V. Anne Smith,et al.  Evaluating functional network inference using simulations of complex biological systems , 2002, ISMB.

[7]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[8]  S. Burley,et al.  RNA polymerase II transcription initiation: a structural view. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Michael Q. Zhang,et al.  SCPD: a promoter database of the yeast Saccharomyces cerevisiae , 1999, Bioinform..

[10]  Hans-Werner Mewes,et al.  Interpreting Clusters of Gene Expression Profiles in Terms of Metabolic Pathways , 1999, German Conference on Bioinformatics.

[11]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[12]  N. Socci,et al.  Leptin-specific patterns of gene expression in white adipose tissue. , 2000, Genes & development.

[13]  Michael Schroeder,et al.  Application of Regulatory Sequence Analysis and Metabolic Network Analysis to the Interpretation of Gene Expression Data , 2000, JOBIM.

[14]  Basehoar Ad Extracting knowledge from dynamics in gene expression. , 2001 .

[15]  D. Latchman Gene Regulation: A Eukaryotic Perspective , 1990 .

[16]  Kelvin H. Lee,et al.  Dynamical analysis of gene networks requires both mRNA and protein expression information. , 1999, Metabolic engineering.

[17]  I. Jonassen,et al.  Predicting gene regulatory elements in silico on a genomic scale. , 1998, Genome research.

[18]  Saurabh Sinha,et al.  A Statistical Method for Finding Transcription Factor Binding Sites , 2000, ISMB.

[19]  M. Q. Zhang Large-scale gene expression data analysis: a new challenge to computational biologists. , 1999, Genome research.

[20]  A. Griffiths Introduction to Genetic Analysis , 1976 .

[21]  J. Fickett,et al.  Discovery and modeling of transcriptional regulatory regions. , 2000, Current opinion in biotechnology.

[22]  Alexander E. Kel,et al.  Automatic Annotation of Genomic Regulatory Sequences by Searching for Composite Clusters , 2001, Pacific Symposium on Biocomputing.

[23]  G. Church,et al.  Identifying regulatory networks by combinatorial analysis of promoter elements , 2001, Nature Genetics.

[24]  G. Rubin,et al.  Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[25]  R. Altman,et al.  Whole-genome expression analysis: challenges beyond clustering. , 2001, Current opinion in structural biology.

[26]  G. Church,et al.  Predicting regulons and their cis-regulatory motifs by comparative genomics. , 2000, Nucleic acids research.

[27]  Pierre Baldi,et al.  The Biology of Eukaryotic Promoter Prediction - A Review , 1999, Comput. Chem..

[28]  Roland Somogyi,et al.  Modeling the complexity of genetic networks: Understanding multigenic and pleiotropic regulation , 1996, Complex..

[29]  Eric H Davidson,et al.  New computational approaches for analysis of cis-regulatory networks. , 2002, Developmental biology.

[30]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Andreas Wagner,et al.  How to reconstruct a large genetic network from n gene perturbations in fewer than n2 easy steps , 2001, Bioinform..

[32]  E. Davidson,et al.  Genomic cis-regulatory logic: experimental and computational analysis of a sea urchin gene. , 1998, Science.

[33]  Laurie J. Heyer,et al.  Exploring expression data: identification and analysis of coexpressed genes. , 1999, Genome research.

[34]  Andreas Wagner,et al.  Genes regulated cooperatively by one or more transcription factors and their identification in whole eukaryotic genomes , 1999, Bioinform..

[35]  D. Latchman Gene regulation. , 1992, British medical journal.

[36]  P. D’haeseleer,et al.  Mining the gene expression matrix: inferring gene relationships from large scale gene expression data , 1998 .

[37]  Mitchell Kb,et al.  Web references , 2007, Ship and Mobile Offshore Unit Automation.

[38]  M. Q. Zhang,et al.  Cluster, function and promoter: analysis of yeast expression array. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[39]  Michael B. Eisen,et al.  Visualizing associations between genome sequences and gene expression data using genome-mean expression profiles , 2001, ISMB.

[40]  M. Gerstein,et al.  The current excitement in bioinformatics-analysis of whole-genome expression data: how does it relate to protein structure and function? , 2000, Current opinion in structural biology.

[41]  G. von Dassow,et al.  Modularity in animal development and evolution: elements of a conceptual framework for EvoDevo. , 1999, The Journal of experimental zoology.

[42]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[43]  Thomas Werner,et al.  Regulatory modules shared within gene classes as well as across gene classes can be detected by the same in silico approach , 2000, Silico Biol..

[44]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.