Functional annotation of regulatory pathways

MOTIVATION: Standardized annotations of biomolecules in interaction networks (e.g. Gene Ontology) provide a comprehensive understanding of the function of individual molecules. Extending such annotations to pathways is a critical component of the functional characterization of cellular signaling at the systems level.

RESULTS: We propose a framework for projecting gene regulatory networks onto the space of functional attributes using multigraph models, with the objective of deriving statistically significant pathway annotations. We first demonstrate that annotations of pairwise interactions do not generalize to indirect relationships between processes. Motivated by this result, we formalize the problem of identifying statistically overrepresented pathways of functional attributes. We establish the hardness of this problem by demonstrating the non-monotonicity of common statistical significance measures. We propose a statistical model that emphasizes the modularity of a pathway, evaluating its significance based on the coupling of its building blocks. We complement the statistical model with an efficient algorithm and software, Narada, for computing significant pathways in large regulatory networks. Comprehensive results from our methods applied to the Escherichia coli transcription network demonstrate that our approach is effective in identifying known as well as novel biological pathway annotations.

AVAILABILITY: Narada is implemented in Java and is available at http://www.cs.purdue.edu/homes/jpandey/narada/.
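The projection idea in RESULTS can be illustrated with a minimal Python sketch. It is not Narada's algorithm and computes no significance scores; the toy network, the gene names `g1` to `g6`, the terms `A` to `D`, and the helpers `project_edges` and `count_term_paths` are all hypothetical. It only shows why pairwise annotations need not extend to indirect relationships: the term-level multigraph contains both A→B and B→C, yet no gene-level chain realizes A→B→C.

```python
from collections import Counter

# Hypothetical toy data (not from the paper): regulator -> target edges
# and a gene -> functional-term annotation map.
edges = [("g1", "g2"), ("g3", "g4"), ("g1", "g5"), ("g5", "g6")]
annotations = {
    "g1": {"A"}, "g2": {"B"}, "g3": {"B"},
    "g4": {"C"}, "g5": {"B"}, "g6": {"D"},
}

def project_edges(edges, annotations):
    """Project gene edges onto terms as a multigraph.

    The multiplicity of (s, t) is the number of gene edges u -> v
    such that u is annotated with s and v is annotated with t.
    """
    multi = Counter()
    for u, v in edges:
        for s in annotations.get(u, ()):
            for t in annotations.get(v, ()):
                multi[(s, t)] += 1
    return multi

def count_term_paths(edges, annotations, terms):
    """Count gene-level walks whose i-th gene carries terms[i]."""
    successors = {}
    for u, v in edges:
        successors.setdefault(u, []).append(v)
    # frontier maps a gene to the number of partial walks ending there
    frontier = Counter({g: 1 for g, a in annotations.items() if terms[0] in a})
    for term in terms[1:]:
        nxt = Counter()
        for gene, n in frontier.items():
            for target in successors.get(gene, ()):
                if term in annotations.get(target, ()):
                    nxt[target] += n
        frontier = nxt
    return sum(frontier.values())

multigraph = project_edges(edges, annotations)
print(multigraph[("A", "B")], multigraph[("B", "C")])
print(count_term_paths(edges, annotations, ["A", "B", "C"]))
print(count_term_paths(edges, annotations, ["A", "B", "D"]))
```

In this toy network the A→B and B→C edges come from different B-annotated genes, so chaining pairwise term edges would suggest an A→B→C pathway that no sequence of genes supports. Counting at the gene level, as `count_term_paths` does, avoids that; assessing whether such a count is statistically overrepresented is the separate problem the abstract addresses.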
