Modularization of biochemical networks based on classification of Petri net t-invariants

BackgroundStructural analysis of biochemical networks is a growing field in bioinformatics and systems biology. The availability of an increasing amount of biological data from molecular biological networks promises a deeper understanding but confronts researchers with the problem of combinatorial explosion. The amount of qualitative network data is growing much faster than the amount of quantitative data, such as enzyme kinetics. In many cases it is even impossible to measure quantitative data because of limitations of experimental methods, or for ethical reasons. Thus, a huge amount of qualitative data, such as interaction data, is available, but it was not sufficiently used for modeling purposes, until now. New approaches have been developed, but the complexity of data often limits the application of many of the methods. Biochemical Petri nets make it possible to explore static and dynamic qualitative system properties. One Petri net approach is model validation based on the computation of the system's invariant properties, focusing on t-invariants. T-invariants correspond to subnetworks, which describe the basic system behavior.With increasing system complexity, the basic behavior can only be expressed by a huge number of t-invariants. According to our validation criteria for biochemical Petri nets, the necessary verification of the biological meaning, by interpreting each subnetwork (t-invariant) manually, is not possible anymore. Thus, an automated, biologically meaningful classification would be helpful in analyzing t-invariants, and supporting the understanding of the basic behavior of the considered biological system.MethodsHere, we introduce a new approach to automatically classify t-invariants to cope with network complexity. We apply clustering techniques such as UPGMA, Complete Linkage, Single Linkage, and Neighbor Joining in combination with different distance measures to get biologically meaningful clusters (t-clusters), which can be interpreted as modules. To find the optimal number of t-clusters to consider for interpretation, the cluster validity measure, Silhouette Width, is applied.ResultsWe considered two different case studies as examples: a small signal transduction pathway (pheromone response pathway in Saccharomyces cerevisiae) and a medium-sized gene regulatory network (gene regulation of Duchenne muscular dystrophy). We automatically classified the t-invariants into functionally distinct t-clusters, which could be interpreted biologically as functional modules in the network. We found differences in the suitability of the various distance measures as well as the clustering methods. In terms of a biologically meaningful classification of t-invariants, the best results are obtained using the Tanimoto distance measure. Considering clustering methods, the obtained results suggest that UPGMA and Complete Linkage are suitable for clustering t-invariants with respect to the biological interpretability.ConclusionWe propose a new approach for the biological classification of Petri net t-invariants based on cluster analysis. Due to the biologically meaningful data reduction and structuring of network processes, large sets of t-invariants can be evaluated, allowing for model validation of qualitative biochemical Petri nets. This approach can also be applied to elementary mode analysis.

[1]  B. Palsson,et al.  Theory for the systemic definition of metabolic pathways and their use in interpreting metabolic function from a pathway-oriented perspective. , 2000, Journal of theoretical biology.

[2]  Tadao Murata,et al.  Petri nets: Properties, analysis and applications , 1989, Proc. IEEE.

[3]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[4]  Stefan Schuster,et al.  Adenine and adenosine salvage pathways in erythrocytes and the role of S‐adenosylhomocysteine hydrolase , 2005, The FEBS journal.

[5]  Monika Heiner,et al.  Model Validation of Biological Pathways Using Petri Nets - Demonstrated for Apoptosis , 2003, CMSB.

[6]  Wolfgang Marwan,et al.  Reconstructing the regulatory network controlling commitment and sporulation in Physarum polycephalum based on hierarchical Petri Net modelling and simulation. , 2005, Journal of theoretical biology.

[7]  Bernhard O Palsson,et al.  Network-based analysis of metabolic regulation in the human red blood cell. , 2003, Journal of theoretical biology.

[8]  B. Palsson,et al.  The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[9]  J. A. Studier,et al.  A note on the neighbor-joining algorithm of Saitou and Nei. , 1988, Molecular biology and evolution.

[10]  W. Bentley,et al.  Stochastic kinetic analysis of the Escherichia coli stress circuit using σ32-targeted antisense , 2001 .

[11]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[12]  Gong-Xin Yu,et al.  Ruleminer: a Knowledge System for Supporting High-throughput Protein Function Annotations , 2004, J. Bioinform. Comput. Biol..

[13]  Monika Heiner,et al.  Application of Petri net based analysis techniques to signal transduction pathways , 2006, BMC Bioinformatics.

[14]  Horst Bunke,et al.  Validation indices for graph clustering , 2003, Pattern Recognit. Lett..

[15]  L. Hubert,et al.  Quadratic assignment as a general data analysis strategy. , 1976 .

[16]  Atsushi Doi,et al.  Biopathways representation and simulation on hybrid functional Petri net , 2003, Silico Biol..

[17]  S. Schuster,et al.  Determining Elementary Modes of Functioning in Biochemical Reaction Networks at Steady State , 1996 .

[18]  Falk Schreiber,et al.  Analysis of Biological Networks , 2008 .

[19]  Steffen Klamt,et al.  A methodology for the structural and functional analysis of signaling and regulatory networks , 2006, BMC Bioinformatics.

[20]  G. Church,et al.  Genome-Scale Metabolic Model of Helicobacter pylori 26695 , 2002, Journal of bacteriology.

[21]  Peter H. Starke,et al.  Analyse von Petri-Netz-Modellen , 1990, Leitfäden und Monographien der Informatik.

[22]  Tim Dwyer,et al.  WilmaScope - A 3D Graph Visualization System , 2004, Graph Drawing Software.

[23]  J. Dunn Well-Separated Clusters and Optimal Fuzzy Partitions , 1974 .

[24]  George Nagy,et al.  State of the art in pattern recognition , 1968 .

[25]  René David,et al.  Continuous and Hybrid Petri Nets , 1998, J. Circuits Syst. Comput..

[26]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[27]  Monika Heiner,et al.  Petri Net Based Model Validation in Systems Biology , 2004, ICATPN.

[28]  Ming Chen,et al.  Quantitative Petri net model of gene regulated metabolic networks in the cell , 2003, Silico Biol..

[29]  Pierre N. Robillard,et al.  Modeling and Simulation of Molecular Biology Systems Using Petri Nets: Modeling Goals of Various Approaches , 2004, J. Bioinform. Comput. Biol..

[30]  David K. Smith Theory of Linear and Integer Programming , 1987 .

[31]  Ina Koch,et al.  Petri net modelling of gene regulation of the Duchenne muscular dystrophy , 2008, Biosyst..

[32]  J P Mazat,et al.  Pathway classification of TCA cycle. , 2006, Systems biology.

[33]  J. Berstel,et al.  Context-free languages , 1993, SIGA.

[34]  Lee Bardwell,et al.  A walk-through of the yeast mating pheromone response pathway , 2004, Peptides.

[35]  Henrik G. Dohlman,et al.  Pheromone Signaling Mechanisms in Yeast: A Prototypical Sex Machine , 2004, Science.

[36]  Peter Radford,et al.  Petri Net Theory and the Modeling of Systems , 1982 .

[37]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Douglas B. Kell,et al.  Computational cluster validation in post-genomic data analysis , 2005, Bioinform..

[39]  Claudine Chaouiya,et al.  Petri net modelling of biological networks , 2007, Briefings Bioinform..

[40]  A. Gartel,et al.  Lost in transcription: p21 repression, mechanisms, and consequences. , 2005, Cancer research.

[41]  Frank Allgöwer,et al.  An Approach for Dividing Models of Biological Reaction Networks into Functional Units , 2003, Simul..

[42]  Denis Thieffry,et al.  Qualitative modelling of regulated metabolic pathways: application to the tryptophan biosynthesis in E.Coli , 2005, ECCB/JBI.

[43]  Hassane Alla,et al.  Discrete, continuous, and hybrid Petri Nets , 2004 .

[44]  Brian Everitt,et al.  Cluster analysis , 1974 .

[45]  An-Ping Zeng,et al.  Decomposition of metabolic network into functional modules based on the global connectivity structure of reaction graph , 2004, Bioinform..

[46]  Hiroshi Matsuno,et al.  Petri Net Based Descriptions for Systematic Understanding of Biological Pathways , 2006, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[47]  Richard Banks,et al.  Modelling and Analysing Genetic Networks: From Boolean Networks to Petri Nets , 2006, CMSB.