On the use of qualitative reasoning to simulate and identify metabolic pathway

MOTIVATION Perhaps the greatest challenge of modern biology is to develop accurate in silico models of cells. To do this we require computational formalisms for both simulation (how according to the model the state of the cell evolves over time) and identification (learning a model cell from observation of states). We propose the use of qualitative reasoning (QR) as a unified formalism for both tasks. The two most commonly used alternative methods of modelling biochemical pathways are ordinary differential equations (ODEs), and logical/graph-based (LG) models. RESULTS The QR formalism we use is an abstraction of ODEs. It enables the behaviour of many ODEs, with different functional forms and parameters, to be captured in a single QR model. QR has the advantage over LG models of explicitly including dynamics. To simulate biochemical pathways we have developed 'enzyme' and 'metabolite' QR building blocks that fit together to form models. These models are finite, directly executable, easy to interpret and robust. To identify QR models we have developed heuristic chemoinformatics graph analysis and machine learning procedures. The graph analysis procedure is a series of constraints and heuristics that limit the number of ways metabolites can combine to form pathways. The machine learning procedure is generate-and-test inductive logic programming. We illustrate the use of QR for modelling and simulation using the example of glycolysis. AVAILABILITY All data and programs used are available on request.

[1]  M C Mackey,et al.  Dynamic regulation of the tryptophan operon: a modeling study and comparison with experimental data. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Patrick J. Hayes,et al.  The Naive Physics Manifesto , 1990, The Philosophy of Artificial Intelligence.

[3]  Barbara M. Bakker,et al.  Glycolysis in Bloodstream Form Trypanosoma brucei Can Be Understood in Terms of the Kinetics of the Glycolytic Enzymes* , 1997, The Journal of Biological Chemistry.

[4]  Roland Somogyi,et al.  Modeling the complexity of genetic networks: Understanding multigenic and pleiotropic regulation , 1996, Complex..

[5]  Petre Stoica,et al.  Decentralized Control , 2018, The Control Systems Handbook.

[6]  Selahattin Kuru,et al.  Qualitative System Identification: Deriving Structure from Behavior , 1996, Artif. Intell..

[7]  W. Cleland,et al.  The kinetics of enzyme-catalyzed reactions with two or more substrates or products. II. Inhibition: nomenclature and theory. , 1963, Biochimica et biophysica acta.

[8]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[9]  B. Palsson,et al.  Saccharomyces cerevisiae phenotypes can be predicted by using constraint-based analysis of a genome-scale reconstructed metabolic network , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Satoru Miyano,et al.  Inferring qualitative relations in genetic networks and metabolic pathways , 2000, Bioinform..

[11]  Oliver Fiehn,et al.  Combining Genomics, Metabolome Analysis, and Biochemical Modelling to Understand Metabolic Networks , 2001, Comparative and functional genomics.

[12]  J E Ferrell,et al.  The biochemical basis of an all-or-none cell fate switch in Xenopus oocytes. , 1998, Science.

[13]  Johan de Kleer,et al.  Readings in qualitative reasoning about physical systems , 1990 .

[14]  U. Alon,et al.  Robustness in bacterial chemotaxis , 2022 .

[15]  Ross D. King,et al.  Learning Qualitative Metabolic Models , 2004, ECAI.

[16]  Robert King,et al.  Learning Qualitative Models in the Presence of Noise , 2002 .

[17]  Raúl E. Valdés-Pérez,et al.  Heuristics for systematic elucidation of reaction pathways , 1994, J. Chem. Inf. Comput. Sci..

[18]  Donald Michie,et al.  Expert systems in the micro-electronic age , 1979 .

[19]  Peter Struss,et al.  Qualitative Reasoning , 1997, The Computer Science and Engineering Handbook.

[20]  Arantxa Etxeverria The Origins of Order , 1993 .

[21]  Aviv Regev,et al.  Representation and Simulation of Biochemical Processes Using the pi-Calculus Process Algebra , 2000, Pacific Symposium on Biocomputing.

[22]  Peter D. Karp,et al.  Eco Cyc: encyclopedia of Escherichia coli genes and metabolism , 1999, Nucleic Acids Res..

[23]  Saso Dzeroski,et al.  Declarative Bias in Equation Discovery , 1997, ICML.

[24]  Steffen Schulze-Kremer,et al.  Design and implementation of a qualitative simulation model of lambda phage infection , 1998, Bioinform..

[25]  Steffen Schulze-Kremer,et al.  BioSim: A New Qualitative Simulation Environment for Molecular Biology , 1998, ISMB.

[26]  Ivan Bratko,et al.  Learning Qualitative Models of Dynamic Systems , 1994, ML.

[27]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[28]  H Matsuno,et al.  Hybrid Petri net representation of gene regulatory network. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[29]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[30]  J. Ross,et al.  A Test Case of Correlation Metric Construction of a Reaction Pathway from Measurements , 1997 .

[31]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[32]  Masaru Tomita,et al.  E-CELL: software environment for whole-cell simulation , 1999, Bioinform..

[33]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[34]  A. Cornish-Bowden,et al.  Prospects for Antiparasitic Drugs , 1998, The Journal of Biological Chemistry.

[35]  Christopher H. Bryant,et al.  Functional genomic hypothesis generation and experimentation by a robot scientist , 2004, Nature.

[36]  Anthony G. Cohn,et al.  Qualitative Reasoning , 1987, Advanced Topics in Artificial Intelligence.

[37]  Stuart A. Kauffman,et al.  The origins of order , 1993 .

[38]  John R. Koza,et al.  Reverse Engineering of Metabolic Pathways from Observed Data Using Genetic Programming , 2000, Pacific Symposium on Biocomputing.

[39]  Manuel C. Peitsch Membrane protein models , 1997 .

[40]  G. Odell,et al.  The segment polarity network is a robust developmental module , 2000, Nature.

[41]  Igor Goryanin,et al.  Mathematical simulation and analysis of cellular metabolism and regulation , 1999, Bioinform..

[42]  Stephen Muggleton,et al.  Developing a Logical Model of Yeast Metabolism , 2001, Electron. Trans. Artif. Intell..

[43]  P Mendes,et al.  Biochemistry by numbers: simulation of biochemical pathways with Gepasi 3. , 1997, Trends in biochemical sciences.

[44]  Michèle Sebag,et al.  Theta-Subsumption in a Constraint Satisfaction Perspective , 2001, ILP.